State whether the following statements are True or False. Give reason in support of your answer: (5xx2=10)(5 \times 2=10)
(a) If the probability of non rejection of H_(0)\mathrm{H}_0 when H_(1)\mathrm{H}_1 is true is 0.4 then power of the test will be 0.6 .
(b) If T_(1)T_1 and T_(2)T_2 are two estimators of the parameter theta\theta such that Var(T_(1))=1//n\operatorname{Var}\left(T_1\right)=1 / \mathrm{n} and Var(T_(2))=n\operatorname{Var}\left(\mathrm{T}_2\right)=\mathrm{n} then T_(1)\mathrm{T}_1 is more efficient than T_(2)\mathrm{T}_2.
(c) A 95%95 \% confidence interval is smaller than 99%99 \% confidence interval.
(d) If the level of significance is the same, the area of the rejection region in a two-tailed test is less than that in a one-tailed test.
(e) Non parametric tests are more powerful than the parametric tests.
If a finite population has four elements: 6,1,3,26,1,3,2.
(a) How many different samples of size n=2\mathrm{n}=2 can be selected from this population if you sample without replacement?
(b) List all possible samples of size n=2\mathrm{n}=2.
(c) Compute the sample mean for each of the samples given in part bb.
(d) Find the sampling distribution of bar(x)\overline{\mathrm{x}} and draw the histogram.
(e) Compute standard error.
(f) If all four population values are equally likely, calculate the value of the population mean ^(mu){ }^\mu. Do any of the samples listed in part (b) produce a value of bar(x)\bar{x} exactly equal to mu_(“? “)\mu_{\text {? }}
A study was conducted to compare the mean numbers of police emergency calls per 8 -hour shift in two districts of a large city. Samples of 1008 -hour shifts were randomly selected from the police records for each of the two regions and the number of emergency calls was recorded for each shift. The sample statistics are listed here:
Find a 90%90 \% confidence interval for the difference in the mean numbers of police emergency calls per shift between the two districts of the city. Interpret the interval.
A bond proposal for school construction will be submitted to the voters at the next municipal election. A major portion of the money derived form this bond issue will be used to build schools in a rapidly developing selection of the city, and the remainder will be used to renovate and update school buildings in the rest of the city. To assess the viability of the bond proposal, a random sample of n_(1)=50\mathrm{n}_1=50 residents in the developing section and n_(2)=100\mathrm{n}_2=100 residents from the other parts of the city were asked whether they plan to vote for the proposal. The results are tabulated below
Sample Values for Opinion on Bond Proposal
Developing Section
Rest of the City
Sample size
50
100
Number favoring
proposal
Number favoring
proposal| Number favoring |
| :— |
| proposal |
38
65
Developing Section Rest of the City
Sample size 50 100
“Number favoring
proposal” 38 65| | Developing Section | Rest of the City |
| :— | :— | :— |
| Sample size | 50 | 100 |
| Number favoring <br> proposal | 38 | 65 |
(a) Estimate the difference in the true proportions favoring the bond proposal with a 99%99 \% confidence interval.
(b) If both samples were pooled into one sample of size n=150\mathrm{n}=150, with 103 in favor of the proposal, provide a point estimate of the proportion of city residents who will vote for the bond proposal.
5. The following data relate to the number of items produced per shift by two workers for a number of days:
Can it be inferred that Worker A is more stable worker compared to B by testing the variation in the item produced by them at 5%5 \% level of significance.
If magnitude of earthquakes recorded in a region of a country follows a distribution with parameter mu\mu whose pdf is given below:
then show that the estimators of the parameter mu\mu using maximum likelihood and method of moments are same.
7. A company plans to promote a new product by using one of three advertising campaigns. To investigate the extent of product recognition from these three campaigns, 15 market areas were selected and five were randomly assigned to each advertising plan. At the end of the ad campaigns, random samples of 400 adults were selected in each area and the proportions who were familiar with new product were recorded. The responses were not approximately normal. Is there a significant difference among the three population distributions from which these samples came? Use an appropriate nonparametric method to answer this question at 5%5 \% level of significance.
(15)
8. A psychology class performed an experiment to determine whether a recall score in which instructions to form images of 25 words were given differs from an initial recall score for which no imagery instructions were given. Twenty students participated in the experiment with the results listed in the table:
(a) What two testing procedures can be used to test for differences in the distribution of recall scores with and without imagery? What assumptions are required for the parametric procedure? Do these data satisfy these assumptions?
(b) Use both the parametric and non-parametric tests for differences in the distributions of recall scores under these two conditions.
(c) Compare the results of the tests in part b. Are the conclusions for same? If not, why not?
Solution
Question:-1
State whether the following statements are True or False. Give reason in support of your answer:
(a) If the probability of non rejection of H_(0)\mathrm{H}_0 when H_(1)\mathrm{H}_1 is true is 0.4 then power of the test will be 0.6.
(b) If T_(1)T_1 and T_(2)T_2 are two estimators of the parameter theta\theta such that Var(T_(1))=1//n\operatorname{Var}\left(T_1\right)=1 / \mathrm{n} and Var(T_(2))=n\operatorname{Var}\left(\mathrm{T}_2\right)=\mathrm{n} then T_(1)\mathrm{T}_1 is more efficient than T_(2)\mathrm{T}_2.
(c) A 95%95 \% confidence interval is smaller than 99%99 \% confidence interval.
(d) If the level of significance is the same, the area of the rejection region in a two-tailed test is less than that in a one-tailed test.
(e) Non parametric tests are more powerful than the parametric tests.
Answer:
(a)Statement: If the probability of non-rejection of H_(0)H_0 when H_(1)H_1 is true is 0.4, then the power of the test will be 0.6.
Evaluation: True.
Justification: The power of a test is the probability of rejecting H_(0)H_0 when H_(1)H_1 is true, i.e., “Power”=P(“Reject “H_(0)∣H_(1)” true”)\text{Power} = P(\text{Reject } H_0 \mid H_1 \text{ true}). The probability of non-rejection of H_(0)H_0 when H_(1)H_1 is true is P(“Fail to reject “H_(0)∣H_(1)” true”)=1-“Power”P(\text{Fail to reject } H_0 \mid H_1 \text{ true}) = 1 – \text{Power}. Given this probability is 0.4, we have 1-“Power”=0.41 – \text{Power} = 0.4, so “Power”=1-0.4=0.6\text{Power} = 1 – 0.4 = 0.6.
(b)Statement: If T_(1)T_1 and T_(2)T_2 are two estimators of the parameter theta\theta such that Var(T_(1))=1//n\operatorname{Var}(T_1) = 1/n and Var(T_(2))=n\operatorname{Var}(T_2) = n, then T_(1)T_1 is more efficient than T_(2)T_2.
Evaluation: True, assuming both estimators are unbiased.
Justification: The efficiency of an estimator (for unbiased estimators) is determined by its variance; a lower variance indicates a more efficient estimator. Since Var(T_(1))=1//n\operatorname{Var}(T_1) = 1/n and Var(T_(2))=n\operatorname{Var}(T_2) = n, and assuming n > 0n > 0, we have 1//n < n1/n < n for n > 1n > 1 (and even for 0 < n < 10 < n < 1, 1//n > n1/n > n, but variance is typically positive and context suggests nn is a sample size, so n >= 1n \geq 1). Thus, Var(T_(1)) < Var(T_(2))\operatorname{Var}(T_1) < \operatorname{Var}(T_2), so T_(1)T_1 is more efficient. If the estimators are biased, efficiency comparisons require mean squared error, but the context suggests unbiasedness is assumed.
(c)Statement: A 95% confidence interval is smaller than a 99% confidence interval.
Evaluation: True.
Justification: The width of a confidence interval is proportional to the critical value from the standard normal or t-distribution. For a 95% confidence interval, the critical value (e.g., z~~1.96z \approx 1.96 for large samples) is smaller than for a 99% confidence interval (e.g., z~~2.576z \approx 2.576). Since the width is typically of the form “estimate”+-z*”standard error”\text{estimate} \pm z \cdot \text{standard error}, a smaller zz results in a narrower interval, assuming the same standard error and sample size.
(d)Statement: If the level of significance is the same, the area of the rejection region in a two-tailed test is less than that in a one-tailed test.
Evaluation: False.
Justification: The level of significance alpha\alpha represents the total probability of the rejection region. In a one-tailed test, the entire rejection region is in one tail, so the area is alpha\alpha. In a two-tailed test, the rejection region is split between both tails, with each tail having area alpha//2\alpha/2, so the total area is still alpha\alpha. Thus, the total area of the rejection region is the same in both tests, not less in the two-tailed test. For example, if alpha=0.05\alpha = 0.05, a one-tailed test has a rejection region of 0.05 in one tail, while a two-tailed test has 0.025 in each tail, totaling 0.05.
(e)Statement: Non-parametric tests are more powerful than parametric tests.
Evaluation: False.
Justification: Non-parametric tests are generally less powerful than parametric tests when the assumptions of the parametric tests (e.g., normality, equal variances) are met. Parametric tests use specific distributional assumptions to model the data, leading to higher power for detecting true effects. Non-parametric tests, which do not assume a specific distribution, are more robust but sacrifice power. For example, the Mann-Whitney U test (non-parametric) is less powerful than the t-test (parametric) for normally distributed data with equal variances. However, non-parametric tests can be more powerful when parametric assumptions are violated, but the statement is absolute and does not account for this condition.