Sample Solution

MST-004 Solved Assignment 2025

  1. State whether the following statements are True or False. Give reason in support of your answer:
    ( 5 × 2 = 10 ) ( 5 × 2 = 10 ) (5xx2=10)(5 \times 2=10)(5×2=10)
    (a) If the probability of non rejection of H 0 H 0 H_(0)\mathrm{H}_0H0 when H 1 H 1 H_(1)\mathrm{H}_1H1 is true is 0.4 then power of the test will be 0.6 .
    (b) If T 1 T 1 T_(1)T_1T1 and T 2 T 2 T_(2)T_2T2 are two estimators of the parameter θ θ theta\thetaθ such that Var ( T 1 ) = 1 / n Var T 1 = 1 / n Var(T_(1))=1//n\operatorname{Var}\left(T_1\right)=1 / \mathrm{n}Var(T1)=1/n and Var ( T 2 ) = n Var T 2 = n Var(T_(2))=n\operatorname{Var}\left(\mathrm{T}_2\right)=\mathrm{n}Var(T2)=n then T 1 T 1 T_(1)\mathrm{T}_1T1 is more efficient than T 2 T 2 T_(2)\mathrm{T}_2T2.
    (c) A 95 % 95 % 95%95 \%95% confidence interval is smaller than 99 % 99 % 99%99 \%99% confidence interval.
    (d) If the level of significance is the same, the area of the rejection region in a two-tailed test is less than that in a one-tailed test.
    (e) Non parametric tests are more powerful than the parametric tests.
  2. If a finite population has four elements: 6 , 1 , 3 , 2 6 , 1 , 3 , 2 6,1,3,26,1,3,26,1,3,2.
    (a) How many different samples of size n = 2 n = 2 n=2\mathrm{n}=2n=2 can be selected from this population if you sample without replacement?
    (b) List all possible samples of size n = 2 n = 2 n=2\mathrm{n}=2n=2.
    (c) Compute the sample mean for each of the samples given in part b b bbb.
    (d) Find the sampling distribution of x x ¯ bar(x)\overline{\mathrm{x}}x and draw the histogram.
    (e) Compute standard error.
    (f) If all four population values are equally likely, calculate the value of the population mean μ μ ^(mu){ }^\muμ. Do any of the samples listed in part (b) produce a value of x ¯ x ¯ bar(x)\bar{x}x¯ exactly equal to μ ? μ ? mu_(“? “)\mu_{\text {? }}μ?
  3. A study was conducted to compare the mean numbers of police emergency calls per 8 -hour shift in two districts of a large city. Samples of 1008 -hour shifts were randomly selected from the police records for each of the two regions and the number of emergency calls was recorded for each shift. The sample statistics are listed here:
Region
1 1 1\mathbf{1}1 2 2 2\mathbf{2}2
Sample size 100 100
Sample mean 2.4 3.1
Sample variance 1.44 2.64
Region 1 2 Sample size 100 100 Sample mean 2.4 3.1 Sample variance 1.44 2.64| | Region | | | :— | :— | :— | | | $\mathbf{1}$ | $\mathbf{2}$ | | Sample size | 100 | 100 | | Sample mean | 2.4 | 3.1 | | Sample variance | 1.44 | 2.64 |
Find a 90 % 90 % 90%90 \%90% confidence interval for the difference in the mean numbers of police emergency calls per shift between the two districts of the city. Interpret the interval.
  1. A bond proposal for school construction will be submitted to the voters at the next municipal election. A major portion of the money derived form this bond issue will be used to build schools in a rapidly developing selection of the city, and the remainder will be used to renovate and update school buildings in the rest of the city. To assess the viability of the bond proposal, a random sample of n 1 = 50 n 1 = 50 n_(1)=50\mathrm{n}_1=50n1=50 residents in the developing section and n 2 = 100 n 2 = 100 n_(2)=100\mathrm{n}_2=100n2=100 residents from the other parts of the city were asked whether they plan to vote for the proposal. The results are tabulated below
Sample Values for Opinion on Bond Proposal
Developing Section Rest of the City
Sample size 50 100
Number favoring
proposal
Number favoring proposal| Number favoring | | :— | | proposal |
38 65
Developing Section Rest of the City Sample size 50 100 “Number favoring proposal” 38 65| | Developing Section | Rest of the City | | :— | :— | :— | | Sample size | 50 | 100 | | Number favoring <br> proposal | 38 | 65 |
(a) Estimate the difference in the true proportions favoring the bond proposal with a 99 % 99 % 99%99 \%99% confidence interval.
(b) If both samples were pooled into one sample of size n = 150 n = 150 n=150\mathrm{n}=150n=150, with 103 in favor of the proposal, provide a point estimate of the proportion of city residents who will vote for the bond proposal.
5. The following data relate to the number of items produced per shift by two workers for a number of days:
Worker A 19 22 24 27 24 18 20 19 25
Worker B 26 37 40 35 30 40 26 30 35 45
Worker A 19 22 24 27 24 18 20 19 25 Worker B 26 37 40 35 30 40 26 30 35 45| Worker A | 19 | 22 | 24 | 27 | 24 | 18 | 20 | 19 | 25 | | | :— | :— | :— | :— | :— | :— | :— | :— | :— | :— | :— | | Worker B | 26 | 37 | 40 | 35 | 30 | 40 | 26 | 30 | 35 | 45 |
Can it be inferred that Worker A is more stable worker compared to B by testing the variation in the item produced by them at 5 % 5 % 5%5 \%5% level of significance.
  1. If magnitude of earthquakes recorded in a region of a country follows a distribution with parameter μ μ mu\muμ whose pdf is given below:
f ( x ) = 1 2 π e 1 2 ( x μ ) 2 , < x , μ < f ( x ) = 1 2 π e 1 2 ( x μ ) 2 , < x , μ < f(x)=(1)/(sqrt(2pi))e^(-(1)/(2)(x-mu)^(2)),-oo < x,mu < oof(x)=\frac{1}{\sqrt{2 \pi}} e^{-\frac{1}{2}(x-\mu)^2},-\infty<x, \mu<\inftyf(x)=12πe12(xμ)2,<x,μ<
then show that the estimators of the parameter μ μ mu\muμ using maximum likelihood and method of moments are same.
7. A company plans to promote a new product by using one of three advertising campaigns. To investigate the extent of product recognition from these three campaigns, 15 market areas were selected and five were randomly assigned to each advertising plan. At the end of the ad campaigns, random samples of 400 adults were selected in each area and the proportions who were familiar with new product were recorded. The responses were not approximately normal. Is there a significant difference among the three population distributions from which these samples came? Use an appropriate nonparametric method to answer this question at 5 % 5 % 5%5 \%5% level of significance.
Campaign
1 1 1\mathbf{1}1 2 2 2\mathbf{2}2 3 3 3\mathbf{3}3
0.33 0.28 0.21
0.29 0.41 0.30
Campaign 1 2 3 0.33 0.28 0.21 0.29 0.41 0.30| Campaign | | | | :— | :— | :— | | $\mathbf{1}$ | $\mathbf{2}$ | $\mathbf{3}$ | | 0.33 | 0.28 | 0.21 | | 0.29 | 0.41 | 0.30 |
0.21 0.34 0.26
0.32 0.39 0.33
0.25 0.27 0.31
0.21 0.34 0.26 0.32 0.39 0.33 0.25 0.27 0.31| 0.21 | 0.34 | 0.26 | | :— | :— | :— | | 0.32 | 0.39 | 0.33 | | 0.25 | 0.27 | 0.31 |
(15)
8. A psychology class performed an experiment to determine whether a recall score in which instructions to form images of 25 words were given differs from an initial recall score for which no imagery instructions were given. Twenty students participated in the experiment with the results listed in the table:
Student With Imagery Without Imagery Student With Imagery Without Imagery
1 20 5 11 17 8
2 24 9 12 20 16
3 20 5 13 20 10
4 18 9 14 16 12
5 22 6 15 24 7
6 19 11 16 22 9
7 20 8 17 25 21
8 19 11 18 21 14
9 17 7 19 19 12
10 21 9 20 23 13
Student With Imagery Without Imagery Student With Imagery Without Imagery 1 20 5 11 17 8 2 24 9 12 20 16 3 20 5 13 20 10 4 18 9 14 16 12 5 22 6 15 24 7 6 19 11 16 22 9 7 20 8 17 25 21 8 19 11 18 21 14 9 17 7 19 19 12 10 21 9 20 23 13| Student | With Imagery | Without Imagery | Student | With Imagery | Without Imagery | | :— | :— | :— | :— | :— | :— | | 1 | 20 | 5 | 11 | 17 | 8 | | 2 | 24 | 9 | 12 | 20 | 16 | | 3 | 20 | 5 | 13 | 20 | 10 | | 4 | 18 | 9 | 14 | 16 | 12 | | 5 | 22 | 6 | 15 | 24 | 7 | | 6 | 19 | 11 | 16 | 22 | 9 | | 7 | 20 | 8 | 17 | 25 | 21 | | 8 | 19 | 11 | 18 | 21 | 14 | | 9 | 17 | 7 | 19 | 19 | 12 | | 10 | 21 | 9 | 20 | 23 | 13 |
(a) What two testing procedures can be used to test for differences in the distribution of recall scores with and without imagery? What assumptions are required for the parametric procedure? Do these data satisfy these assumptions?
(b) Use both the parametric and non-parametric tests for differences in the distributions of recall scores under these two conditions.
(c) Compare the results of the tests in part b. Are the conclusions for same? If not, why not?

Solution

Question:-1

State whether the following statements are True or False. Give reason in support of your answer:

(a) If the probability of non rejection of H 0 H 0 H_(0)\mathrm{H}_0H0 when H 1 H 1 H_(1)\mathrm{H}_1H1 is true is 0.4 then power of the test will be 0.6.
(b) If T 1 T 1 T_(1)T_1T1 and T 2 T 2 T_(2)T_2T2 are two estimators of the parameter θ θ theta\thetaθ such that Var ( T 1 ) = 1 / n Var T 1 = 1 / n Var(T_(1))=1//n\operatorname{Var}\left(T_1\right)=1 / \mathrm{n}Var(T1)=1/n and Var ( T 2 ) = n Var T 2 = n Var(T_(2))=n\operatorname{Var}\left(\mathrm{T}_2\right)=\mathrm{n}Var(T2)=n then T 1 T 1 T_(1)\mathrm{T}_1T1 is more efficient than T 2 T 2 T_(2)\mathrm{T}_2T2.
(c) A 95 % 95 % 95%95 \%95% confidence interval is smaller than 99 % 99 % 99%99 \%99% confidence interval.
(d) If the level of significance is the same, the area of the rejection region in a two-tailed test is less than that in a one-tailed test.
(e) Non parametric tests are more powerful than the parametric tests.

Answer:


(a) Statement: If the probability of non-rejection of H 0 H 0 H_(0)H_0H0 when H 1 H 1 H_(1)H_1H1 is true is 0.4, then the power of the test will be 0.6.
  • Evaluation: True.
  • Justification: The power of a test is the probability of rejecting H 0 H 0 H_(0)H_0H0 when H 1 H 1 H_(1)H_1H1 is true, i.e., Power = P ( Reject H 0 H 1 true ) Power = P ( Reject H 0 H 1 true ) “Power”=P(“Reject “H_(0)∣H_(1)” true”)\text{Power} = P(\text{Reject } H_0 \mid H_1 \text{ true})Power=P(Reject H0H1 true). The probability of non-rejection of H 0 H 0 H_(0)H_0H0 when H 1 H 1 H_(1)H_1H1 is true is P ( Fail to reject H 0 H 1 true ) = 1 Power P ( Fail to reject H 0 H 1 true ) = 1 Power P(“Fail to reject “H_(0)∣H_(1)” true”)=1-“Power”P(\text{Fail to reject } H_0 \mid H_1 \text{ true}) = 1 – \text{Power}P(Fail to reject H0H1 true)=1Power. Given this probability is 0.4, we have 1 Power = 0.4 1 Power = 0.4 1-“Power”=0.41 – \text{Power} = 0.41Power=0.4, so Power = 1 0.4 = 0.6 Power = 1 0.4 = 0.6 “Power”=1-0.4=0.6\text{Power} = 1 – 0.4 = 0.6Power=10.4=0.6.

(b) Statement: If T 1 T 1 T_(1)T_1T1 and T 2 T 2 T_(2)T_2T2 are two estimators of the parameter θ θ theta\thetaθ such that Var ( T 1 ) = 1 / n Var ( T 1 ) = 1 / n Var(T_(1))=1//n\operatorname{Var}(T_1) = 1/nVar(T1)=1/n and Var ( T 2 ) = n Var ( T 2 ) = n Var(T_(2))=n\operatorname{Var}(T_2) = nVar(T2)=n, then T 1 T 1 T_(1)T_1T1 is more efficient than T 2 T 2 T_(2)T_2T2.
  • Evaluation: True, assuming both estimators are unbiased.
  • Justification: The efficiency of an estimator (for unbiased estimators) is determined by its variance; a lower variance indicates a more efficient estimator. Since Var ( T 1 ) = 1 / n Var ( T 1 ) = 1 / n Var(T_(1))=1//n\operatorname{Var}(T_1) = 1/nVar(T1)=1/n and Var ( T 2 ) = n Var ( T 2 ) = n Var(T_(2))=n\operatorname{Var}(T_2) = nVar(T2)=n, and assuming n > 0 n > 0 n > 0n > 0n>0, we have 1 / n < n 1 / n < n 1//n < n1/n < n1/n<n for n > 1 n > 1 n > 1n > 1n>1 (and even for 0 < n < 1 0 < n < 1 0 < n < 10 < n < 10<n<1, 1 / n > n 1 / n > n 1//n > n1/n > n1/n>n, but variance is typically positive and context suggests n n nnn is a sample size, so n 1 n 1 n >= 1n \geq 1n1). Thus, Var ( T 1 ) < Var ( T 2 ) Var ( T 1 ) < Var ( T 2 ) Var(T_(1)) < Var(T_(2))\operatorname{Var}(T_1) < \operatorname{Var}(T_2)Var(T1)<Var(T2), so T 1 T 1 T_(1)T_1T1 is more efficient. If the estimators are biased, efficiency comparisons require mean squared error, but the context suggests unbiasedness is assumed.

(c) Statement: A 95% confidence interval is smaller than a 99% confidence interval.
  • Evaluation: True.
  • Justification: The width of a confidence interval is proportional to the critical value from the standard normal or t-distribution. For a 95% confidence interval, the critical value (e.g., z 1.96 z 1.96 z~~1.96z \approx 1.96z1.96 for large samples) is smaller than for a 99% confidence interval (e.g., z 2.576 z 2.576 z~~2.576z \approx 2.576z2.576). Since the width is typically of the form estimate ± z standard error estimate ± z standard error “estimate”+-z*”standard error”\text{estimate} \pm z \cdot \text{standard error}estimate±zstandard error, a smaller z z zzz results in a narrower interval, assuming the same standard error and sample size.

(d) Statement: If the level of significance is the same, the area of the rejection region in a two-tailed test is less than that in a one-tailed test.
  • Evaluation: False.
  • Justification: The level of significance α α alpha\alphaα represents the total probability of the rejection region. In a one-tailed test, the entire rejection region is in one tail, so the area is α α alpha\alphaα. In a two-tailed test, the rejection region is split between both tails, with each tail having area α / 2 α / 2 alpha//2\alpha/2α/2, so the total area is still α α alpha\alphaα. Thus, the total area of the rejection region is the same in both tests, not less in the two-tailed test. For example, if α = 0.05 α = 0.05 alpha=0.05\alpha = 0.05α=0.05, a one-tailed test has a rejection region of 0.05 in one tail, while a two-tailed test has 0.025 in each tail, totaling 0.05.

(e) Statement: Non-parametric tests are more powerful than parametric tests.
  • Evaluation: False.
  • Justification: Non-parametric tests are generally less powerful than parametric tests when the assumptions of the parametric tests (e.g., normality, equal variances) are met. Parametric tests use specific distributional assumptions to model the data, leading to higher power for detecting true effects. Non-parametric tests, which do not assume a specific distribution, are more robust but sacrifice power. For example, the Mann-Whitney U test (non-parametric) is less powerful than the t-test (parametric) for normally distributed data with equal variances. However, non-parametric tests can be more powerful when parametric assumptions are violated, but the statement is absolute and does not account for this condition.


Scroll to Top
Scroll to Top