Sample Solution

MST-005 Solved Assignment 2025

  1. State whether the following statements are true or false and also give the reason in support of your answer:
    (a) The total number of all possible samples of size 2 without replacement from a population of size 7 is 21 .
    (b) Consecutive 3 random numbers starting from 8937 by ‘middle square method’ are 8937, 8699, 6726.
    (c) RBD is suitable in situations where it is not possible to divide the experimental material into a number of homogeneous blocks.
    (d) As we increase the sample size, representativeness of the population by the sample decreases.
    (e) In a big hall, there are 50 rows and each row has 60 students. A research scholar selects 10 rows randomly and then randomly selects 15 students from each selected row. It is an example of cluster sampling procedure.
  2. (a) Draw all possible samples of size 2 from the population [2, 3, 4] and verify that E ( x ¯ ) = X ¯ E ( x ¯ ) = X ¯ E( bar(x))= bar(X)E(\bar{x})=\bar{X}E(x¯)=X¯. Also find variance of x ¯ x ¯ bar(x)\bar{x}x¯.
    (b) A sample of 60 students is to be drawn from a population consisting of 600 students belonging to two villages, A and B . The means and standard deviations of their marks are give below:
Villages Stratum sizes ( N i ) N i (N_(i))\left(\mathbf{N}_{\mathbf{i}}\right)(Ni) Means ( x i ) x i (x_(i))\left(\mathbf{x}_{\mathbf{i}}\right)(xi) Standard deviations
Village A 400 60 20
Village B 200 120 80
Villages Stratum sizes (N_(i)) Means (x_(i)) Standard deviations Village A 400 60 20 Village B 200 120 80| Villages | Stratum sizes $\left(\mathbf{N}_{\mathbf{i}}\right)$ | Means $\left(\mathbf{x}_{\mathbf{i}}\right)$ | Standard deviations | | :— | :—: | :—: | :—: | | Village A | 400 | 60 | 20 | | Village B | 200 | 120 | 80 |
What are the sample sizes for the two villages using proportional allocation technique?
  1. To determine the yield rate of wheat in a district of Punjab, 6 groups of 6 plots each were constructed. The data are given in the following table:
Plot No. Group 1 Group 2 Group 3 Group 4 Group 5 Group 6
1 8 6 18 13 17 12
2 13 5 8 7 15 15
3 11 16 6 13 10 11
4 26 5 10 6 21 17
5 13 16 16 7 20 8
6 31 5 20 2 25 10
Plot No. Group 1 Group 2 Group 3 Group 4 Group 5 Group 6 1 8 6 18 13 17 12 2 13 5 8 7 15 15 3 11 16 6 13 10 11 4 26 5 10 6 21 17 5 13 16 16 7 20 8 6 31 5 20 2 25 10| Plot No. | Group 1 | Group 2 | Group 3 | Group 4 | Group 5 | Group 6 | | :— | :— | :— | :— | :— | :— | :— | | 1 | 8 | 6 | 18 | 13 | 17 | 12 | | 2 | 13 | 5 | 8 | 7 | 15 | 15 | | 3 | 11 | 16 | 6 | 13 | 10 | 11 | | 4 | 26 | 5 | 10 | 6 | 21 | 17 | | 5 | 13 | 16 | 16 | 7 | 20 | 8 | | 6 | 31 | 5 | 20 | 2 | 25 | 10 |
Select a cluster sample of 3 clusters from the above data and find its sample mean. Further, explain the procedure of two-stage sampling if we want to draw a sample of 6 plots. Which are the 6 plots in your sample?
(7)
4. The following data relate to production in kg of three varieties P , Q , R P , Q , R P,Q,R\mathrm{P}, \mathrm{Q}, \mathrm{R}P,Q,R of wheat:
P : : ::: 14 16 18
Q : 14 13 15 22
R : 18 16 19 15 20
P : 14 16 18 Q : 14 13 15 22 R : 18 16 19 15 20| P | $:$ | 14 | 16 | 18 | | | :— | :— | :— | :—: | :— | :— | | Q : | 14 | 13 | 15 | 22 | | | R : | 18 | 16 | 19 | 15 | 20 |
Is there any significant difference among the three varieties at 5 % 5 % 5%5 \%5% level of significance?
(7)
5. A researcher wants to test four diets A , B , C , D A , B , C , D A,B,C,D\mathrm{A}, \mathrm{B}, \mathrm{C}, \mathrm{D}A,B,C,D on growth rate in mice. These animals are divided into 3 groups according to their weights. Heaviest 4, next 4 and lightest 4 are put in Block I, Block II, and Block III, respectively. Within each block, one of the diets is given at random to the animals. After 15 days, increase in weight is noted, which is given in the following table:
Blocks Treatments/Diets
A B C D
I 12 8 6 5
II 15 12 9 6
III 14 10 8 5
Blocks Treatments/Diets A B C D I 12 8 6 5 II 15 12 9 6 III 14 10 8 5| Blocks | Treatments/Diets | | | | | :— | :— | :— | :— | :— | | | A | B | C | D | | I | 12 | 8 | 6 | 5 | | II | 15 | 12 | 9 | 6 | | III | 14 | 10 | 8 | 5 |
Perform a two-way ANOVA to test whether the data indicates any significant difference between’ the four diets due to different blocks.
  1. In the following data, two values are missing. Estimate these values by Yates method and analyse the data by suitable technique.
Treatments Blocks
I II III
A 12 14 12
B 10 y 8
C x 15 10
Treatments Blocks I II III A 12 14 12 B 10 y 8 C x 15 10| Treatments | Blocks | | | | :— | :— | :— | :— | | | I | II | III | | A | 12 | 14 | 12 | | B | 10 | y | 8 | | C | x | 15 | 10 |
  1. Identify the design given in the following table and then carry out the analysis:
Column Row
I II III IV
I A 8 C 18 B 11 D 8
II C 16 B 10 D 7 A 4
III B 12 D 10 A 6 C 20
IV D 10 A 9 C 28 B 16
Column Row I II III IV I A 8 C 18 B 11 D 8 II C 16 B 10 D 7 A 4 III B 12 D 10 A 6 C 20 IV D 10 A 9 C 28 B 16| Column | Row | | | | | :— | :— | :— | :— | :— | | | I | II | III | IV | | I | A 8 | C 18 | B 11 | D 8 | | II | C 16 | B 10 | D 7 | A 4 | | III | B 12 | D 10 | A 6 | C 20 | | IV | D 10 | A 9 | C 28 | B 16 |
  1. (a) The distribution function of Pareto distribution is given by
f ( x ) = 1 ( k x ) a , a > 0 , 0 < k x f ( x ) = 1 k x a , a > 0 , 0 < k x f(x)=1-((k)/(x))^(a),quad a > 0,0 < k <= xf(x)=1-\left(\frac{k}{x}\right)^a, \quad a>0,0<k \leq xf(x)=1(kx)a,a>0,0<kx
Given a U U ( 0 , 1 ) U U ( 0 , 1 ) U∼U(0,1)\mathrm{U} \sim \mathrm{U}(0,1)UU(0,1), generate a random number from the above distribution, when a = a = a=\mathrm{a}=a= 2 and k = 1 k = 1 k=1k=1k=1. Suppose U = 0.5 U = 0.5 U=0.5U=0.5U=0.5, then find x x xxx.
(b) Generate a complete cycle for the LCG given below: x i = ( 5 x i 1 + 3 ) mod 16 x i = 5 x i 1 + 3 mod 16 x_(i)=(5x_(i-1)+3)mod 16x_i=\left(5 x_{i-1}+3\right) \bmod 16xi=(5xi1+3)mod16, with x 0 = x 0 = x_(0)=x_0=x0= 5. A man tosses an unbiased coin ten times. Using the first ten random numbers generated above, obtain a sequence of heads and tails by taking Head ( H ) ( H ) (H)(\mathrm{H})(H) as u 0 5 u 0 5 u >= 0-5u \geq 0-5u05.
9. Times between successive crashes of a computer system were generated for a 6-month period and are given in increasing order as follows (time in hours):
1 10 20 30 40 52 63 70 80 90 100 102
130 140 190 210 266 310 530 590 640 1340
1 10 20 30 40 52 63 70 80 90 100 102 130 140 190 210 266 310 530 590 640 1340 | 1 | 10 | 20 | 30 | 40 | 52 | 63 | 70 | 80 | 90 | 100 | 102 | | :— | :— | :— | :— | :— | :— | :— | :— | :— | :— | :— | :— | | 130 | 140 | 190 | 210 | 266 | 310 | 530 | 590 | 640 | 1340 | | |
The parameter a = 0.00435 a = 0.00435 a=0.00435\mathrm{a}=0.00435a=0.00435, mean = 1 / α = 230 hrs = 1 / α = 230 hrs =1//alpha=230hrs=1 / \alpha=230 \mathrm{hrs}=1/α=230hrs.
Use the Kolmogorov-Smirnov test to examine the goodness of fit of exponential distribution.

Solution

Question:-1

State whether the following statements are true or false and also give the reason in support of your answer:

(a) The total number of all possible samples of size 2 without replacement from a population of size 7 is 21.
(b) Consecutive 3 random numbers starting from 8937 by ‘middle square method’ are 8937, 8699, 6726.
(c) RBD is suitable in situations where it is not possible to divide the experimental material into a number of homogeneous blocks.
(d) As we increase the sample size, representativeness of the population by the sample decreases.
(e) In a big hall, there are 50 rows and each row has 60 students. A research scholar selects 10 rows randomly and then randomly selects 15 students from each selected row. It is an example of cluster sampling procedure.

Answer:


(a) The total number of all possible samples of size 2 without replacement from a population of size 7 is 21.
  • True.
  • Justification: The number of ways to choose 2 items from 7 without replacement is given by the combination formula C ( n , k ) = n ! k ! ( n k ) ! C ( n , k ) = n ! k ! ( n k ) ! C(n,k)=(n!)/(k!(n-k)!)C(n, k) = \frac{n!}{k!(n-k)!}C(n,k)=n!k!(nk)!. Here, n = 7 n = 7 n=7n = 7n=7, k = 2 k = 2 k=2k = 2k=2, so C ( 7 , 2 ) = 7 6 2 1 = 21 C ( 7 , 2 ) = 7 6 2 1 = 21 C(7,2)=(7*6)/(2*1)=21C(7, 2) = \frac{7 \cdot 6}{2 \cdot 1} = 21C(7,2)=7621=21. Thus, there are 21 possible samples.

(b) Consecutive 3 random numbers starting from 8937 by ‘middle square method’ are 8937, 8699, 6726.
  • False.
  • Justification: In the middle square method, a number is squared, and the middle digits are extracted to form the next number. Start with 8937:
    • 8937 2 = 79 , 869 , 969 8937 2 = 79 , 869 , 969 8937^(2)=79,869,9698937^2 = 79,869,96989372=79,869,969. For a 4-digit number, extract the middle 4 digits (positions 3–6, assuming 8 digits): 9869.
    • Next, 9869 2 = 97 , 397 , 161 9869 2 = 97 , 397 , 161 9869^(2)=97,397,1619869^2 = 97,397,16198692=97,397,161. Middle 4 digits: 7397.
    • The sequence is 8937, 9869, 7397, not 8937, 8699, 6726. Thus, the given sequence is incorrect.

(c) RBD is suitable in situations where it is not possible to divide the experimental material into a number of homogeneous blocks.
  • False.
  • Justification: Randomized Block Design (RBD) requires dividing experimental material into homogeneous blocks to control for variability within blocks. If it’s not possible to form homogeneous blocks, RBD is unsuitable, and other designs (e.g., completely randomized design) may be considered.

(d) As we increase the sample size, representativeness of the population by the sample decreases.
  • False.
  • Justification: Larger sample sizes generally increase representativeness, as they reduce sampling error and better capture the population’s characteristics, assuming random sampling. For example, in a population with mean μ μ mu\muμ, the sample mean’s variance decreases as sample size n n nnn increases (by the formula Var ( x ¯ ) = σ 2 n Var ( x ¯ ) = σ 2 n “Var”( bar(x))=(sigma^(2))/(n)\text{Var}(\bar{x}) = \frac{\sigma^2}{n}Var(x¯)=σ2n).

(e) In a big hall, there are 50 rows and each row has 60 students. A research scholar selects 10 rows randomly and then randomly selects 15 students from each selected row. It is an example of cluster sampling procedure.
  • False.
  • Justification: This is two-stage sampling, not cluster sampling. In cluster sampling, entire clusters (e.g., rows) are randomly selected, and all units within selected clusters are sampled. Here, only 15 students are randomly selected from each of the 10 chosen rows, not all 60. This resembles a combination of cluster and simple random sampling within clusters.


Scroll to Top
Scroll to Top