State whether the following statements are true or false and also give the reason in support of your answer:
(a) The total number of all possible samples of size 2 without replacement from a population of size 7 is 21 .
(b) Consecutive 3 random numbers starting from 8937 by ‘middle square method’ are 8937, 8699, 6726.
(c) RBD is suitable in situations where it is not possible to divide the experimental material into a number of homogeneous blocks.
(d) As we increase the sample size, representativeness of the population by the sample decreases.
(e) In a big hall, there are 50 rows and each row has 60 students. A research scholar selects 10 rows randomly and then randomly selects 15 students from each selected row. It is an example of cluster sampling procedure.
(a) Draw all possible samples of size 2 from the population [2, 3, 4] and verify that E( bar(x))= bar(X)E(\bar{x})=\bar{X}. Also find variance of bar(x)\bar{x}.
(b) A sample of 60 students is to be drawn from a population consisting of 600 students belonging to two villages, A and B . The means and standard deviations of their marks are give below:
Villages Stratum sizes (N_(i)) Means (x_(i)) Standard deviations
Village A 400 60 20
Village B 200 120 80| Villages | Stratum sizes $\left(\mathbf{N}_{\mathbf{i}}\right)$ | Means $\left(\mathbf{x}_{\mathbf{i}}\right)$ | Standard deviations |
| :— | :—: | :—: | :—: |
| Village A | 400 | 60 | 20 |
| Village B | 200 | 120 | 80 |
What are the sample sizes for the two villages using proportional allocation technique?
To determine the yield rate of wheat in a district of Punjab, 6 groups of 6 plots each were constructed. The data are given in the following table:
Select a cluster sample of 3 clusters from the above data and find its sample mean. Further, explain the procedure of two-stage sampling if we want to draw a sample of 6 plots. Which are the 6 plots in your sample?
(7)
4. The following data relate to production in kg of three varieties P,Q,R\mathrm{P}, \mathrm{Q}, \mathrm{R} of wheat:
Is there any significant difference among the three varieties at 5%5 \% level of significance?
(7)
5. A researcher wants to test four diets A,B,C,D\mathrm{A}, \mathrm{B}, \mathrm{C}, \mathrm{D} on growth rate in mice. These animals are divided into 3 groups according to their weights. Heaviest 4, next 4 and lightest 4 are put in Block I, Block II, and Block III, respectively. Within each block, one of the diets is given at random to the animals. After 15 days, increase in weight is noted, which is given in the following table:
Blocks
Treatments/Diets
A
B
C
D
I
12
8
6
5
II
15
12
9
6
III
14
10
8
5
Blocks Treatments/Diets
A B C D
I 12 8 6 5
II 15 12 9 6
III 14 10 8 5| Blocks | Treatments/Diets | | | |
| :— | :— | :— | :— | :— |
| | A | B | C | D |
| I | 12 | 8 | 6 | 5 |
| II | 15 | 12 | 9 | 6 |
| III | 14 | 10 | 8 | 5 |
Perform a two-way ANOVA to test whether the data indicates any significant difference between’ the four diets due to different blocks.
In the following data, two values are missing. Estimate these values by Yates method and analyse the data by suitable technique.
Treatments
Blocks
I
II
III
A
12
14
12
B
10
y
8
C
x
15
10
Treatments Blocks
I II III
A 12 14 12
B 10 y 8
C x 15 10| Treatments | Blocks | | |
| :— | :— | :— | :— |
| | I | II | III |
| A | 12 | 14 | 12 |
| B | 10 | y | 8 |
| C | x | 15 | 10 |
Identify the design given in the following table and then carry out the analysis:
Column
Row
I
II
III
IV
I
A 8
C 18
B 11
D 8
II
C 16
B 10
D 7
A 4
III
B 12
D 10
A 6
C 20
IV
D 10
A 9
C 28
B 16
Column Row
I II III IV
I A 8 C 18 B 11 D 8
II C 16 B 10 D 7 A 4
III B 12 D 10 A 6 C 20
IV D 10 A 9 C 28 B 16| Column | Row | | | |
| :— | :— | :— | :— | :— |
| | I | II | III | IV |
| I | A 8 | C 18 | B 11 | D 8 |
| II | C 16 | B 10 | D 7 | A 4 |
| III | B 12 | D 10 | A 6 | C 20 |
| IV | D 10 | A 9 | C 28 | B 16 |
(a) The distribution function of Pareto distribution is given by
f(x)=1-((k)/(x))^(a),quad a > 0,0 < k <= xf(x)=1-\left(\frac{k}{x}\right)^a, \quad a>0,0<k \leq x
Given a U∼U(0,1)\mathrm{U} \sim \mathrm{U}(0,1), generate a random number from the above distribution, when a=\mathrm{a}= 2 and k=1k=1. Suppose U=0.5U=0.5, then find xx.
(b) Generate a complete cycle for the LCG given below: x_(i)=(5x_(i-1)+3)mod 16x_i=\left(5 x_{i-1}+3\right) \bmod 16, with x_(0)=x_0= 5. A man tosses an unbiased coin ten times. Using the first ten random numbers generated above, obtain a sequence of heads and tails by taking Head (H)(\mathrm{H}) as u >= 0-5u \geq 0-5.
9. Times between successive crashes of a computer system were generated for a 6-month period and are given in increasing order as follows (time in hours):
The parameter a=0.00435\mathrm{a}=0.00435, mean =1//alpha=230hrs=1 / \alpha=230 \mathrm{hrs}.
Use the Kolmogorov-Smirnov test to examine the goodness of fit of exponential distribution.
Solution
Question:-1
State whether the following statements are true or false and also give the reason in support of your answer:
(a) The total number of all possible samples of size 2 without replacement from a population of size 7 is 21.
(b) Consecutive 3 random numbers starting from 8937 by ‘middle square method’ are 8937, 8699, 6726.
(c) RBD is suitable in situations where it is not possible to divide the experimental material into a number of homogeneous blocks.
(d) As we increase the sample size, representativeness of the population by the sample decreases.
(e) In a big hall, there are 50 rows and each row has 60 students. A research scholar selects 10 rows randomly and then randomly selects 15 students from each selected row. It is an example of cluster sampling procedure.
Answer:
(a) The total number of all possible samples of size 2 without replacement from a population of size 7 is 21.
True.
Justification: The number of ways to choose 2 items from 7 without replacement is given by the combination formula C(n,k)=(n!)/(k!(n-k)!)C(n, k) = \frac{n!}{k!(n-k)!}. Here, n=7n = 7, k=2k = 2, so C(7,2)=(7*6)/(2*1)=21C(7, 2) = \frac{7 \cdot 6}{2 \cdot 1} = 21. Thus, there are 21 possible samples.
(b) Consecutive 3 random numbers starting from 8937 by ‘middle square method’ are 8937, 8699, 6726.
False.
Justification: In the middle square method, a number is squared, and the middle digits are extracted to form the next number. Start with 8937:
8937^(2)=79,869,9698937^2 = 79,869,969. For a 4-digit number, extract the middle 4 digits (positions 3–6, assuming 8 digits): 9869.
The sequence is 8937, 9869, 7397, not 8937, 8699, 6726. Thus, the given sequence is incorrect.
(c) RBD is suitable in situations where it is not possible to divide the experimental material into a number of homogeneous blocks.
False.
Justification: Randomized Block Design (RBD) requires dividing experimental material into homogeneous blocks to control for variability within blocks. If it’s not possible to form homogeneous blocks, RBD is unsuitable, and other designs (e.g., completely randomized design) may be considered.
(d) As we increase the sample size, representativeness of the population by the sample decreases.
False.
Justification: Larger sample sizes generally increase representativeness, as they reduce sampling error and better capture the population’s characteristics, assuming random sampling. For example, in a population with mean mu\mu, the sample mean’s variance decreases as sample size nn increases (by the formula “Var”( bar(x))=(sigma^(2))/(n)\text{Var}(\bar{x}) = \frac{\sigma^2}{n}).
(e) In a big hall, there are 50 rows and each row has 60 students. A research scholar selects 10 rows randomly and then randomly selects 15 students from each selected row. It is an example of cluster sampling procedure.
False.
Justification: This is two-stage sampling, not cluster sampling. In cluster sampling, entire clusters (e.g., rows) are randomly selected, and all units within selected clusters are sampled. Here, only 15 students are randomly selected from each of the 10 chosen rows, not all 60. This resembles a combination of cluster and simple random sampling within clusters.