IGNOU MST-004 Previous Year Paper Solution

IGNOU MST-004 Previous Year Paper Solution for PGDAST

Solved By – Narendra Kr. Sharma – M.Sc (Mathematics Honors) – Delhi University

₹365.00

Get Instant Access

Share with your Friends

MST-004 Previous Year Paper Solution - Sample

MST-004 Dec 2023

Question:-01

1.State whether the following statements are True or False. Give reasons in support of your answers :

(a) If sample size of a survey has increased 3 times, then the standard error will be increased 3 times.

Answer:

The statement "If the sample size of a survey has increased 3 times, then the standard error will be increased 3 times" is false.

To justify why this statement is false, we can look at the relationship between the standard error of the mean and the sample size. The standard error of the mean (SEM) is given by the formula:

SEM = \frac{σ}{\sqrt{n}}

where

σ

is the population standard deviation and

n

is the sample size. This formula shows that the standard error of the mean is inversely proportional to the square root of the sample size.

When the sample size

n

is increased by a factor of 3, the new sample size becomes

3 n

. Plugging this into the formula for the standard error gives:

{SEM}_{new} = \frac{σ}{\sqrt{3 n}}

To see how the standard error changes, compare the new SEM with the original SEM:

{SEM}_{new} = \frac{σ}{\sqrt{3 n}} = \frac{σ}{\sqrt{3} \sqrt{n}} = \frac{1}{\sqrt{3}} \frac{σ}{\sqrt{n}} = \frac{1}{\sqrt{3}} {SEM}_{old}

Since

\frac{1}{\sqrt{3}} \approx 0.577

, the new standard error is approximately

0.577

times the original standard error. This means that the standard error does not increase three times; rather, it decreases by a factor of about

\sqrt{3}

or decreases to about 57.7% of its original value.

Therefore, the statement is false because increasing the sample size by three times results in the standard error decreasing, not increasing, and specifically decreasing by a factor of about

\sqrt{3}

(b) If probability density function of a random variable

X

follows

F

-distribution

f (x) = \frac{1}{(1 + x)^{2}}; 0 < x < \infty,

then the degree of freedom of the distribution will be

(2, 2)

Answer:

The statement "If the probability density function of a random variable

X

follows

F

-distribution

f (x) = \frac{1}{(1 + x)^{2}}; 0 < x < \infty

, then the degree of freedom of the distribution will be

(2, 2)

" is true.

To justify this, let’s analyze the given information:

The probability density function (PDF) provided is:

f (x) = \frac{1}{(1 + x)^{2}}, 0 < x < \infty

The PDF of the

F

-distribution with degrees of freedom

v_{1}

and

v_{2}

is given by:

f (x) = (\frac{v_{1} / v_{2}}{B (\frac{v_{1}}{2}, \frac{v_{2}}{2})}) {(\frac{v_{1}}{v_{2}})}^{v_{1} / 2} x^{(v_{1} / 2) - 1} {(1 + \frac{v_{1}}{v_{2}} x)}^{- (v_{1} + v_{2}) / 2}

Given:

f (x) = \frac{1}{(1 + x)^{2}}, 0 < x < \infty

We can rewrite the given PDF as:

f (x) = \frac{(2 / 2)^{2 / 2} x^{(2 / 2) - 1}}{B (2 / 2, 2 / 2) {(1 + \frac{2}{2} x)}^{(2 + 2) / 2}}

Comparing this with the standard form of the

F

-distribution PDF, we observe that the given PDF matches if

v_{1} = 2

and

v_{2} = 2

Thus, the degrees of freedom for the given distribution are

(2, 2)

Therefore, the statement is true.

H_{0} : θ = 2

against

H_{1} : θ = 3

, the

p d f

of the variable is given by

f (x, θ) = \frac{1}{θ}; 0 \leq x \leq θ

If the critical region is

x \geq 0.6

, the size of the test will be 0.6 .

Answer:

The statement "For testing

H_{0} : θ = 2

against

H_{1} : θ = 3

, the

p d f

of the variable is given by

f (x, θ) = \frac{1}{θ}; 0 \leq x \leq θ

. If the critical region is

x \geq 0.6

, the size of the test will be 0.6" is false.

To justify this, let’s understand the concepts of hypothesis testing and the size of a test.

The size of a test, also known as the significance level (α), is the probability of rejecting the null hypothesis

H_{0}

when it is actually true. This is also known as the Type I error rate.

Given:

$H_{0} : θ = 2$
$H_{1} : θ = 3$
The probability density function (pdf) of the variable: $f (x, θ) = \frac{1}{θ}$ , for $0 \leq x \leq θ$

The critical region is

x \geq 0.6

First, let’s find the size of the test under the null hypothesis

H_{0} : θ = 2

The pdf under

H_{0}

is:

f (x, θ = 2) = \frac{1}{2}, 0 \leq x \leq 2

To find the size of the test, we need to calculate the probability that

x \geq 0.6

when

θ = 2

Size of the test = P (x \geq 0.6 ∣ θ = 2) = \int_{0.6}^{2} \frac{1}{2} d x

Calculating the integral:

\int_{0.6}^{2} \frac{1}{2} d x = \frac{1}{2} {[x]}_{0.6}^{2} = \frac{1}{2} (2 - 0.6) = \frac{1}{2} \times 1.4 = 0.7

Therefore, the size of the test is 0.7, not 0.6.

Hence, the statement is false because the size of the test under the null hypothesis

θ = 2

with the critical region

x \geq 0.6

is 0.7, not 0.6.

(d) Kruskal-Wallis test is a non-parametric version of two-way analysis of variance.

Answer:

The statement "Kruskal-Wallis test is a non-parametric version of two-way analysis of variance" is false.

To justify this, let’s review the purposes and uses of the Kruskal-Wallis test and two-way analysis of variance (ANOVA).

Kruskal-Wallis Test:

The Kruskal-Wallis test is a non-parametric method used to test whether there are statistically significant differences between the medians of three or more independent groups.
It is the non-parametric alternative to one-way ANOVA and is used when the assumptions of one-way ANOVA (such as normality) are not met.
The Kruskal-Wallis test ranks all the data from all groups together and then compares the sum of ranks between the groups.

Two-Way ANOVA:

Two-way ANOVA is a parametric method used to determine if there are any significant differences between the means of three or more groups, considering two independent variables (factors).
It can test the interaction between the two factors, as well as the individual effects of each factor.

Key Differences:

Number of Factors: The Kruskal-Wallis test deals with one factor with multiple levels (one-way analysis), while two-way ANOVA involves two factors.
Type of Data: Kruskal-Wallis is non-parametric and uses ranks, making it suitable for ordinal data or data that do not meet the assumptions of normality. Two-way ANOVA is parametric and assumes normal distribution of the residuals.

Correct Non-Parametric Alternative:

The non-parametric alternative to two-way ANOVA is the Friedman test, which is used for analyzing differences in treatments across multiple test attempts (blocks) when the data is not normally distributed.

Therefore, the Kruskal-Wallis test is not a non-parametric version of two-way ANOVA; it is a non-parametric version of one-way ANOVA. The statement is false.

(e) A sample of size 4 is drawn randomly

(X_{1}, X_{2}, X_{3}

and

X_{4}

) from a normal population with unknown mean

μ

, then

\frac{X_{1} + 2 X_{2} + 3 X_{3} + X_{4}}{7}

is an unbiased estimator of

μ

Answer:

To determine whether

\frac{X_{1} + 2 X_{2} + 3 X_{3} + X_{4}}{7}

is an unbiased estimator of the population mean

μ

, we need to check if the expected value of this estimator equals

μ

Given that

X_{1}, X_{2}, X_{3}

, and

X_{4}

are drawn from a normal population with unknown mean

μ

, the expected value of each

X_{i}

μ

The estimator in question is:

\hat{μ} = \frac{X_{1} + 2 X_{2} + 3 X_{3} + X_{4}}{7}

To check if this is an unbiased estimator of

μ

, we calculate the expected value of

\hat{μ}

E (\hat{μ}) = E (\frac{X_{1} + 2 X_{2} + 3 X_{3} + X_{4}}{7})

Using the linearity of expectation, we have:

E (\hat{μ}) = \frac{1}{7} E (X_{1} + 2 X_{2} + 3 X_{3} + X_{4})

E (\hat{μ}) = \frac{1}{7} (E (X_{1}) + 2 E (X_{2}) + 3 E (X_{3}) + E (X_{4}))

Since

E (X_{i}) = μ

for all

i

, we get:

E (\hat{μ}) = \frac{1}{7} (μ + 2 μ + 3 μ + μ)

E (\hat{μ}) = \frac{1}{7} (7 μ)

E (\hat{μ}) = μ

Thus, the expected value of the estimator

\frac{X_{1} + 2 X_{2} + 3 X_{3} + X_{4}}{7}

is equal to

μ

, which means it is an unbiased estimator of

μ

Therefore, the statement "A sample of size 4 is drawn randomly (

X_{1}, X_{2}, X_{3}

, and

X_{4}

) from a normal population with unknown mean

μ

, then

\frac{X_{1} + 2 X_{2} + 3 X_{3} + X_{4}}{7}

is an unbiased estimator of

μ

" is true.

Question:-02

2.The systolic blood pressure (SBP) of five women are given as follows :

120, 110, 130, 140, 100

(a) How many samples of size 2 can be drawn without replacement? Write them.

(b) Compute the mean of all samples of size 2 and set up the sampling distribution of the sample mean.

(d) How many samples of the same size 2 are possible with replacement ? Calculate expected value of the sample mean and compare it with the expected value calculated in the case of without replacement.

Answer:

Let’s address each part of the problem step by step.

Given Data:

Systolic Blood Pressure (SBP) of five women:

120, 110, 130, 140, 100

(a) How many samples of size 2 can be drawn without replacement? Write them.

The number of samples of size 2 that can be drawn without replacement from 5 items is given by the combination formula:

(\binom{5}{2}) = \frac{5!}{2! (5 - 2)!} = 10

The possible samples without replacement are:

(120, 110)
(120, 130)
(120, 140)
(120, 100)
(110, 130)
(110, 140)
(110, 100)
(130, 140)
(130, 100)
(140, 100)

(b) Compute the mean of all samples of size 2 and set up the sampling distribution of the sample mean.

First, compute the means of each sample:

(120, 110): $\frac{120 + 110}{2} = 115$
(120, 130): $\frac{120 + 130}{2} = 125$
(120, 140): $\frac{120 + 140}{2} = 130$
(120, 100): $\frac{120 + 100}{2} = 110$
(110, 130): $\frac{110 + 130}{2} = 120$
(110, 140): $\frac{110 + 140}{2} = 125$
(110, 100): $\frac{110 + 100}{2} = 105$
(130, 140): $\frac{130 + 140}{2} = 135$
(130, 100): $\frac{130 + 100}{2} = 115$
(140, 100): $\frac{140 + 100}{2} = 120$

The sampling distribution of the sample mean is:

105, 110, 115, 115, 120, 120, 120, 125, 125, 135

(c) Compute the expected value of the sample mean.

The expected value of the sample mean

\bar{X}

is the average of the sample means:

E (\bar{X}) = \frac{1}{10} \sum {\bar{X}}_{i} = \frac{105 + 110 + 115 + 115 + 120 + 120 + 120 + 125 + 125 + 135}{10} = \frac{1190}{10} = 119

(d) How many samples of the same size 2 are possible with replacement? Calculate the expected value of the sample mean and compare it with the expected value calculated in the case of without replacement.

The number of samples of size 2 that can be drawn with replacement from 5 items is given by the formula for permutations with replacement:

5^{2} = 25

The possible samples with replacement are:

(120, 120), (120, 110), (120, 130), (120, 140), (120, 100),

(110, 120), (110, 110), (110, 130), (110, 140), (110, 100),

(130, 120), (130, 110), (130, 130), (130, 140), (130, 100),

(140, 120), (140, 110), (140, 130), (140, 140), (140, 100),

(100, 120), (100, 110), (100, 130), (100, 140), (100, 100)

Compute the means of each sample:

\bar{X}

120, 115, 125, 130, 110,

115, 110, 120, 125, 105,

125, 120, 130, 135, 115,

130, 125, 135, 140, 120,

110, 105, 115, 120, 100

The expected value of the sample mean with replacement is:

E (\bar{X}) = \frac{1}{25} \sum {\bar{X}}_{i} = \frac{120 + 115 + 125 + 130 + 110 + 115 + 110 + 120 + 125 + 105 + 125 + 120 + 130 + 135 + 115 + 130 + 125 + 135 + 140 + 120 + 110 + 105 + 115 + 120 + 100}{25}

Summing these values:

\sum {\bar{X}}_{i} = 2925

So,

E (\bar{X}) = \frac{2925}{25} = 117

Comparison:

The expected value of the sample mean without replacement is 119.
The expected value of the sample mean with replacement is 117.

Thus, the expected value of the sample mean without replacement is slightly higher than that with replacement in this case.

Question:-03

3.(a) The following table gives the classification of 150 products according to types of tools and materials used to produce these products :

Tool	Material
Tool	A	B	C
$T_{1}$	15	5	20
$T_{2}$	20	10	30
$T_{3}$	25	15	10

Test whether the tools and materials used are independent at

5 %

level of significance.

Answer:

To test whether the tools and materials used are independent at the

5 %

level of significance, we can use the Chi-Square Test of Independence. Here are the steps:

State the Hypotheses:
- $H_{0}$ : The tools and materials used are independent.
- $H_{1}$ : The tools and materials used are not independent.
Observed Frequencies (O):

\begin{array}{cccc} Tool & A & B & C \\ T_{1} & 15 & 5 & 20 \\ T_{2} & 20 & 10 & 30 \\ T_{3} & 25 & 15 & 10 \end{array}

Calculate the row and column totals and the grand total:

\begin{array}{ccccc} Tool & A & B & C & Row Total \\ T_{1} & 15 & 5 & 20 & 40 \\ T_{2} & 20 & 10 & 30 & 60 \\ T_{3} & 25 & 15 & 10 & 50 \\ Column Total & 60 & 30 & 60 & 150 \end{array}

Calculate the Expected Frequencies (E):

E_{i j} = \frac{(Row Total of i) \times (Column Total of j)}{Grand Total}

\begin{array}{cccc} Tool & A & B & C \\ T_{1} & \frac{40 \times 60}{150} = 16 & \frac{40 \times 30}{150} = 8 & \frac{40 \times 60}{150} = 16 \\ T_{2} & \frac{60 \times 60}{150} = 24 & \frac{60 \times 30}{150} = 12 & \frac{60 \times 60}{150} = 24 \\ T_{3} & \frac{50 \times 60}{150} = 20 & \frac{50 \times 30}{150} = 10 & \frac{50 \times 60}{150} = 20 \end{array}

Compute the Chi-Square Test Statistic:

χ^{2} = \sum \frac{(O_{i j} - E_{i j})^{2}}{E_{i j}}

\begin{array}{cccc} Tool & A & B & C \\ T_{1} & \frac{(15 - 16)^{2}}{16} = \frac{1}{16} & \frac{(5 - 8)^{2}}{8} = \frac{9}{8} & \frac{(20 - 16)^{2}}{16} = \frac{16}{16} \\ T_{2} & \frac{(20 - 24)^{2}}{24} = \frac{16}{24} & \frac{(10 - 12)^{2}}{12} = \frac{4}{12} & \frac{(30 - 24)^{2}}{24} = \frac{36}{24} \\ T_{3} & \frac{(25 - 20)^{2}}{20} = \frac{25}{20} & \frac{(15 - 10)^{2}}{10} = \frac{25}{10} & \frac{(10 - 20)^{2}}{20} = \frac{100}{20} \end{array}

Calculate each term and sum them up:

χ^{2} = \frac{1}{16} + \frac{9}{8} + \frac{16}{16} + \frac{16}{24} + \frac{4}{12} + \frac{36}{24} + \frac{25}{20} + \frac{25}{10} + \frac{100}{20}

χ^{2} = 0.0625 + 1.125 + 1 + 0.6667 + 0.3333 + 1.5 + 1.25 + 2.5 + 5

χ^{2} = 13.4375

Degrees of Freedom:

df = (r - 1) (c - 1) = (3 - 1) (3 - 1) = 2 \times 2 = 4

Critical Value and Conclusion:

At the

5 %

level of significance and

4

degrees of freedom, the critical value from the Chi-Square distribution table is approximately

9.488

Since

χ^{2} = 13.4375

is greater than the critical value

9.488

, we reject the null hypothesis

H_{0}

Conclusion:

There is sufficient evidence to conclude that the tools and materials used are not independent at the

5 %

level of significance.

(b) Explain the general procedure of testing of hypothesis.

Answer:

Testing a hypothesis is a statistical method that uses sample data to evaluate a hypothesis about a population parameter. The general procedure involves several steps, which are outlined below:

1. Formulate Hypotheses

Null Hypothesis ( $H_{0}$ ): This is the hypothesis that there is no effect or no difference, and it represents the status quo. For example, $H_{0} : μ = μ_{0}$ .
Alternative Hypothesis ( $H_{1}$ or $H_{a}$ ): This is what you want to prove. It represents a change, effect, or difference. For example, $H_{1} : μ \neq μ_{0}$ , $H_{1} : μ > μ_{0}$ , or $H_{1} : μ < μ_{0}$ .

2. Choose the Significance Level ( $α$ )

The significance level is the probability of rejecting the null hypothesis when it is actually true (Type I error). Common values are 0.05, 0.01, and 0.10.

3. Select the Appropriate Test Statistic

The test statistic is a standardized value that is calculated from sample data during a hypothesis test. Depending on the nature of the data and the hypothesis, this could be a Z-test, t-test, chi-square test, F-test, etc.

4. Formulate the Decision Rule

The decision rule involves determining the critical value(s) from the statistical distribution of the test statistic (e.g., normal distribution, t-distribution, chi-square distribution) that correspond to the chosen significance level. This critical value(s) define the rejection region(s).

5. Collect Data and Compute the Test Statistic

Gather the sample data and compute the value of the test statistic based on the sample data.

6. Make a Decision

Compare the test statistic to the critical value(s):
- If the test statistic falls within the rejection region, reject the null hypothesis ( $H_{0}$ ).
- If the test statistic does not fall within the rejection region, do not reject the null hypothesis.

7. Draw a Conclusion

Based on the decision made in the previous step, interpret the results in the context of the research question or problem.

Example

Step 1: Formulate Hypotheses

$H_{0}$ : The mean systolic blood pressure ( $μ$ ) is 120 mmHg.
$H_{1}$ : The mean systolic blood pressure ( $μ$ ) is not 120 mmHg.

Step 2: Choose the Significance Level

$α = 0.05$

Step 3: Select the Appropriate Test Statistic

Assume the sample size is large and the population standard deviation is known. Use a Z-test.

Step 4: Formulate the Decision Rule

For a two-tailed test with $α = 0.05$ , the critical values are $\pm 1.96$ (from the standard normal distribution).

Step 5: Collect Data and Compute the Test Statistic

Suppose we collect a sample of 30 individuals with a sample mean ( $\bar{x}$ ) of 123 mmHg and a population standard deviation ( $σ$ ) of 10 mmHg.
Compute the Z-score: $Z = \frac{\bar{x} - μ_{0}}{σ / \sqrt{n}} = \frac{123 - 120}{10 / \sqrt{30}} \approx 1.64$

Step 6: Make a Decision

The test statistic (1.64) is not greater than 1.96 or less than -1.96, so it does not fall within the rejection region.

Step 7: Draw a Conclusion

Do not reject the null hypothesis. There is not enough evidence to conclude that the mean systolic blood pressure is different from 120 mmHg at the 5% significance level.

This general procedure provides a structured approach to hypothesis testing, ensuring that conclusions drawn from the sample data are statistically valid and reliable.

Question:-04

4.(a) A random sample of 15 stores was taken to analyse the sales of mobiles during last month. The correlation coefficient between sales and expenditure on advertisement was found to be 0.68 . Assuming that sales and expenditure on advertisement follow normal distribution, then test if these two are positively correlated at

1 %

level of significance.

Answer:

To test if the sales and expenditure on advertisement are positively correlated at the

1 %

level of significance, we can perform a hypothesis test for the population correlation coefficient

ρ

. Here’s the step-by-step procedure:

1. Formulate Hypotheses

Null Hypothesis ( $H_{0}$ ): There is no positive correlation between sales and expenditure on advertisement ( $ρ \leq 0$ ).
Alternative Hypothesis ( $H_{1}$ ): There is a positive correlation between sales and expenditure on advertisement ( $ρ > 0$ ).

2. Choose the Significance Level

$α = 0.01$ (1%)

3. Select the Test Statistic

For testing the significance of the sample correlation coefficient

r

, we use the t-distribution with

n - 2

degrees of freedom, where

n

is the sample size. The test statistic

t

is given by:

t = \frac{r \sqrt{n - 2}}{\sqrt{1 - r^{2}}}

4. Formulate the Decision Rule

Determine the critical value $t_{α, n - 2}$ from the t-distribution table for $n - 2$ degrees of freedom. For a one-tailed test at $α = 0.01$ and $n - 2 = 13$ degrees of freedom, the critical value can be found in t-distribution tables or using statistical software.

5. Collect Data and Compute the Test Statistic

Sample size $n = 15$
Sample correlation coefficient $r = 0.68$

Calculate the test statistic:

t = \frac{0.68 \sqrt{15 - 2}}{\sqrt{1 - {0.68}^{2}}}

t = \frac{0.68 \sqrt{13}}{\sqrt{1 - 0.4624}}

t = \frac{0.68 \sqrt{13}}{\sqrt{0.5376}}

t = \frac{0.68 \times 3.605}{0.7333}

t \approx 3.34

6. Determine the Critical Value

Using a t-distribution table, we look up the critical value for a one-tailed test at

α = 0.01

with 13 degrees of freedom. The critical value

t_{0.01, 13}

is approximately 2.650.

7. Make a Decision

Compare the computed test statistic with the critical value:
- If $t > t_{α, n - 2}$ , reject the null hypothesis.
- If $t \leq t_{α, n - 2}$ , do not reject the null hypothesis.

In this case,

t \approx 3.34

is greater than the critical value 2.650.

8. Draw a Conclusion

Since the test statistic

t \approx 3.34

is greater than the critical value 2.650, we reject the null hypothesis

H_{0}

at the

1 %

level of significance.

Conclusion

There is sufficient evidence at the

1 %

level of significance to conclude that there is a positive correlation between sales and expenditure on advertisement.

(b) An electric equipment manufacturing company claims that at most

10 %

of its products are defective. A store wants to purchase its products but before that they decided to test a sample of 200. If there are 30 defective products among these 200, can we agree with the manufacturer’s claim at

1 %

level of significance?

Answer:

To test the manufacturer’s claim that at most

10 %

of its products are defective, we can perform a hypothesis test for a population proportion. Here are the steps:

1. Formulate Hypotheses

Null Hypothesis ( $H_{0}$ ): The proportion of defective products is at most $10 %$ ( $p \leq 0.10$ ).
Alternative Hypothesis ( $H_{1}$ ): The proportion of defective products is greater than $10 %$ ( $p > 0.10$ ).

2. Choose the Significance Level

$α = 0.01$ (1%)

3. Select the Test Statistic

For testing the population proportion, we use the z-test for proportions. The test statistic

z

is given by:

z = \frac{\hat{p} - p_{0}}{\sqrt{\frac{p_{0} (1 - p_{0})}{n}}}

where:

$\hat{p}$ is the sample proportion of defectives.
$p_{0}$ is the claimed population proportion.
$n$ is the sample size.

4. Formulate the Decision Rule

Determine the critical value $z_{α}$ from the standard normal distribution for a one-tailed test at $α = 0.01$ . The critical value $z_{0.01}$ is approximately 2.33.

5. Collect Data and Compute the Test Statistic

Sample size $n = 200$
Number of defective products in the sample = 30
Sample proportion $\hat{p} = \frac{30}{200} = 0.15$
Claimed population proportion $p_{0} = 0.10$

Calculate the test statistic:

z = \frac{\hat{p} - p_{0}}{\sqrt{\frac{p_{0} (1 - p_{0})}{n}}}

z = \frac{0.15 - 0.10}{\sqrt{\frac{0.10 (1 - 0.10)}{200}}}

z = \frac{0.05}{\sqrt{\frac{0.10 \times 0.90}{200}}}

z = \frac{0.05}{\sqrt{\frac{0.09}{200}}}

z = \frac{0.05}{\sqrt{0.00045}}

z = \frac{0.05}{0.0212}

z \approx 2.36

6. Make a Decision

Compare the computed test statistic with the critical value:
- If $z > z_{α}$ , reject the null hypothesis.
- If $z \leq z_{α}$ , do not reject the null hypothesis.

In this case,

z \approx 2.36

is greater than the critical value 2.33.

7. Draw a Conclusion

Since the test statistic

z \approx 2.36

is greater than the critical value 2.33, we reject the null hypothesis

H_{0}

at the

1 %

level of significance.

Conclusion

There is sufficient evidence at the

1 %

level of significance to reject the manufacturer’s claim that at most

10 %

of its products are defective. Therefore, we cannot agree with the manufacturer’s claim based on this sample.

Question:-05

5.(a) An experiment was conducted to compare the defective items produced by two different machines

A

and

B

. The data on number of defective items produced by the machines were observed and given in the table as follows :

A	B
26	19
37	22
40	24
35	27
30	24
30	18
40	20
26	19
30	25
35
45

Obtain

95 %

confidence interval for variance ratio of the number of defective items produced by machines

A

and

B

, respectively.

Answer:

To obtain the

95 %

confidence interval for the variance ratio of the number of defective items produced by machines A and B, we need to calculate the variances of the samples from both machines and then use the F-distribution.

Step 1: Calculate Sample Variances

First, let’s calculate the sample variances for both machines A and B.

Machine A

The data for machine A is:

26, 37, 40, 35, 30, 30, 40, 26, 30, 35, 45

Calculate the mean ( ${\bar{X}}_{A}$ ) of the data for machine A.

{\bar{X}}_{A} = \frac{26 + 37 + 40 + 35 + 30 + 30 + 40 + 26 + 30 + 35 + 45}{11} = \frac{374}{11} \approx 34

Calculate the sample variance ( $S_{A}^{2}$ ).

S_{A}^{2} = \frac{\sum (X_{i} - {\bar{X}}_{A})^{2}}{n_{A} - 1}

S_{A}^{2} = \frac{(26 - 34)^{2} + (37 - 34)^{2} + (40 - 34)^{2} + (35 - 34)^{2} + (30 - 34)^{2} + (30 - 34)^{2} + (40 - 34)^{2} + (26 - 34)^{2} + (30 - 34)^{2} + (35 - 34)^{2} + (45 - 34)^{2}}{11 - 1}

S_{A}^{2} = \frac{(64 + 9 + 36 + 1 + 16 + 16 + 36 + 64 + 16 + 1 + 121)}{10}

S_{A}^{2} = \frac{379}{10} = 37.9

Machine B

The data for machine B is:

19, 22, 24, 27, 24, 18, 20, 19, 25

Calculate the mean ( ${\bar{X}}_{B}$ ) of the data for machine B.

{\bar{X}}_{B} = \frac{19 + 22 + 24 + 27 + 24 + 18 + 20 + 19 + 25}{9} = \frac{198}{9} \approx 22

Calculate the sample variance ( $S_{B}^{2}$ ).

S_{B}^{2} = \frac{\sum (X_{i} - {\bar{X}}_{B})^{2}}{n_{B} - 1}

S_{B}^{2} = \frac{(19 - 22)^{2} + (22 - 22)^{2} + (24 - 22)^{2} + (27 - 22)^{2} + (24 - 22)^{2} + (18 - 22)^{2} + (20 - 22)^{2} + (19 - 22)^{2} + (25 - 22)^{2}}{9 - 1}

S_{B}^{2} = \frac{(9 + 0 + 4 + 25 + 4 + 16 + 4 + 9 + 9)}{8}

S_{B}^{2} = \frac{80}{8} = 10

Step 2: Calculate the Variance Ratio

The variance ratio

F

is:

F = \frac{S_{A}^{2}}{S_{B}^{2}} = \frac{37.9}{10} \approx 3.79

Step 3: Find the Critical Values for the F-distribution

To find the

95 %

confidence interval for the variance ratio, we need to use the F-distribution with degrees of freedom

d f_{1} = n_{A} - 1 = 10

and

d f_{2} = n_{B} - 1 = 8

Using an F-distribution table or calculator, we find the critical values

F_{0.025, 10, 8}

and

F_{0.975, 10, 8}

F_{0.025, 10, 8} \approx 4.82 (upper critical value)

F_{0.975, 10, 8} \approx 0.23 (lower critical value)

Step 4: Calculate the Confidence Interval

The

95 %

confidence interval for the variance ratio is given by:

(\frac{S_{A}^{2}}{S_{B}^{2}} \cdot \frac{1}{F_{0.025, 10, 8}}, \frac{S_{A}^{2}}{S_{B}^{2}} \cdot F_{0.975, 10, 8})

(\frac{37.9}{10} \cdot \frac{1}{4.82}, \frac{37.9}{10} \cdot 4.82)

(3.79 \cdot \frac{1}{4.82}, 3.79 \cdot 4.82)

(\frac{3.79}{4.82}, 3.79 \cdot 4.82)

(0.786, 18.27)

Conclusion

The

95 %

confidence interval for the variance ratio of the number of defective items produced by machines A and B is approximately

(0.786, 18.27)

(b) Write four differences between parametric and non-parametric tests.

Answer:

Here are four key differences between parametric and non-parametric tests:

1. Assumptions about the Population Distribution

Parametric Tests:
- These tests assume that the data follows a certain distribution, usually a normal distribution.
- Examples include the t-test and ANOVA, which assume the data is normally distributed.
Non-Parametric Tests:
- These tests do not assume a specific distribution for the data.
- They are often used when the data does not meet the assumptions of parametric tests or when dealing with ordinal data or ranks.
- Examples include the Mann-Whitney U test and the Kruskal-Wallis test.

2. Data Type

Parametric Tests:
- Typically used for interval or ratio data, which are numerical and can be meaningfully averaged.
- The data should be measured on a continuous scale and have meaningful zero points.
Non-Parametric Tests:
- Suitable for nominal or ordinal data, which may include ranks or categories without a meaningful average.
- They can also be used for interval or ratio data that do not meet the assumptions required for parametric tests.

3. Robustness to Outliers and Non-Normality

Parametric Tests:
- Less robust to outliers and non-normal distributions. Outliers can significantly affect the results of these tests.
- They rely on specific assumptions about the population distribution, and violations of these assumptions can lead to incorrect conclusions.
Non-Parametric Tests:
- More robust to outliers and non-normality. They make fewer assumptions about the underlying data distribution.
- These tests are based on ranks or medians, which are less affected by extreme values and skewed distributions.

4. Efficiency

Parametric Tests:
- Generally more powerful and efficient when the assumptions about the data are met. They can detect smaller differences or effects with a given sample size.
- They make full use of the data by considering the exact values of observations.
Non-Parametric Tests:
- Generally less powerful than parametric tests when the data actually meets the assumptions of parametric tests.
- They may require a larger sample size to achieve the same power as parametric tests because they use ranks or categories rather than the actual data values.

These differences highlight the contexts in which each type of test is appropriate and the trade-offs involved in choosing between parametric and non-parametric methods.

Question:-06

6.(a) The length of a steel rod is distributed normally with mean 12 metre and standard deviation 0.1 metre. For a random sample of size 10 , find :
(i) Mean and variance of the sampling distribution of mean.

(ii) The probability that the sample mean lies between 11.94 metre and 12.06 metre.

Answer:

Let’s address each part of the problem step by step.

Given Data:

Population mean ( $μ$ ) = 12 meters
Population standard deviation ( $σ$ ) = 0.1 meters
Sample size ( $n$ ) = 10

(i) Mean and Variance of the Sampling Distribution of the Mean

The sampling distribution of the sample mean

\bar{X}

for a normally distributed population is also normally distributed with:

Mean: $μ_{\bar{X}} = μ$
Variance: $σ_{\bar{X}}^{2} = \frac{σ^{2}}{n}$

Mean of the Sampling Distribution:

μ_{\bar{X}} = 12 meters

Variance of the Sampling Distribution:

σ_{\bar{X}}^{2} = \frac{σ^{2}}{n} = \frac{(0.1)^{2}}{10} = \frac{0.01}{10} = 0.001 square meters

Standard Deviation of the Sampling Distribution:

σ_{\bar{X}} = \sqrt{σ_{\bar{X}}^{2}} = \sqrt{0.001} = 0.0316 meters

(ii) The Probability that the Sample Mean Lies Between 11.94 meters and 12.06 meters

To find this probability, we use the Z-score formula for the sampling distribution of the mean:

Z = \frac{\bar{X} - μ_{\bar{X}}}{σ_{\bar{X}}}

First, calculate the Z-scores for 11.94 meters and 12.06 meters.

For

\bar{X} = 11.94

Z_{11.94} = \frac{11.94 - 12}{0.0316} = \frac{- 0.06}{0.0316} \approx - 1.90

For

\bar{X} = 12.06

Z_{12.06} = \frac{12.06 - 12}{0.0316} = \frac{0.06}{0.0316} \approx 1.90

Next, we find the probability corresponding to these Z-scores using the standard normal distribution table (or a calculator).

Using standard normal distribution tables or software:

P (Z \leq 1.90) \approx 0.9713

P (Z \leq - 1.90) \approx 0.0287

The probability that the sample mean lies between 11.94 meters and 12.06 meters is the difference between these two probabilities:

P (11.94 \leq \bar{X} \leq 12.06) = P (Z \leq 1.90) - P (Z \leq - 1.90) = 0.9713 - 0.0287 = 0.9426

Summary

Mean of the sampling distribution of the mean: 12 meters
Variance of the sampling distribution of the mean: 0.001 square meters
Probability that the sample mean lies between 11.94 meters and 12.06 meters: $0.9426$ or $94.26 %$

(b) The reduction of weight (in

k g

) after a dietplan are recorded as follows :

6.5, 7.7, 5.6, 7.3, 6.7, 7.8, 6.7, 6.2, 5.2, 6.6

, 6.0, 7.0,

7.2, 6.8

and 7.2 .
It is observed that reduction in weight follows an exponential distribution with parameter

θ

whose pdf is given by :

f (x) = \frac{1}{θ} e^{- x / θ}; x \geq 0, θ > 0

(i) Find the maximum likelihood estimator of the parameter

θ

(ii) Determine the maximum likelihood estimate of

θ

on the basis of the given data.

Answer:

To find the maximum likelihood estimator (MLE) and the maximum likelihood estimate of the parameter

θ

for the given exponential distribution, we will follow these steps:

Given Data

Reduction in weight (in kg):

6.5, 7.7, 5.6, 7.3, 6.7, 7.8, 6.7, 6.2, 5.2, 6.6, 6.0, 7.0, 7.2, 6.8, 7.2

Probability Density Function (PDF) of Exponential Distribution

f (x) = \frac{1}{θ} e^{- x / θ}, x \geq 0, θ > 0

(i) Find the Maximum Likelihood Estimator (MLE) of the Parameter $θ$

To find the MLE of

θ

, we need to set up the likelihood function and then maximize it.

The likelihood function

L (θ)

for the exponential distribution is:

L (θ) = \prod_{i = 1}^{n} f (x_{i}) = \prod_{i = 1}^{n} (\frac{1}{θ} e^{- x_{i} / θ})

Given the data points

x_{1}, x_{2}, \dots, x_{n}

, the likelihood function becomes:

L (θ) = {(\frac{1}{θ})}^{n} \exp (- \frac{\sum_{i = 1}^{n} x_{i}}{θ})

The log-likelihood function

ℓ (θ)

is:

ℓ (θ) = \log L (θ) = n \log (\frac{1}{θ}) - \frac{\sum_{i = 1}^{n} x_{i}}{θ}

ℓ (θ) = - n \log (θ) - \frac{\sum_{i = 1}^{n} x_{i}}{θ}

To find the MLE, we take the derivative of the log-likelihood function with respect to

θ

and set it to zero:

\frac{d ℓ (θ)}{d θ} = - \frac{n}{θ} + \frac{\sum_{i = 1}^{n} x_{i}}{θ^{2}} = 0

Solving for

θ

- \frac{n}{θ} + \frac{\sum_{i = 1}^{n} x_{i}}{θ^{2}} = 0

\frac{\sum_{i = 1}^{n} x_{i}}{θ^{2}} = \frac{n}{θ}

\sum_{i = 1}^{n} x_{i} = n θ

θ = \frac{\sum_{i = 1}^{n} x_{i}}{n}

Thus, the MLE of

θ

is the sample mean:

\hat{θ} = \bar{x} = \frac{\sum_{i = 1}^{n} x_{i}}{n}

(ii) Determine the Maximum Likelihood Estimate of $θ$ on the Basis of the Given Data

First, we calculate the sample mean

\bar{x}

Given the data points:

6.5, 7.7, 5.6, 7.3, 6.7, 7.8, 6.7, 6.2, 5.2, 6.6, 6.0, 7.0, 7.2, 6.8, 7.2

Sum of the data points:

\sum_{i = 1}^{15} x_{i} = 6.5 + 7.7 + 5.6 + 7.3 + 6.7 + 7.8 + 6.7 + 6.2 + 5.2 + 6.6 + 6.0 + 7.0 + 7.2 + 6.8 + 7.2

\sum_{i = 1}^{15} x_{i} = 102.5

Sample mean:

\bar{x} = \frac{102.5}{15} \approx 6.833

Thus, the maximum likelihood estimate of

θ

is:

\hat{θ} \approx 6.833

Summary

The maximum likelihood estimator (MLE) of the parameter $θ$ is $\hat{θ} = \bar{x}$ .
The maximum likelihood estimate of $θ$ based on the given data is approximately 6.833.

Question:-07

7.(a) Explain the properties of good estimator with examples.

Answer:

In statistics, a good estimator should possess several key properties to ensure that it provides accurate and reliable estimates of the population parameters. These properties are:

1. Unbiasedness

An estimator is unbiased if the expected value of the estimator is equal to the true value of the population parameter. In other words, on average, the estimator hits the true parameter value.

Example:

The sample mean $\bar{X}$ is an unbiased estimator of the population mean $μ$ , since $E (\bar{X}) = μ$ .

2. Consistency

An estimator is consistent if, as the sample size increases, the estimator converges in probability to the true value of the population parameter. This means that with larger samples, the estimator becomes more accurate.

Example:

The sample mean $\bar{X}$ is a consistent estimator of the population mean $μ$ . As the sample size $n$ increases, $\bar{X}$ gets closer to $μ$ .

3. Efficiency

An estimator is efficient if it has the smallest variance among all unbiased estimators of the parameter. Efficiency is often measured by the Mean Squared Error (MSE), which is the sum of the variance and the square of the bias. An efficient estimator has the smallest MSE.

Example:

Among all unbiased estimators of the population mean $μ$ , the sample mean $\bar{X}$ is the most efficient, meaning it has the smallest variance.

4. Sufficiency

An estimator is sufficient if it uses all the information in the data about the parameter. A sufficient estimator captures all the information that the sample provides about the population parameter, leaving no "leftover" information.

Example:

The sample mean $\bar{X}$ and sample variance $S^{2}$ are jointly sufficient for the parameters $μ$ and $σ^{2}$ of a normal distribution.

5. Robustness

An estimator is robust if it is not unduly affected by small deviations from the assumptions (e.g., normality) or by outliers. Robust estimators provide reliable estimates even when the data has some level of contamination or when the assumptions are only approximately met.

Example:

The sample median is a robust estimator of the population median, as it is less affected by outliers compared to the sample mean.

Examples of Good Estimators

Example 1: Sample Mean

Unbiasedness: $E (\bar{X}) = μ$ .
Consistency: As $n \to \infty$ , $\bar{X} \to μ$ .
Efficiency: Among all unbiased estimators of $μ$ , $\bar{X}$ has the smallest variance.
Sufficiency: For a normally distributed population, $\bar{X}$ is a sufficient estimator of $μ$ .

Example 2: Sample Variance

Unbiasedness: $E (S^{2}) = σ^{2}$ .
Consistency: As $n \to \infty$ , $S^{2} \to σ^{2}$ .
Efficiency: In a normal distribution, $S^{2}$ is the most efficient unbiased estimator of $σ^{2}$ .

Conclusion

A good estimator should be unbiased, consistent, efficient, sufficient, and robust. These properties ensure that the estimator provides accurate, reliable, and useful estimates of the population parameters based on sample data. Understanding these properties helps statisticians and researchers choose the appropriate estimators for their analyses.

(b) The measurements of length (in

c m

) of a random sample of 10 boxes are given as follows :
20.2, 24.1, 21.3, 17.2, 19.8, 16.5, 21.8, 18.7, 17.1 and 19.9.
Use suitable test to test the hypothesis that the sample is taken from a population which is symmetrical about

18 c m

against the alternative that symmetry is about the point which is greater than

18 c m

5 %

level of significance.

Answer:

To test the hypothesis that the sample is taken from a population symmetrical about 18 cm against the alternative that symmetry is about a point greater than 18 cm, we can use the one-sample Wilcoxon signed-rank test. This non-parametric test is suitable for testing the median (or symmetry) of a population when the population distribution is not assumed to be normal.

Given Data:

Measurements of lengths (in cm): 20.2, 24.1, 21.3, 17.2, 19.8, 16.5, 21.8, 18.7, 17.1, 19.9

Hypotheses:

Null Hypothesis ( $H_{0}$ ): The sample is taken from a population symmetrical about 18 cm (the median is 18 cm).
Alternative Hypothesis ( $H_{1}$ ): The sample is taken from a population symmetrical about a point greater than 18 cm (the median is greater than 18 cm).

Significance Level:

α = 0.05

Step-by-Step Procedure:

Calculate the Differences from the Hypothesized Median (18 cm):

D_{i} = X_{i} - 18

\begin{array}{ccc} Length (cm) & D_{i} = X_{i} - 18 & Absolute D_{i} \\ 20.2 & 20.2 - 18 = 2.2 & 2.2 \\ 24.1 & 24.1 - 18 = 6.1 & 6.1 \\ 21.3 & 21.3 - 18 = 3.3 & 3.3 \\ 17.2 & 17.2 - 18 = - 0.8 & 0.8 \\ 19.8 & 19.8 - 18 = 1.8 & 1.8 \\ 16.5 & 16.5 - 18 = - 1.5 & 1.5 \\ 21.8 & 21.8 - 18 = 3.8 & 3.8 \\ 18.7 & 18.7 - 18 = 0.7 & 0.7 \\ 17.1 & 17.1 - 18 = - 0.9 & 0.9 \\ 19.9 & 19.9 - 18 = 1.9 & 1.9 \end{array}

Rank the Absolute Differences (ignoring zero differences, if any):

\begin{array}{cccc} Length (cm) & D_{i} = X_{i} - 18 & Absolute D_{i} & Rank \\ 20.2 & 2.2 & 2.2 & 6 \\ 24.1 & 6.1 & 6.1 & 10 \\ 21.3 & 3.3 & 3.3 & 8 \\ 17.2 & - 0.8 & 0.8 & 2 \\ 19.8 & 1.8 & 1.8 & 5 \\ 16.5 & - 1.5 & 1.5 & 3 \\ 21.8 & 3.8 & 3.8 & 9 \\ 18.7 & 0.7 & 0.7 & 1 \\ 17.1 & - 0.9 & 0.9 & 4 \\ 19.9 & 1.9 & 1.9 & 7 \end{array}

Assign the Signs of the Original Differences to the Ranks:

\begin{array}{ccccc} Length (cm) & D_{i} = X_{i} - 18 & Absolute D_{i} & Rank & Signed Rank \\ 20.2 & 2.2 & 2.2 & 6 & 6 \\ 24.1 & 6.1 & 6.1 & 10 & 10 \\ 21.3 & 3.3 & 3.3 & 8 & 8 \\ 17.2 & - 0.8 & 0.8 & 2 & - 2 \\ 19.8 & 1.8 & 1.8 & 5 & 5 \\ 16.5 & - 1.5 & 1.5 & 3 & - 3 \\ 21.8 & 3.8 & 3.8 & 9 & 9 \\ 18.7 & 0.7 & 0.7 & 1 & 1 \\ 17.1 & - 0.9 & 0.9 & 4 & - 4 \\ 19.9 & 1.9 & 1.9 & 7 & 7 \end{array}

Calculate the Test Statistic:
Sum of positive ranks ( $W^{+}$ ):

W^{+} = 6 + 10 + 8 + 5 + 9 + 1 + 7 = 46

Sum of negative ranks (

W^{-}

W^{-} = 2 + 3 + 4 = 9

The Wilcoxon signed-rank test statistic

W

is the smaller of

W^{+}

and

W^{-}

W = 9

Determine the Critical Value:
For a one-tailed test at the $5 %$ significance level with $n = 10$ , we use the Wilcoxon signed-rank test table to find the critical value. For $n = 10$ and $α = 0.05$ , the critical value is $8$ .
Make the Decision:
Compare the test statistic $W$ with the critical value:

If $W \leq$ critical value, reject $H_{0}$ .
If $W >$ critical value, do not reject $H_{0}$ .

In this case,

W = 9

is greater than the critical value

8

Conclusion

We do not reject the null hypothesis. There is insufficient evidence at the

5 %

level of significance to conclude that the sample is taken from a population symmetrical about a point greater than 18 cm.

Get Instant Access

Frequently Asked Questions (FAQs)

How do I access the Complete Solution?

You can access the Complete Solution through our app, which can be downloaded using this link:

App Link

Simply click “Install” to download and install the app, and then follow the instructions to purchase the required assignment solution. Currently, the app is only available for Android devices. We are working on making the app available for iOS in the future, but it is not currently available for iOS devices.

Is this Complete Solution for IGNOU Assignments?

Yes, It is Complete Solution, a comprehensive solution to the assignments for IGNOU. Valid from January 1, 2023 to December 31, 2023.

Is the Complete Solution aligned with the IGNOU requirements?

Yes, the Complete Solution is aligned with the IGNOU requirements and has been solved accordingly.

Is the Complete Solution guaranteed error-free?

Yes, the Complete Solution is guaranteed to be error-free.The solutions are thoroughly researched and verified by subject matter experts to ensure their accuracy.

Can I access the Complete Solution anytime?

As of now, you have access to the Complete Solution for a period of 6 months after the date of purchase, which is sufficient to complete the assignment. However, we can extend the access period upon request. You can access the solution anytime through our app.

What if I need help with a specific assignment question?

The app provides complete solutions for all assignment questions. If you still need help, you can contact the support team for assistance at Whatsapp +91-9958288900

Can I access the educational materials on multiple devices?

No, access to the educational materials is limited to one device only, where you have first logged in. Logging in on multiple devices is not allowed and may result in the revocation of access to the educational materials.

How do I make a payment for accessing the solutions?

Payments can be made through various secure online payment methods available in the app.Your payment information is protected with industry-standard security measures to ensure its confidentiality and safety. You will receive a receipt for your payment through email or within the app, depending on your preference.

What are the instructions for formatting my assignments?

The instructions for formatting your assignments are detailed in the Assignment Booklet, which includes details on paper size, margins, precision, and submission requirements. It is important to strictly follow these instructions to facilitate evaluation and avoid delays.

Terms and Conditions

The educational materials provided in the app are the sole property of the app owner and are protected by copyright laws.
Reproduction, distribution, or sale of the educational materials without prior written consent from the app owner is strictly prohibited and may result in legal consequences.
Any attempt to modify, alter, or use the educational materials for commercial purposes is strictly prohibited.
The app owner reserves the right to revoke access to the educational materials at any time without notice for any violation of these terms and conditions.
The app owner is not responsible for any damages or losses resulting from the use of the educational materials.
The app owner reserves the right to modify these terms and conditions at any time without notice.
By accessing and using the app, you agree to abide by these terms and conditions.
Access to the educational materials is limited to one device only. Logging in to the app on multiple devices is not allowed and may result in the revocation of access to the educational materials.

Our educational materials are solely available on our website and application only. Users and students can report the dealing or selling of the copied version of our educational materials by any third party at our email address (abstract4math@gmail.com) or mobile no. (+91-9958288900).

In return, such users/students can expect free our educational materials/assignments and other benefits as a bonafide gesture which will be completely dependent upon our discretion.