MST-002 Dec 2023

MST-002 Dec 2023

Question:-01

1.State whether the following statements are True or False. Give reasons in support of your answers :
(a) If two variables are related in the form Y = X 2 Y = X 2 Y=X^(2)\mathrm{Y}=\mathrm{X}^2Y=X2, then the variables are highly linearly related.

Answer:

The statement "If two variables are related in the form Y = X 2 Y = X 2 Y=X^(2)Y = X^2Y=X2, then the variables are highly linearly related" is false.

Justification:

Definition of Linear Relationship:
A linear relationship between two variables X X XXX and Y Y YYY implies that the relationship can be described by an equation of the form:
Y = a X + b Y = a X + b Y=aX+bY = aX + bY=aX+b
where a a aaa and b b bbb are constants. This means that for every unit change in X X XXX, Y Y YYY changes by a constant amount a a aaa.
Nature of Y = X 2 Y = X 2 Y=X^(2)Y = X^2Y=X2:
The equation Y = X 2 Y = X 2 Y=X^(2)Y = X^2Y=X2 describes a quadratic relationship, not a linear one. This means that the relationship between X X XXX and Y Y YYY involves the square of X X XXX, rather than a constant multiple of X X XXX.
Non-linearity:
  1. Plot Analysis:
    • If you plot Y = X 2 Y = X 2 Y=X^(2)Y = X^2Y=X2 on a graph with X X XXX on the x-axis and Y Y YYY on the y-axis, you will get a parabolic curve that opens upwards. This is not a straight line, which would be characteristic of a linear relationship.
  2. Rate of Change:
    • In a linear relationship, the rate of change of Y Y YYY with respect to X X XXX is constant. However, in Y = X 2 Y = X 2 Y=X^(2)Y = X^2Y=X2, the rate of change (slope) of Y Y YYY is given by the derivative d Y d X = 2 X d Y d X = 2 X (dY)/(dX)=2X\frac{dY}{dX} = 2XdYdX=2X, which is not constant but varies with X X XXX.
Example to Illustrate Non-linearity:
Consider two points:
  • When X = 1 X = 1 X=1X = 1X=1, Y = 1 2 = 1 Y = 1 2 = 1 Y=1^(2)=1Y = 1^2 = 1Y=12=1
  • When X = 2 X = 2 X=2X = 2X=2, Y = 2 2 = 4 Y = 2 2 = 4 Y=2^(2)=4Y = 2^2 = 4Y=22=4
  • When X = 3 X = 3 X=3X = 3X=3, Y = 3 2 = 9 Y = 3 2 = 9 Y=3^(2)=9Y = 3^2 = 9Y=32=9
The differences in Y Y YYY for equal changes in X X XXX are not constant (from X = 1 X = 1 X=1X = 1X=1 to X = 2 X = 2 X=2X = 2X=2, Y Y YYY changes by 3, but from X = 2 X = 2 X=2X = 2X=2 to X = 3 X = 3 X=3X = 3X=3, Y Y YYY changes by 5), demonstrating a non-linear relationship.

Conclusion:

Since the relationship Y = X 2 Y = X 2 Y=X^(2)Y = X^2Y=X2 does not meet the criteria for linearity (a constant rate of change and a linear plot), the statement is false.

(b) In regression analysis, the two regression coefficients are -2 and 2 / 3 2 / 3 -2//3-2 / 32/3.

Answer:

The statement "In regression analysis, the two regression coefficients are -2 and 2 / 3 2 / 3 -2//3-2 / 32/3" is false.

Justification:

In regression analysis involving two variables X X XXX and Y Y YYY, there are typically two regression equations:
  1. The regression of Y Y YYY on X X XXX:
    Y = a + b X Y = a + b X Y=a+bXY = a + bXY=a+bX
    where b b bbb is the regression coefficient of Y Y YYY on X X XXX.
  2. The regression of X X XXX on Y Y YYY:
    X = c + d Y X = c + d Y X=c+dYX = c + dYX=c+dY
    where d d ddd is the regression coefficient of X X XXX on Y Y YYY.
Relationship Between Regression Coefficients:
There is a specific relationship between these two regression coefficients, b b bbb and d d ddd:
b × d = r 2 b × d = r 2 b xx d=r^(2)b \times d = r^2b×d=r2
where r r rrr is the correlation coefficient between X X XXX and Y Y YYY.

Proof Using Given Coefficients:

Given b = 2 b = 2 b=-2b = -2b=2 and d = 2 3 d = 2 3 d=-(2)/(3)d = -\frac{2}{3}d=23, let’s check the relationship:
b × d = ( 2 ) × ( 2 3 ) = 4 3 b × d = ( 2 ) × 2 3 = 4 3 b xx d=(-2)xx(-(2)/(3))=(4)/(3)b \times d = (-2) \times \left(-\frac{2}{3}\right) = \frac{4}{3}b×d=(2)×(23)=43
For this product to be valid, it must equal r 2 r 2 r^(2)r^2r2, where r r rrr is the correlation coefficient.

Constraints on r 2 r 2 r^(2)r^2r2:

The value of r r rrr (and thus r 2 r 2 r^(2)r^2r2) must lie within the interval [ 1 , 1 ] [ 1 , 1 ] [-1,1][-1, 1][1,1]:
0 r 2 1 0 r 2 1 0 <= r^(2) <= 10 \leq r^2 \leq 10r21
However, in this case:
r 2 = 4 3 r 2 = 4 3 r^(2)=(4)/(3)r^2 = \frac{4}{3}r2=43
This is not possible because r 2 r 2 r^(2)r^2r2 cannot exceed 1.

Conclusion:

Since the product b × d b × d b xx db \times db×d should equal r 2 r 2 r^(2)r^2r2 and r 2 r 2 r^(2)r^2r2 cannot be greater than 1, the given regression coefficients b = 2 b = 2 b=-2b = -2b=2 and d = 2 3 d = 2 3 d=-(2)/(3)d = -\frac{2}{3}d=23 cannot exist simultaneously in a valid regression model. Hence, the statement is false.

(c) Sum of deviations of the observations from their mean is zero.

Answer:

The statement "Sum of deviations of the observations from their mean is zero" is true.

Justification:

Definition of Mean:
The mean (average) of a set of observations x 1 , x 2 , , x n x 1 , x 2 , , x n x_(1),x_(2),dots,x_(n)x_1, x_2, \ldots, x_nx1,x2,,xn is given by:
x ¯ = 1 n i = 1 n x i x ¯ = 1 n i = 1 n x i bar(x)=(1)/(n)sum_(i=1)nx_(i)\bar{x} = \frac{1}{n} \sum_{i=1} {n} x_ix¯=1ni=1nxi
Deviation from the Mean:
The deviation of an observation x i x i x_(i)x_ixi from the mean x ¯ x ¯ bar(x)\bar{x}x¯ is given by:
x i x ¯ x i x ¯ x_(i)- bar(x)x_i – \bar{x}xix¯
Sum of Deviations:
The sum of the deviations of all observations from the mean is:
i = 1 n ( x i x ¯ ) i = 1 n ( x i x ¯ ) sum_(i=1)^(n)(x_(i)- bar(x))\sum_{i=1}^{n} (x_i – \bar{x})i=1n(xix¯)
We can expand this sum:
i = 1 n ( x i x ¯ ) = i = 1 n x i i = 1 n x ¯ i = 1 n ( x i x ¯ ) = i = 1 n x i i = 1 n x ¯ sum_(i=1)^(n)(x_(i)- bar(x))=sum_(i=1)^(n)x_(i)-sum_(i=1)^(n) bar(x)\sum_{i=1}^{n} (x_i – \bar{x}) = \sum_{i=1}^{n} x_i – \sum_{i=1}^{n} \bar{x}i=1n(xix¯)=i=1nxii=1nx¯
Since x ¯ x ¯ bar(x)\bar{x}x¯ is the mean, it is a constant value for all n n nnn observations. Therefore:
i = 1 n x ¯ = n x ¯ i = 1 n x ¯ = n x ¯ sum_(i=1)^(n) bar(x)=n* bar(x)\sum_{i=1}^{n} \bar{x} = n \cdot \bar{x}i=1nx¯=nx¯
So we can rewrite the sum of deviations as:
i = 1 n ( x i x ¯ ) = i = 1 n x i n x ¯ i = 1 n ( x i x ¯ ) = i = 1 n x i n x ¯ sum_(i=1)^(n)(x_(i)- bar(x))=sum_(i=1)^(n)x_(i)-n* bar(x)\sum_{i=1}^{n} (x_i – \bar{x}) = \sum_{i=1}^{n} x_i – n \cdot \bar{x}i=1n(xix¯)=i=1nxinx¯
But by the definition of the mean:
x ¯ = 1 n i = 1 n x i x ¯ = 1 n i = 1 n x i bar(x)=(1)/(n)sum_(i=1)^(n)x_(i)\bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_ix¯=1ni=1nxi
Multiplying both sides by n n nnn:
n x ¯ = i = 1 n x i n x ¯ = i = 1 n x i n* bar(x)=sum_(i=1)^(n)x_(i)n \cdot \bar{x} = \sum_{i=1}^{n} x_inx¯=i=1nxi
Substituting back into the sum of deviations:
i = 1 n ( x i x ¯ ) = i = 1 n x i i = 1 n x i = 0 i = 1 n ( x i x ¯ ) = i = 1 n x i i = 1 n x i = 0 sum_(i=1)^(n)(x_(i)- bar(x))=sum_(i=1)^(n)x_(i)-sum_(i=1)^(n)x_(i)=0\sum_{i=1}^{n} (x_i – \bar{x}) = \sum_{i=1}^{n} x_i – \sum_{i=1}^{n} x_i = 0i=1n(xix¯)=i=1nxii=1nxi=0

Conclusion:

The sum of the deviations of the observations from their mean is indeed zero. Hence, the statement is true.

(d) If the value of β 2 < 3 β 2 < 3 beta_(2) < 3\beta_2<3β2<3, then the curve is said to be leptokurtic.

Answer:

The statement "If the value of β 2 < 3 β 2 < 3 beta_(2) < 3\beta_2 < 3β2<3, then the curve is said to be leptokurtic" is false.

Justification:

Kurtosis Overview:
Kurtosis is a statistical measure that describes the distribution of data points in the tails relative to the overall shape of the distribution. It indicates the "tailedness" of the distribution. The excess kurtosis is often used to compare with the normal distribution’s kurtosis.
Beta Coefficient ( β 2 β 2 beta_(2)\beta_2β2):
  • β 2 β 2 beta_(2)\beta_2β2 is a measure of kurtosis. For a normal distribution, β 2 = 3 β 2 = 3 beta_(2)=3\beta_2 = 3β2=3.
Types of Kurtosis:
  1. Leptokurtic: Distributions with kurtosis greater than 3 ( β 2 > 3 β 2 > 3 beta_(2) > 3\beta_2 > 3β2>3). These distributions have fatter tails and a sharper peak compared to the normal distribution.
  2. Mesokurtic: Distributions with kurtosis equal to 3 ( β 2 = 3 β 2 = 3 beta_(2)=3\beta_2 = 3β2=3). The normal distribution is an example of a mesokurtic distribution.
  3. Platykurtic: Distributions with kurtosis less than 3 ( β 2 < 3 β 2 < 3 beta_(2) < 3\beta_2 < 3β2<3). These distributions have thinner tails and a flatter peak compared to the normal distribution.

Analysis of the Given Statement:

The statement asserts that if β 2 < 3 β 2 < 3 beta_(2) < 3\beta_2 < 3β2<3, the curve is leptokurtic. However, by the definitions provided:
  • If β 2 < 3 β 2 < 3 beta_(2) < 3\beta_2 < 3β2<3, the curve is actually platykurtic, not leptokurtic.
  • A leptokurtic curve would require β 2 > 3 β 2 > 3 beta_(2) > 3\beta_2 > 3β2>3.

Conclusion:

Given the definitions, the correct classification for β 2 < 3 β 2 < 3 beta_(2) < 3\beta_2 < 3β2<3 is platykurtic, not leptokurtic. Therefore, the statement is false.

(e) In a company of 1000 persons, 750 were male out of whom 530 were married. Among females, the number of married ones was 350 , then the data is consistent.

Answer:

To determine whether the given data is consistent, we need to verify if all the numbers provided logically add up without any contradictions.

Given Data:

  • Total number of persons in the company: 1000
  • Number of males: 750
  • Number of married males: 530
  • Number of females: (Total persons – Number of males) = 1000 – 750 = 250
  • Number of married females: 350

Consistency Check:

The main point of inconsistency would be in the number of married persons. Specifically, if the number of married males and married females exceeds the total population, the data is inconsistent.
  1. Number of Married Persons:
    • Married males: 530
    • Married females: 350
    • Total number of married persons = Married males + Married females = 530 + 350 = 880
  2. Total Population:
    • Total persons in the company = 1000
The total number of married persons (880) is less than the total population (1000), which is logically possible. Thus, there is no immediate contradiction from this calculation alone. However, a closer look at the gender-wise distribution of the unmarried persons should be considered to fully confirm consistency.

Calculation of Unmarried Persons:

  1. Unmarried Males:
    • Total males = 750
    • Married males = 530
    • Unmarried males = 750 – 530 = 220
  2. Unmarried Females:
    • Total females = 250
    • Married females = 350
    • This presents an inconsistency because the number of married females (350) cannot exceed the total number of females (250).

Conclusion:

The data given is inconsistent because the number of married females (350) exceeds the total number of females (250). This contradiction indicates a clear error in the provided information. Therefore, the statement that the data is consistent is false.

Question:-02

2.(a) A candidate obtained the following percentage of marks in different courses of PGDAST programme :
MST-001-46%
MST-002-67%
MST-003-72%
MST-004-58%
MST-005-53%
It is agreed to give double weights to marks in MST-001 and MST-002 as compared to other courses. What is the simple mean and weighted mean?

Answer:

To calculate both the simple mean and the weighted mean of the percentages obtained by the candidate in the different courses, we need to follow these steps:

Step 1: List the percentages obtained

  • MST-001: 46%
  • MST-002: 67%
  • MST-003: 72%
  • MST-004: 58%
  • MST-005: 53%

Step 2: Calculate the Simple Mean

The simple mean is the average of all the percentages. It is calculated by summing up all the percentages and then dividing by the number of courses.
Simple Mean = Percentages Number of Courses Simple Mean = Percentages Number of Courses “Simple Mean”=(sum”Percentages”)/(“Number of Courses”)\text{Simple Mean} = \frac{\sum \text{Percentages}}{\text{Number of Courses}}Simple Mean=PercentagesNumber of Courses
Simple Mean = 46 + 67 + 72 + 58 + 53 5 Simple Mean = 46 + 67 + 72 + 58 + 53 5 “Simple Mean”=(46+67+72+58+53)/(5)\text{Simple Mean} = \frac{46 + 67 + 72 + 58 + 53}{5}Simple Mean=46+67+72+58+535
Simple Mean = 296 5 Simple Mean = 296 5 “Simple Mean”=(296)/(5)\text{Simple Mean} = \frac{296}{5}Simple Mean=2965
Simple Mean = 59.2 % Simple Mean = 59.2 % “Simple Mean”=59.2%\text{Simple Mean} = 59.2 \%Simple Mean=59.2%

Step 3: Calculate the Weighted Mean

Given that MST-001 and MST-002 have double weights compared to the other courses, we need to assign appropriate weights. Let’s denote the weight for MST-001 and MST-002 as w w www and for the other courses as w / 2 w / 2 w//2w/2w/2.
We can arbitrarily set w = 2 w = 2 w=2w = 2w=2 for simplicity. Therefore:
  • Weight for MST-001: 2
  • Weight for MST-002: 2
  • Weight for MST-003: 1
  • Weight for MST-004: 1
  • Weight for MST-005: 1
Now, calculate the weighted mean using these weights:
Weighted Mean = ( Percentage × Weight ) Weights Weighted Mean = ( Percentage × Weight ) Weights “Weighted Mean”=(sum(“Percentage”xx”Weight”))/(sum”Weights”)\text{Weighted Mean} = \frac{\sum (\text{Percentage} \times \text{Weight})}{\sum \text{Weights}}Weighted Mean=(Percentage×Weight)Weights
Weighted Mean = ( 46 × 2 ) + ( 67 × 2 ) + ( 72 × 1 ) + ( 58 × 1 ) + ( 53 × 1 ) 2 + 2 + 1 + 1 + 1 Weighted Mean = ( 46 × 2 ) + ( 67 × 2 ) + ( 72 × 1 ) + ( 58 × 1 ) + ( 53 × 1 ) 2 + 2 + 1 + 1 + 1 “Weighted Mean”=((46 xx2)+(67 xx2)+(72 xx1)+(58 xx1)+(53 xx1))/(2+2+1+1+1)\text{Weighted Mean} = \frac{(46 \times 2) + (67 \times 2) + (72 \times 1) + (58 \times 1) + (53 \times 1)}{2 + 2 + 1 + 1 + 1}Weighted Mean=(46×2)+(67×2)+(72×1)+(58×1)+(53×1)2+2+1+1+1
Weighted Mean = ( 92 + 134 + 72 + 58 + 53 ) 7 Weighted Mean = ( 92 + 134 + 72 + 58 + 53 ) 7 “Weighted Mean”=((92+134+72+58+53))/(7)\text{Weighted Mean} = \frac{(92 + 134 + 72 + 58 + 53)}{7}Weighted Mean=(92+134+72+58+53)7
Weighted Mean = 409 7 Weighted Mean = 409 7 “Weighted Mean”=(409)/(7)\text{Weighted Mean} = \frac{409}{7}Weighted Mean=4097
Weighted Mean = 58.4286 % Weighted Mean = 58.4286 % “Weighted Mean”=58.4286%\text{Weighted Mean} = 58.4286\%Weighted Mean=58.4286%

Conclusion:

  • Simple Mean: 59.2 % 59.2 % 59.2%59.2\%59.2%
  • Weighted Mean: 58.4286 % 58.4286 % 58.4286%58.4286\%58.4286%

(b) For two Firms A and B, the following details are available :
A B
No. of employees 100 200
Average salary 16000 18000
SD of salary 16 18
A B No. of employees 100 200 Average salary 16000 18000 SD of salary 16 18| | A | B | | :— | :—: | :—: | | No. of employees | 100 | 200 | | Average salary | 16000 | 18000 | | SD of salary | 16 | 18 |
Compute the following :
(i) Which Firm pays larger package of salary?
(ii) Which Firm shows greater variability in the distribution of salary?
(iii) Compute the combined average salary and combined variance of both firms.

Answer:

Let’s address each of the questions step by step:

Given Data:

  • Firm A:
    • Number of employees: n A = 100 n A = 100 n_(A)=100n_A = 100nA=100
    • Average salary: μ A = 16000 μ A = 16000 mu _(A)=16000\mu_A = 16000μA=16000
    • Standard deviation of salary: σ A = 16 σ A = 16 sigma _(A)=16\sigma_A = 16σA=16
  • Firm B:
    • Number of employees: n B = 200 n B = 200 n_(B)=200n_B = 200nB=200
    • Average salary: μ B = 18000 μ B = 18000 mu _(B)=18000\mu_B = 18000μB=18000
    • Standard deviation of salary: σ B = 18 σ B = 18 sigma _(B)=18\sigma_B = 18σB=18

(i) Which Firm Pays a Larger Package of Salary?

The larger package of salary refers to the average salary. Comparing the average salaries of both firms:
  • Firm A: 16000
  • Firm B: 18000
Answer: Firm B pays a larger package of salary because the average salary at Firm B (18000) is higher than that at Firm A (16000).

(ii) Which Firm Shows Greater Variability in the Distribution of Salary?

The variability in the distribution of salary can be compared using the standard deviation.
  • Firm A: 16
  • Firm B: 18
Answer: Firm B shows greater variability in the distribution of salary because the standard deviation of salaries at Firm B (18) is higher than that at Firm A (16).

(iii) Compute the Combined Average Salary and Combined Variance of Both Firms

Combined Average Salary

The combined average salary ( μ μ mu\muμ) for both firms can be calculated using a weighted average:
μ = n A μ A + n B μ B n A + n B μ = n A μ A + n B μ B n A + n B mu=(n_(A)mu _(A)+n_(B)mu _(B))/(n_(A)+n_(B))\mu = \frac{n_A \mu_A + n_B \mu_B}{n_A + n_B}μ=nAμA+nBμBnA+nB
Substituting the given values:
μ = 100 × 16000 + 200 × 18000 100 + 200 μ = 100 × 16000 + 200 × 18000 100 + 200 mu=(100 xx16000+200 xx18000)/(100+200)\mu = \frac{100 \times 16000 + 200 \times 18000}{100 + 200}μ=100×16000+200×18000100+200
μ = 1600000 + 3600000 300 μ = 1600000 + 3600000 300 mu=(1600000+3600000)/(300)\mu = \frac{1600000 + 3600000}{300}μ=1600000+3600000300
μ = 5200000 300 μ = 5200000 300 mu=(5200000)/(300)\mu = \frac{5200000}{300}μ=5200000300
μ = 17333. 3 ¯ μ = 17333. 3 ¯ mu=17333. bar(3)\mu = 17333.\overline{3}μ=17333.3¯
So, the combined average salary is approximately 17333.33 17333.33 17333.3317333.3317333.33.

Combined Variance

To compute the combined variance ( σ 2 σ 2 sigma^(2)\sigma^2σ2), we need to use the following formula:
σ 2 = 1 N [ i = 1 k n i ( σ i 2 + μ i 2 ) ( i = 1 k n i μ i ) 2 / N ] σ 2 = 1 N i = 1 k n i ( σ i 2 + μ i 2 ) i = 1 k n i μ i 2 / N sigma^(2)=(1)/(N)[sum_(i=1)^(k)n_(i)(sigma_(i)^(2)+mu_(i)^(2))-(sum_(i=1)^(k)n_(i)mu _(i))^(2)//N]\sigma^2 = \frac{1}{N} \left[ \sum_{i=1}^{k} n_i (\sigma_i^2 + \mu_i^2) – \left( \sum_{i=1}^{k} n_i \mu_i \right)^2 / N \right]σ2=1N[i=1kni(σi2+μi2)(i=1kniμi)2/N]
Where:
  • N = n A + n B N = n A + n B N=n_(A)+n_(B)N = n_A + n_BN=nA+nB
  • σ i 2 σ i 2 sigma_(i)^(2)\sigma_i^2σi2 is the variance of each firm (the square of the standard deviation).
First, let’s calculate the individual components:
  • N = 100 + 200 = 300 N = 100 + 200 = 300 N=100+200=300N = 100 + 200 = 300N=100+200=300
  • σ A 2 = 16 2 = 256 σ A 2 = 16 2 = 256 sigma_(A)^(2)=16^(2)=256\sigma_A^2 = 16^2 = 256σA2=162=256
  • σ B 2 = 18 2 = 324 σ B 2 = 18 2 = 324 sigma_(B)^(2)=18^(2)=324\sigma_B^2 = 18^2 = 324σB2=182=324
Now, calculate the terms inside the formula:
i = 1 k n i ( σ i 2 + μ i 2 ) = 100 × ( 256 + 16000 2 ) + 200 × ( 324 + 18000 2 ) i = 1 k n i ( σ i 2 + μ i 2 ) = 100 × ( 256 + 16000 2 ) + 200 × ( 324 + 18000 2 ) sum_(i=1)^(k)n_(i)(sigma_(i)^(2)+mu_(i)^(2))=100 xx(256+16000^(2))+200 xx(324+18000^(2))\sum_{i=1}^{k} n_i (\sigma_i^2 + \mu_i^2) = 100 \times (256 + 16000^2) + 200 \times (324 + 18000^2)i=1kni(σi2+μi2)=100×(256+160002)+200×(324+180002)
= 100 × ( 256 + 256000000 ) + 200 × ( 324 + 324000000 ) = 100 × ( 256 + 256000000 ) + 200 × ( 324 + 324000000 ) =100 xx(256+256000000)+200 xx(324+324000000)= 100 \times (256 + 256000000) + 200 \times (324 + 324000000)=100×(256+256000000)+200×(324+324000000)
= 100 × 256000256 + 200 × 324000324 = 100 × 256000256 + 200 × 324000324 =100 xx256000256+200 xx324000324= 100 \times 256000256 + 200 \times 324000324=100×256000256+200×324000324
= 25600025600 + 64800064800 = 25600025600 + 64800064800 =25600025600+64800064800= 25600025600 + 64800064800=25600025600+64800064800
= 90400090400 = 90400090400 =90400090400= 90400090400=90400090400
Now, calculate the term ( i = 1 k n i μ i ) 2 i = 1 k n i μ i 2 (sum_(i=1)^(k)n_(i)mu _(i))^(2)\left( \sum_{i=1}^{k} n_i \mu_i \right)^2(i=1kniμi)2:
( 100 × 16000 + 200 × 18000 ) 2 100 × 16000 + 200 × 18000 2 (100 xx16000+200 xx18000)^(2)\left( 100 \times 16000 + 200 \times 18000 \right)^2(100×16000+200×18000)2
= ( 1600000 + 3600000 ) 2 = ( 1600000 + 3600000 ) 2 =(1600000+3600000)^(2)= (1600000 + 3600000)^2=(1600000+3600000)2
= 5200000 2 = 5200000 2 =5200000^(2)= 5200000^2=52000002
= 27040000000000 = 27040000000000 =27040000000000= 27040000000000=27040000000000
Now, put these into the formula:
σ 2 = 1 300 ( 90400090400 27040000000000 300 ) σ 2 = 1 300 90400090400 27040000000000 300 sigma^(2)=(1)/(300)(90400090400-(27040000000000)/(300))\sigma^2 = \frac{1}{300} \left( 90400090400 – \frac{27040000000000}{300} \right)σ2=1300(9040009040027040000000000300)
σ 2 = 1 300 ( 90400090400 90133333333.33 ) σ 2 = 1 300 90400090400 90133333333.33 sigma^(2)=(1)/(300)(90400090400-90133333333.33)\sigma^2 = \frac{1}{300} \left( 90400090400 – 90133333333.33 \right)σ2=1300(9040009040090133333333.33)
σ 2 = 1 300 × 266757066.67 σ 2 = 1 300 × 266757066.67 sigma^(2)=(1)/(300)xx266757066.67\sigma^2 = \frac{1}{300} \times 266757066.67σ2=1300×266757066.67
σ 2 = 889190.22 σ 2 = 889190.22 sigma^(2)=889190.22\sigma^2 = 889190.22σ2=889190.22
So, the combined variance is 889190.22 889190.22 889190.22889190.22889190.22.

Conclusion:

  1. Larger package of salary: Firm B
  2. Greater variability in the distribution of salary: Firm B
  3. Combined average salary: 17333.33 17333.33 17333.3317333.3317333.33
  4. Combined variance: 889190.22 889190.22 889190.22889190.22889190.22

Question:-03

3.(a) Define coefficient of determination and correlation ratio.

Answer:

Coefficient of Determination ( R 2 R 2 R^(2)R^2R2)

The coefficient of determination, denoted as R 2 R 2 R^(2)R^2R2, is a statistical measure used in the context of regression analysis to assess the goodness of fit of a model. It provides the proportion of the variance in the dependent variable that is predictable from the independent variable(s).

Definition:

R 2 = 1 SS res SS tot R 2 = 1 SS res SS tot R^(2)=1-(“SS”_(“res”))/(“SS”_(“tot”))R^2 = 1 – \frac{\text{SS}_{\text{res}}}{\text{SS}_{\text{tot}}}R2=1SSresSStot
Where:
  • SS res SS res “SS”_(“res”)\text{SS}_{\text{res}}SSres (Residual Sum of Squares): The sum of the squares of the residuals (the differences between the observed and predicted values).
  • SS tot SS tot “SS”_(“tot”)\text{SS}_{\text{tot}}SStot (Total Sum of Squares): The sum of the squares of the differences between the observed values and the mean of the observed values.

Interpretation:

  • R 2 R 2 R^(2)R^2R2 ranges from 0 to 1.
    • R 2 = 0 R 2 = 0 R^(2)=0R^2 = 0R2=0: The independent variable does not explain any of the variation in the dependent variable.
    • R 2 = 1 R 2 = 1 R^(2)=1R^2 = 1R2=1: The independent variable perfectly explains all the variation in the dependent variable.
    • An R 2 R 2 R^(2)R^2R2 value close to 1 indicates a strong relationship, whereas a value close to 0 indicates a weak relationship.

Example:

If R 2 = 0.85 R 2 = 0.85 R^(2)=0.85R^2 = 0.85R2=0.85, it means that 85% of the variance in the dependent variable is explained by the independent variable(s).

Correlation Ratio (η or η 2 η 2 eta^(2)\eta^2η2)

The correlation ratio, denoted by the Greek letter eta (η), or η 2 η 2 eta^(2)\eta^2η2 when referring to the squared correlation ratio, is a measure of the strength of the relationship between a continuous dependent variable and a categorical independent variable. It is used in analysis of variance (ANOVA) and is particularly useful when the relationship is not strictly linear.

Definition:

η 2 = SS between SS total η 2 = SS between SS total eta^(2)=(“SS”_(“between”))/(“SS”_(“total”))\eta^2 = \frac{\text{SS}_{\text{between}}}{\text{SS}_{\text{total}}}η2=SSbetweenSStotal
Where:
  • SS between SS between “SS”_(“between”)\text{SS}_{\text{between}}SSbetween (Between-Group Sum of Squares): The sum of the squared deviations of the group means from the overall mean, weighted by the number of observations in each group.
  • SS total SS total “SS”_(“total”)\text{SS}_{\text{total}}SStotal (Total Sum of Squares): The sum of the squared deviations of each observation from the overall mean.

Interpretation:

  • η 2 η 2 eta^(2)\eta^2η2 ranges from 0 to 1.
    • η 2 = 0 η 2 = 0 eta^(2)=0\eta^2 = 0η2=0: There is no relationship between the dependent variable and the categorical independent variable.
    • η 2 = 1 η 2 = 1 eta^(2)=1\eta^2 = 1η2=1: There is a perfect relationship between the dependent variable and the categorical independent variable.
    • Higher values of η 2 η 2 eta^(2)\eta^2η2 indicate a stronger relationship.

Example:

If η 2 = 0.65 η 2 = 0.65 eta^(2)=0.65\eta^2 = 0.65η2=0.65, it means that 65% of the variance in the dependent variable can be attributed to the differences between groups defined by the categorical independent variable.

Summary:

  • Coefficient of Determination ( R 2 R 2 R^(2)R^2R2): Measures the proportion of variance in the dependent variable explained by the independent variable(s) in a regression model. It ranges from 0 to 1.
  • Correlation Ratio (η or η 2 η 2 eta^(2)\eta^2η2): Measures the strength of the relationship between a continuous dependent variable and a categorical independent variable, particularly useful for non-linear relationships. It also ranges from 0 to 1.

(b) Calculate the correlation coefficient from the following data :
X X X\mathbf{X}X Y Y Y\mathbf{Y}Y
12 14
9 8
8 6
10 9
11 11
13 12
X Y 12 14 9 8 8 6 10 9 11 11 13 12| $\mathbf{X}$ | $\mathbf{Y}$ | | :—: | :—: | | 12 | 14 | | 9 | 8 | | 8 | 6 | | 10 | 9 | | 11 | 11 | | 13 | 12 |
Let now each value of X X X\mathrm{X}X be multiplied by 2 and then 6 be added to it.
Similarly, multiply each value of Y Y Y\mathrm{Y}Y by 3 and subtract 2 from it. What will be the correlation coefficient between the new series of X X X\mathrm{X}X and Y Y Y\mathrm{Y}Y ?

Answer:

To calculate the correlation coefficient ( r r rrr) for the given data and the transformed data, we will follow these steps:

Step 1: Calculate the Correlation Coefficient for the Original Data

Given data:
X Y 12 14 9 8 8 6 10 9 11 11 13 12 X Y 12 14 9 8 8 6 10 9 11 11 13 12 {:[X,Y],[12,14],[9,8],[8,6],[10,9],[11,11],[13,12]:}\begin{array}{|c|c|} \hline \mathbf{X} & \mathbf{Y} \\ \hline 12 & 14 \\ 9 & 8 \\ 8 & 6 \\ 10 & 9 \\ 11 & 11 \\ 13 & 12 \\ \hline \end{array}XY1214988610911111312
  1. Calculate the means of X X XXX and Y Y YYY:
    X ¯ = 12 + 9 + 8 + 10 + 11 + 13 6 = 63 6 = 10.5 X ¯ = 12 + 9 + 8 + 10 + 11 + 13 6 = 63 6 = 10.5 bar(X)=(12+9+8+10+11+13)/(6)=(63)/(6)=10.5\bar{X} = \frac{12 + 9 + 8 + 10 + 11 + 13}{6} = \frac{63}{6} = 10.5X¯=12+9+8+10+11+136=636=10.5
    Y ¯ = 14 + 8 + 6 + 9 + 11 + 12 6 = 60 6 = 10 Y ¯ = 14 + 8 + 6 + 9 + 11 + 12 6 = 60 6 = 10 bar(Y)=(14+8+6+9+11+12)/(6)=(60)/(6)=10\bar{Y} = \frac{14 + 8 + 6 + 9 + 11 + 12}{6} = \frac{60}{6} = 10Y¯=14+8+6+9+11+126=606=10
  2. Calculate the deviations from the mean:
    ( X i X ¯ ) , ( Y i Y ¯ ) ( X i X ¯ ) , ( Y i Y ¯ ) (X_(i)- bar(X)),(Y_(i)- bar(Y))(X_i – \bar{X}), (Y_i – \bar{Y})(XiX¯),(YiY¯)
    X i Y i ( X i X ¯ ) ( Y i Y ¯ ) 12 14 ( 12 10.5 ) ( 14 10 ) = 1.5 4 = 6 9 8 ( 9 10.5 ) ( 8 10 ) = 1.5 2 = 3 8 6 ( 8 10.5 ) ( 6 10 ) = 2.5 4 = 10 10 9 ( 10 10.5 ) ( 9 10 ) = 0.5 1 = 0.5 11 11 ( 11 10.5 ) ( 11 10 ) = 0.5 1 = 0.5 13 12 ( 13 10.5 ) ( 12 10 ) = 2.5 2 = 5 X i Y i ( X i X ¯ ) ( Y i Y ¯ ) 12 14 ( 12 10.5 ) ( 14 10 ) = 1.5 4 = 6 9 8 ( 9 10.5 ) ( 8 10 ) = 1.5 2 = 3 8 6 ( 8 10.5 ) ( 6 10 ) = 2.5 4 = 10 10 9 ( 10 10.5 ) ( 9 10 ) = 0.5 1 = 0.5 11 11 ( 11 10.5 ) ( 11 10 ) = 0.5 1 = 0.5 13 12 ( 13 10.5 ) ( 12 10 ) = 2.5 2 = 5 {:[X_(i),Y_(i),(X_(i)- bar(X))(Y_(i)- bar(Y))],[12,14,(12-10.5)(14-10)=1.5*4=6],[9,8,(9-10.5)(8-10)=-1.5*-2=3],[8,6,(8-10.5)(6-10)=-2.5*-4=10],[10,9,(10-10.5)(9-10)=-0.5*-1=0.5],[11,11,(11-10.5)(11-10)=0.5*1=0.5],[13,12,(13-10.5)(12-10)=2.5*2=5]:}\begin{array}{ccc} X_i & Y_i & (X_i – \bar{X})(Y_i – \bar{Y}) \\ \hline 12 & 14 & (12 – 10.5)(14 – 10) = 1.5 \cdot 4 = 6 \\ 9 & 8 & (9 – 10.5)(8 – 10) = -1.5 \cdot -2 = 3 \\ 8 & 6 & (8 – 10.5)(6 – 10) = -2.5 \cdot -4 = 10 \\ 10 & 9 & (10 – 10.5)(9 – 10) = -0.5 \cdot -1 = 0.5 \\ 11 & 11 & (11 – 10.5)(11 – 10) = 0.5 \cdot 1 = 0.5 \\ 13 & 12 & (13 – 10.5)(12 – 10) = 2.5 \cdot 2 = 5 \\ \end{array}XiYi(XiX¯)(YiY¯)1214(1210.5)(1410)=1.54=698(910.5)(810)=1.52=386(810.5)(610)=2.54=10109(1010.5)(910)=0.51=0.51111(1110.5)(1110)=0.51=0.51312(1310.5)(1210)=2.52=5
  3. Sum of the products of deviations:
    ( X i X ¯ ) ( Y i Y ¯ ) = 6 + 3 + 10 + 0.5 + 0.5 + 5 = 25 ( X i X ¯ ) ( Y i Y ¯ ) = 6 + 3 + 10 + 0.5 + 0.5 + 5 = 25 sum(X_(i)- bar(X))(Y_(i)- bar(Y))=6+3+10+0.5+0.5+5=25\sum (X_i – \bar{X})(Y_i – \bar{Y}) = 6 + 3 + 10 + 0.5 + 0.5 + 5 = 25(XiX¯)(YiY¯)=6+3+10+0.5+0.5+5=25
  4. Calculate the standard deviations of X X XXX and Y Y YYY:
    σ X = ( X i X ¯ ) 2 n = 1.5 2 + ( 1.5 ) 2 + ( 2.5 ) 2 + ( 0.5 ) 2 + 0.5 2 + 2.5 2 6 σ X = ( X i X ¯ ) 2 n = 1.5 2 + ( 1.5 ) 2 + ( 2.5 ) 2 + ( 0.5 ) 2 + 0.5 2 + 2.5 2 6 sigma _(X)=sqrt((sum(X_(i)-( bar(X)))^(2))/(n))=sqrt((1.5^(2)+(-1.5)^(2)+(-2.5)^(2)+(-0.5)^(2)+0.5^(2)+2.5^(2))/(6))\sigma_X = \sqrt{\frac{\sum (X_i – \bar{X})^2}{n}} = \sqrt{\frac{1.5^2 + (-1.5)^2 + (-2.5)^2 + (-0.5)^2 + 0.5^2 + 2.5^2}{6}}σX=(XiX¯)2n=1.52+(1.5)2+(2.5)2+(0.5)2+0.52+2.526
    σ X = 2.25 + 2.25 + 6.25 + 0.25 + 0.25 + 6.25 6 = 17.5 6 = 2.9167 1.71 σ X = 2.25 + 2.25 + 6.25 + 0.25 + 0.25 + 6.25 6 = 17.5 6 = 2.9167 1.71 sigma _(X)=sqrt((2.25+2.25+6.25+0.25+0.25+6.25)/(6))=sqrt((17.5)/(6))=sqrt2.9167~~1.71\sigma_X = \sqrt{\frac{2.25 + 2.25 + 6.25 + 0.25 + 0.25 + 6.25}{6}} = \sqrt{\frac{17.5}{6}} = \sqrt{2.9167} \approx 1.71σX=2.25+2.25+6.25+0.25+0.25+6.256=17.56=2.91671.71
    σ Y = ( Y i Y ¯ ) 2 n = 4 2 + ( 2 ) 2 + ( 4 ) 2 + ( 1 ) 2 + 1 2 + 2 2 6 σ Y = ( Y i Y ¯ ) 2 n = 4 2 + ( 2 ) 2 + ( 4 ) 2 + ( 1 ) 2 + 1 2 + 2 2 6 sigma _(Y)=sqrt((sum(Y_(i)-( bar(Y)))^(2))/(n))=sqrt((4^(2)+(-2)^(2)+(-4)^(2)+(-1)^(2)+1^(2)+2^(2))/(6))\sigma_Y = \sqrt{\frac{\sum (Y_i – \bar{Y})^2}{n}} = \sqrt{\frac{4^2 + (-2)^2 + (-4)^2 + (-1)^2 + 1^2 + 2^2}{6}}σY=(YiY¯)2n=42+(2)2+(4)2+(1)2+12+226
    σ Y = 16 + 4 + 16 + 1 + 1 + 4 6 = 42 6 = 7 2.65 σ Y = 16 + 4 + 16 + 1 + 1 + 4 6 = 42 6 = 7 2.65 sigma _(Y)=sqrt((16+4+16+1+1+4)/(6))=sqrt((42)/(6))=sqrt7~~2.65\sigma_Y = \sqrt{\frac{16 + 4 + 16 + 1 + 1 + 4}{6}} = \sqrt{\frac{42}{6}} = \sqrt{7} \approx 2.65σY=16+4+16+1+1+46=426=72.65
  5. Calculate the correlation coefficient:
    r = ( X i X ¯ ) ( Y i Y ¯ ) n σ X σ Y = 25 6 1.71 2.65 25 27.18 0.92 r = ( X i X ¯ ) ( Y i Y ¯ ) n σ X σ Y = 25 6 1.71 2.65 25 27.18 0.92 r=(sum(X_(i)-( bar(X)))(Y_(i)-( bar(Y))))/(n*sigma _(X)*sigma _(Y))=(25)/(6*1.71*2.65)~~(25)/(27.18)~~0.92r = \frac{\sum (X_i – \bar{X})(Y_i – \bar{Y})}{n \cdot \sigma_X \cdot \sigma_Y} = \frac{25}{6 \cdot 1.71 \cdot 2.65} \approx \frac{25}{27.18} \approx 0.92r=(XiX¯)(YiY¯)nσXσY=2561.712.652527.180.92

Step 2: Calculate the Correlation Coefficient for the Transformed Data

The transformations are:
  • X = 2 X + 6 X = 2 X + 6 X^(‘)=2X+6X’ = 2X + 6X=2X+6
  • Y = 3 Y 2 Y = 3 Y 2 Y^(‘)=3Y-2Y’ = 3Y – 2Y=3Y2

Transformation Impact:

  • The transformations are linear transformations of the original variables X X XXX and Y Y YYY.
  • Linear transformations do not affect the correlation coefficient between the two variables. The correlation coefficient remains unchanged.

Conclusion:

  • The correlation coefficient for the original data is approximately 0.92 0.92 0.920.920.92.
  • The correlation coefficient for the transformed data will be the same as for the original data, which is approximately 0.92 0.92 0.920.920.92.

Question:-04

4.(a) Differentiate between correlation and regression.

Answer:

Correlation and regression are both statistical tools used to examine relationships between variables, but they serve different purposes and provide different information. Here’s a detailed differentiation between the two:

Correlation:

  1. Purpose:
    • Correlation measures the strength and direction of the linear relationship between two variables.
  2. Nature of Analysis:
    • It quantifies the degree to which two variables are related, but does not imply causation.
  3. Output:
    • The result is a correlation coefficient, typically denoted by r r rrr.
    • r r rrr ranges from -1 to 1:
      • r = 1 r = 1 r=1r = 1r=1 indicates a perfect positive linear relationship.
      • r = 1 r = 1 r=-1r = -1r=1 indicates a perfect negative linear relationship.
      • r = 0 r = 0 r=0r = 0r=0 indicates no linear relationship.
  4. Symmetry:
    • Correlation is symmetric: corr ( X , Y ) = corr ( Y , X ) corr ( X , Y ) = corr ( Y , X ) “corr”(X,Y)=”corr”(Y,X)\text{corr}(X, Y) = \text{corr}(Y, X)corr(X,Y)=corr(Y,X).
  5. Units:
    • Correlation is a unitless measure, meaning it does not depend on the scale of the variables.
  6. Types:
    • Pearson correlation (measures linear relationship).
    • Spearman correlation (measures monotonic relationship, suitable for non-parametric data).
  7. Interpretation:
    • Correlation only indicates the degree of association, not the exact nature or causality.

Regression:

  1. Purpose:
    • Regression assesses the relationship between a dependent variable and one or more independent variables, and models how the dependent variable changes when the independent variable(s) are varied.
  2. Nature of Analysis:
    • It explains the nature of the relationship and provides a predictive model.
    • The focus is on predicting the value of the dependent variable based on the independent variable(s).
  3. Output:
    • The result is an equation that describes the relationship between the variables.
    • In simple linear regression: Y = a + b X Y = a + b X Y=a+bXY = a + bXY=a+bX
      • Y Y YYY is the dependent variable.
      • X X XXX is the independent variable.
      • a a aaa is the intercept.
      • b b bbb is the slope (regression coefficient).
  4. Symmetry:
    • Regression is not symmetric: the regression of Y Y YYY on X X XXX is not the same as the regression of X X XXX on Y Y YYY.
  5. Units:
    • The regression coefficients have units and are interpreted as the change in the dependent variable for a one-unit change in the independent variable.
  6. Types:
    • Simple linear regression (one independent variable).
    • Multiple linear regression (more than one independent variable).
    • Non-linear regression (non-linear relationships).
  7. Interpretation:
    • Regression provides insights into the nature of the relationship, including magnitude and direction of influence.
    • It also helps in making predictions and understanding causality, to an extent.

Summary:

  • Correlation: Measures strength and direction of linear relationship, symmetric, unitless, no causation.
  • Regression: Models relationship, provides predictive equation, not symmetric, coefficients have units, indicates causation to some extent.
Understanding these distinctions helps in choosing the appropriate method for analyzing data and interpreting the results accurately.

(b) In order to find the correlation between two variables X X X\mathrm{X}X and Y Y Y\mathrm{Y}Y from 12 pairs of observations, the following calculations were obtained :
Σ X = 30 , Σ X 2 = 670 , Σ Y = 5 , Σ Y 2 = 285 , Σ X Y = 344 Σ X = 30 , Σ X 2 = 670 , Σ Y = 5 , Σ Y 2 = 285 , Σ X Y = 344 {:[SigmaX=30″,”quad SigmaX^(2)=670″,”quad SigmaY=5″,”quad SigmaY^(2)=285″,”],[SigmaXY=344]:}\begin{aligned} & \Sigma \mathrm{X}=30, \quad \Sigma \mathrm{X}^2=670, \quad \Sigma \mathrm{Y}=5, \quad \Sigma \mathrm{Y}^2=285, \\ & \Sigma \mathrm{XY}=344 \end{aligned}ΣX=30,ΣX2=670,ΣY=5,ΣY2=285,ΣXY=344
On subsequent verification, it was discovered that the pair ( X = 11 , Y = 4 ) ( X = 11 , Y = 4 ) (X=11,Y=4)(\mathrm{X}=11, \mathrm{Y}=4)(X=11,Y=4) was copied wrongly, the correct values being ( X = 10 , Y = 14 ) ( X = 10 , Y = 14 ) (X=10,Y=14)(\mathrm{X}=10, \mathrm{Y}=14)(X=10,Y=14). After making necessary correction, find :
(i) regression coefficients,
(ii) two regression equations, and
(iii) correlation coefficient.

Answer:

To address the given problem, we need to first correct the calculations with the correct pair of observations ( X = 10 , Y = 14 ) ( X = 10 , Y = 14 ) (X=10,Y=14)(X=10, Y=14)(X=10,Y=14) instead of ( X = 11 , Y = 4 ) ( X = 11 , Y = 4 ) (X=11,Y=4)(X=11, Y=4)(X=11,Y=4). Then, we will proceed to find the regression coefficients, the regression equations, and the correlation coefficient.

Step 1: Correct the Calculations

Given:
  • Original Σ X = 30 Σ X = 30 Sigma X=30\Sigma X = 30ΣX=30
  • Original Σ Y = 5 Σ Y = 5 Sigma Y=5\Sigma Y = 5ΣY=5
  • Original Σ X 2 = 670 Σ X 2 = 670 SigmaX^(2)=670\Sigma X^2 = 670ΣX2=670
  • Original Σ Y 2 = 285 Σ Y 2 = 285 SigmaY^(2)=285\Sigma Y^2 = 285ΣY2=285
  • Original Σ X Y = 344 Σ X Y = 344 Sigma XY=344\Sigma XY = 344ΣXY=344
Incorrect pair: ( X = 11 , Y = 4 ) ( X = 11 , Y = 4 ) (X=11,Y=4)(X=11, Y=4)(X=11,Y=4)
Correct pair: ( X = 10 , Y = 14 ) ( X = 10 , Y = 14 ) (X=10,Y=14)(X=10, Y=14)(X=10,Y=14)
First, remove the contributions of the incorrect pair and then add the contributions of the correct pair.

Corrected Sums:

  • Corrected Σ X Σ X Sigma X\Sigma XΣX:
    Σ X new = Σ X 11 + 10 = 30 11 + 10 = 29 Σ X new = Σ X 11 + 10 = 30 11 + 10 = 29 SigmaX_(“new”)=Sigma X-11+10=30-11+10=29\Sigma X_{\text{new}} = \Sigma X – 11 + 10 = 30 – 11 + 10 = 29ΣXnew=ΣX11+10=3011+10=29
  • Corrected Σ Y Σ Y Sigma Y\Sigma YΣY:
    Σ Y new = Σ Y 4 + 14 = 5 4 + 14 = 15 Σ Y new = Σ Y 4 + 14 = 5 4 + 14 = 15 SigmaY_(“new”)=Sigma Y-4+14=5-4+14=15\Sigma Y_{\text{new}} = \Sigma Y – 4 + 14 = 5 – 4 + 14 = 15ΣYnew=ΣY4+14=54+14=15
  • Corrected Σ X 2 Σ X 2 SigmaX^(2)\Sigma X^2ΣX2:
    Σ X new 2 = Σ X 2 11 2 + 10 2 = 670 121 + 100 = 649 Σ X new 2 = Σ X 2 11 2 + 10 2 = 670 121 + 100 = 649 SigmaX_(“new”)^(2)=SigmaX^(2)-11^(2)+10^(2)=670-121+100=649\Sigma X^2_{\text{new}} = \Sigma X^2 – 11^2 + 10^2 = 670 – 121 + 100 = 649ΣXnew2=ΣX2112+102=670121+100=649
  • Corrected Σ Y 2 Σ Y 2 SigmaY^(2)\Sigma Y^2ΣY2:
    Σ Y new 2 = Σ Y 2 4 2 + 14 2 = 285 16 + 196 = 465 Σ Y new 2 = Σ Y 2 4 2 + 14 2 = 285 16 + 196 = 465 SigmaY_(“new”)^(2)=SigmaY^(2)-4^(2)+14^(2)=285-16+196=465\Sigma Y^2_{\text{new}} = \Sigma Y^2 – 4^2 + 14^2 = 285 – 16 + 196 = 465ΣYnew2=ΣY242+142=28516+196=465
  • Corrected Σ X Y Σ X Y Sigma XY\Sigma XYΣXY:
    Σ X Y new = Σ X Y 11 4 + 10 14 = 344 44 + 140 = 440 Σ X Y new = Σ X Y 11 4 + 10 14 = 344 44 + 140 = 440 Sigma XY_(“new”)=Sigma XY-11*4+10*14=344-44+140=440\Sigma XY_{\text{new}} = \Sigma XY – 11 \cdot 4 + 10 \cdot 14 = 344 – 44 + 140 = 440ΣXYnew=ΣXY114+1014=34444+140=440

Step 2: Calculate the Means

Number of observations n = 12 n = 12 n=12n = 12n=12.
  • Mean of X X XXX:
    X ¯ = Σ X new n = 29 12 X ¯ = Σ X new n = 29 12 bar(X)=(SigmaX_(“new”))/(n)=(29)/(12)\bar{X} = \frac{\Sigma X_{\text{new}}}{n} = \frac{29}{12}X¯=ΣXnewn=2912
  • Mean of Y Y YYY:
    Y ¯ = Σ Y new n = 15 12 Y ¯ = Σ Y new n = 15 12 bar(Y)=(SigmaY_(“new”))/(n)=(15)/(12)\bar{Y} = \frac{\Sigma Y_{\text{new}}}{n} = \frac{15}{12}Y¯=ΣYnewn=1512

Step 3: Calculate the Regression Coefficients

Regression Coefficient of Y Y YYY on X X XXX ( b Y X b Y X b_(YX)b_{YX}bYX):

b Y X = n Σ X Y new Σ X new Σ Y new n Σ X new 2 ( Σ X new ) 2 b Y X = n Σ X Y new Σ X new Σ Y new n Σ X new 2 ( Σ X new ) 2 b_(YX)=(n Sigma XY_(“new”)-SigmaX_(“new”)SigmaY_(“new”))/(n SigmaX_(“new”)^(2)-(SigmaX_(“new”))^(2))b_{YX} = \frac{n \Sigma XY_{\text{new}} – \Sigma X_{\text{new}} \Sigma Y_{\text{new}}}{n \Sigma X^2_{\text{new}} – (\Sigma X_{\text{new}})^2}bYX=nΣXYnewΣXnewΣYnewnΣXnew2(ΣXnew)2
b Y X = 12 440 29 15 12 649 29 2 b Y X = 12 440 29 15 12 649 29 2 b_(YX)=(12*440-29*15)/(12*649-29^(2))b_{YX} = \frac{12 \cdot 440 – 29 \cdot 15}{12 \cdot 649 – 29^2}bYX=12440291512649292
b Y X = 5280 435 7788 841 b Y X = 5280 435 7788 841 b_(YX)=(5280-435)/(7788-841)b_{YX} = \frac{5280 – 435}{7788 – 841}bYX=52804357788841
b Y X = 4845 6947 0.697 b Y X = 4845 6947 0.697 b_(YX)=(4845)/(6947)~~0.697b_{YX} = \frac{4845}{6947} \approx 0.697bYX=484569470.697

Regression Coefficient of X X XXX on Y Y YYY ( b X Y b X Y b_(XY)b_{XY}bXY):

b X Y = n Σ X Y new Σ X new Σ Y new n Σ Y new 2 ( Σ Y new ) 2 b X Y = n Σ X Y new Σ X new Σ Y new n Σ Y new 2 ( Σ Y new ) 2 b_(XY)=(n Sigma XY_(“new”)-SigmaX_(“new”)SigmaY_(“new”))/(n SigmaY_(“new”)^(2)-(SigmaY_(“new”))^(2))b_{XY} = \frac{n \Sigma XY_{\text{new}} – \Sigma X_{\text{new}} \Sigma Y_{\text{new}}}{n \Sigma Y^2_{\text{new}} – (\Sigma Y_{\text{new}})^2}bXY=nΣXYnewΣXnewΣYnewnΣYnew2(ΣYnew)2
b X Y = 12 440 29 15 12 465 15 2 b X Y = 12 440 29 15 12 465 15 2 b_(XY)=(12*440-29*15)/(12*465-15^(2))b_{XY} = \frac{12 \cdot 440 – 29 \cdot 15}{12 \cdot 465 – 15^2}bXY=12440291512465152
b X Y = 5280 435 5580 225 b X Y = 5280 435 5580 225 b_(XY)=(5280-435)/(5580-225)b_{XY} = \frac{5280 – 435}{5580 – 225}bXY=52804355580225
b X Y = 4845 5355 0.905 b X Y = 4845 5355 0.905 b_(XY)=(4845)/(5355)~~0.905b_{XY} = \frac{4845}{5355} \approx 0.905bXY=484553550.905

Step 4: Calculate the Regression Equations

Regression Equation of Y Y YYY on X X XXX:

Y Y ¯ = b Y X ( X X ¯ ) Y Y ¯ = b Y X ( X X ¯ ) Y- bar(Y)=b_(YX)(X- bar(X))Y – \bar{Y} = b_{YX} (X – \bar{X})YY¯=bYX(XX¯)
Y 15 12 = 0.697 ( X 29 12 ) Y 15 12 = 0.697 X 29 12 Y-(15)/(12)=0.697(X-(29)/(12))Y – \frac{15}{12} = 0.697 \left(X – \frac{29}{12}\right)Y1512=0.697(X2912)
Y = 0.697 X 0.697 29 12 + 15 12 Y = 0.697 X 0.697 29 12 + 15 12 Y=0.697 X-0.697*(29)/(12)+(15)/(12)Y = 0.697X – 0.697 \cdot \frac{29}{12} + \frac{15}{12}Y=0.697X0.6972912+1512
Y = 0.697 X 1.687 + 1.25 Y = 0.697 X 1.687 + 1.25 Y=0.697 X-1.687+1.25Y = 0.697X – 1.687 + 1.25Y=0.697X1.687+1.25
Y = 0.697 X 0.437 Y = 0.697 X 0.437 Y=0.697 X-0.437Y = 0.697X – 0.437Y=0.697X0.437

Regression Equation of X X XXX on Y Y YYY:

X X ¯ = b X Y ( Y Y ¯ ) X X ¯ = b X Y ( Y Y ¯ ) X- bar(X)=b_(XY)(Y- bar(Y))X – \bar{X} = b_{XY} (Y – \bar{Y})XX¯=bXY(YY¯)
X 29 12 = 0.905 ( Y 15 12 ) X 29 12 = 0.905 Y 15 12 X-(29)/(12)=0.905(Y-(15)/(12))X – \frac{29}{12} = 0.905 \left(Y – \frac{15}{12}\right)X2912=0.905(Y1512)
X = 0.905 Y 0.905 15 12 + 29 12 X = 0.905 Y 0.905 15 12 + 29 12 X=0.905 Y-0.905*(15)/(12)+(29)/(12)X = 0.905Y – 0.905 \cdot \frac{15}{12} + \frac{29}{12}X=0.905Y0.9051512+2912
X = 0.905 Y 1.131 + 2.417 X = 0.905 Y 1.131 + 2.417 X=0.905 Y-1.131+2.417X = 0.905Y – 1.131 + 2.417X=0.905Y1.131+2.417
X = 0.905 Y + 1.286 X = 0.905 Y + 1.286 X=0.905 Y+1.286X = 0.905Y + 1.286X=0.905Y+1.286

Step 5: Calculate the Correlation Coefficient

The correlation coefficient r r rrr can be obtained from the regression coefficients b Y X b Y X b_(YX)b_{YX}bYX and b X Y b X Y b_(XY)b_{XY}bXY:
r = b Y X b X Y r = b Y X b X Y r=sqrt(b_(YX)*b_(XY))r = \sqrt{b_{YX} \cdot b_{XY}}r=bYXbXY
r = 0.697 0.905 r = 0.697 0.905 r=sqrt(0.697*0.905)r = \sqrt{0.697 \cdot 0.905}r=0.6970.905
r = 0.630 0.793 r = 0.630 0.793 r=sqrt0.630~~0.793r = \sqrt{0.630} \approx 0.793r=0.6300.793

Summary:

  1. Regression Coefficients:
    • b Y X 0.697 b Y X 0.697 b_(YX)~~0.697b_{YX} \approx 0.697bYX0.697
    • b X Y 0.905 b X Y 0.905 b_(XY)~~0.905b_{XY} \approx 0.905bXY0.905
  2. Regression Equations:
    • Regression equation of Y Y YYY on X X XXX: Y = 0.697 X 0.437 Y = 0.697 X 0.437 Y=0.697 X-0.437Y = 0.697X – 0.437Y=0.697X0.437
    • Regression equation of X X XXX on Y Y YYY: X = 0.905 Y + 1.286 X = 0.905 Y + 1.286 X=0.905 Y+1.286X = 0.905Y + 1.286X=0.905Y+1.286
  3. Correlation Coefficient:
    • r 0.793 r 0.793 r~~0.793r \approx 0.793r0.793

Question:-05

5.(a) In a musical contest, 168 contestants participated. The competition comprised three different stages. It was found that 57 contestants cleared first stage; 45 second stage and 72 third stage. The number of contestants who cleared all the stages, who did not clear any stage, who cleared only first two stages and who cleared only third stage were 17 , 29 , 11 17 , 29 , 11 17,29,1117,29,1117,29,11 and 20 , respectively. With the given information, find how many contestants cleared at least two stages.

Answer:

To solve this problem, we will use the principle of inclusion-exclusion and the given data to find the number of contestants who cleared at least two stages. We denote the following:
  • A A AAA: Contestants who cleared the first stage
  • B B BBB: Contestants who cleared the second stage
  • C C CCC: Contestants who cleared the third stage
We are given:
  • | A | = 57 | A | = 57 |A|=57|A| = 57|A|=57
  • | B | = 45 | B | = 45 |B|=45|B| = 45|B|=45
  • | C | = 72 | C | = 72 |C|=72|C| = 72|C|=72
  • | A B C | = 17 | A B C | = 17 |A nn B nn C|=17|A \cap B \cap C| = 17|ABC|=17 (cleared all stages)
  • | A c B c C c | = 29 | A c B c C c | = 29 |A^(c)nnB^(c)nnC^(c)|=29|A^c \cap B^c \cap C^c| = 29|AcBcCc|=29 (did not clear any stage)
  • | A B C c | = 11 | A B C c | = 11 |A nn B nnC^(c)|=11|A \cap B \cap C^c| = 11|ABCc|=11 (cleared only the first two stages)
  • | A c B c C | = 20 | A c B c C | = 20 |A^(c)nnB^(c)nn C|=20|A^c \cap B^c \cap C| = 20|AcBcC|=20 (cleared only the third stage)

Finding the number of contestants who cleared at least two stages:

  1. Contestants who cleared at least one stage:
    Total number of contestants = 168 = 168 =168= 168=168
    Contestants who did not clear any stage = 29 = 29 =29= 29=29
    Contestants who cleared at least one stage = 168 29 = 139 = 168 29 = 139 =168-29=139= 168 – 29 = 139=16829=139
  2. Using Inclusion-Exclusion Principle:
    We have the formula for the number of contestants who cleared at least one stage:
    | A B C | = | A | + | B | + | C | | A B | | A C | | B C | + | A B C | | A B C | = | A | + | B | + | C | | A B | | A C | | B C | + | A B C | |A uu B uu C|=|A|+|B|+|C|-|A nn B|-|A nn C|-|B nn C|+|A nn B nn C||A \cup B \cup C| = |A| + |B| + |C| – |A \cap B| – |A \cap C| – |B \cap C| + |A \cap B \cap C||ABC|=|A|+|B|+|C||AB||AC||BC|+|ABC|
    We know:
    | A B C | = 139 | A B C | = 139 |A uu B uu C|=139|A \cup B \cup C| = 139|ABC|=139
    Substituting the known values:
    139 = 57 + 45 + 72 | A B | | A C | | B C | + 17 139 = 57 + 45 + 72 | A B | | A C | | B C | + 17 139=57+45+72-|A nn B|-|A nn C|-|B nn C|+17139 = 57 + 45 + 72 – |A \cap B| – |A \cap C| – |B \cap C| + 17139=57+45+72|AB||AC||BC|+17
    Simplifying:
    139 = 191 ( | A B | + | A C | + | B C | ) + 17 139 = 191 ( | A B | + | A C | + | B C | ) + 17 139=191-(|A nn B|+|A nn C|+|B nn C|)+17139 = 191 – (|A \cap B| + |A \cap C| + |B \cap C|) + 17139=191(|AB|+|AC|+|BC|)+17
    139 = 208 ( | A B | + | A C | + | B C | ) 139 = 208 ( | A B | + | A C | + | B C | ) 139=208-(|A nn B|+|A nn C|+|B nn C|)139 = 208 – (|A \cap B| + |A \cap C| + |B \cap C|)139=208(|AB|+|AC|+|BC|)
    | A B | + | A C | + | B C | = 208 139 = 69 | A B | + | A C | + | B C | = 208 139 = 69 |A nn B|+|A nn C|+|B nn C|=208-139=69|A \cap B| + |A \cap C| + |B \cap C| = 208 – 139 = 69|AB|+|AC|+|BC|=208139=69
  3. Finding the individual intersections:
    We know:
    | A B C c | = 11 (cleared only first two stages) | A B C c | = 11 (cleared only first two stages) |A nn B nnC^(c)|=11quad(cleared only first two stages)|A \cap B \cap C^c| = 11 \quad \text{(cleared only first two stages)}|ABCc|=11(cleared only first two stages)
    So:
    | A B | = | A B C | + | A B C c | = 17 + 11 = 28 | A B | = | A B C | + | A B C c | = 17 + 11 = 28 |A nn B|=|A nn B nn C|+|A nn B nnC^(c)|=17+11=28|A \cap B| = |A \cap B \cap C| + |A \cap B \cap C^c| = 17 + 11 = 28|AB|=|ABC|+|ABCc|=17+11=28
    We also know:
    | A c B c C | = 20 (cleared only third stage) | A c B c C | = 20 (cleared only third stage) |A^(c)nnB^(c)nn C|=20quad(cleared only third stage)|A^c \cap B^c \cap C| = 20 \quad \text{(cleared only third stage)}|AcBcC|=20(cleared only third stage)
  4. Using the total of intersections:
    | A B | + | A C | + | B C | = 69 | A B | + | A C | + | B C | = 69 |A nn B|+|A nn C|+|B nn C|=69|A \cap B| + |A \cap C| + |B \cap C| = 69|AB|+|AC|+|BC|=69
    We have:
    | A B | = 28 | A B | = 28 |A nn B|=28|A \cap B| = 28|AB|=28
    | A C | + | B C | = 69 28 = 41 | A C | + | B C | = 69 28 = 41 |A nn C|+|B nn C|=69-28=41|A \cap C| + |B \cap C| = 69 – 28 = 41|AC|+|BC|=6928=41
  5. Contestants who cleared only the third stage:
    | A c B c C | = 20 | A c B c C | = 20 |A^(c)nnB^(c)nn C|=20|A^c \cap B^c \cap C| = 20|AcBcC|=20
  6. Contestants who cleared at least two stages:
    Let’s break this into:
    • | A B | | A B | |A nn B||A \cap B||AB| includes those who cleared both first and second stages
    • | A C | | A C | |A nn C||A \cap C||AC| includes those who cleared both first and third stages
    • | B C | | B C | |B nn C||B \cap C||BC| includes those who cleared both second and third stages
    • | A B C | | A B C | |A nn B nn C||A \cap B \cap C||ABC| is already counted thrice in the above summation
    Using the intersection counts:
    | A B C | = 17 | A B C | = 17 |A nn B nn C|=17|A \cap B \cap C| = 17|ABC|=17
    Therefore, the contestants who cleared at least two stages are:
    | A B | + | A C | + | B C | 2 × | A B C | | A B | + | A C | + | B C | 2 × | A B C | |A nn B|+|A nn C|+|B nn C|-2xx|A nn B nn C||A \cap B| + |A \cap C| + |B \cap C| – 2 \times |A \cap B \cap C||AB|+|AC|+|BC|2×|ABC|
    We know:
    | A B | = 28 , | A C | = x , | B C | = y , x + y = 41 | A B | = 28 , | A C | = x , | B C | = y , x + y = 41 |A nn B|=28,quad|A nn C|=x,quad|B nn C|=y,quad x+y=41|A \cap B| = 28, \quad |A \cap C| = x, \quad |B \cap C| = y, \quad x + y = 41|AB|=28,|AC|=x,|BC|=y,x+y=41
    And including all stages:
    Total contestants cleared at least two stages = 28 + x + y 2 × 17 Total contestants cleared at least two stages = 28 + x + y 2 × 17 “Total contestants cleared at least two stages”=28+x+y-2xx17\text{Total contestants cleared at least two stages} = 28 + x + y – 2 \times 17Total contestants cleared at least two stages=28+x+y2×17
    Since x + y = 41 x + y = 41 x+y=41x + y = 41x+y=41:
    Total contestants cleared at least two stages = 28 + 41 34 = 35 Total contestants cleared at least two stages = 28 + 41 34 = 35 “Total contestants cleared at least two stages”=28+41-34=35\text{Total contestants cleared at least two stages} = 28 + 41 – 34 = 35Total contestants cleared at least two stages=28+4134=35
    Therefore, the number of contestants who cleared at least two stages is 35 35 353535.

(b) For a distribution, Bowley’s coefficient of Skewness is 0.56 , Q 1 = 16.4 0.56 , Q 1 = 16.4 -0.56,Q_(1)=16.4-0.56, \mathrm{Q}_1=16.40.56,Q1=16.4 and median = 24.2 = 24.2 =24.2=24.2=24.2. What is its coefficient of quartile deviation.

Answer:

Bowley’s coefficient of skewness ( S k S k S_(k)S_kSk) is given by the formula:
S k = Q 1 + Q 3 2 Median Q 3 Q 1 S k = Q 1 + Q 3 2 Median Q 3 Q 1 S_(k)=(Q_(1)+Q_(3)-2″Median”)/(Q_(3)-Q_(1))S_k = \frac{Q_1 + Q_3 – 2 \text{Median}}{Q_3 – Q_1}Sk=Q1+Q32MedianQ3Q1
Given:
  • Bowley’s coefficient of skewness S k = 0.56 S k = 0.56 S_(k)=-0.56S_k = -0.56Sk=0.56
  • First quartile ( Q 1 Q 1 Q_(1)Q_1Q1) = 16.4
  • Median = 24.2
We need to find the third quartile ( Q 3 Q 3 Q_(3)Q_3Q3) and the coefficient of quartile deviation.

Step 1: Finding the Third Quartile ( Q 3 Q 3 Q_(3)Q_3Q3)

Rearrange the formula for Bowley’s coefficient of skewness to solve for Q 3 Q 3 Q_(3)Q_3Q3:
0.56 = 16.4 + Q 3 2 ( 24.2 ) Q 3 16.4 0.56 = 16.4 + Q 3 2 ( 24.2 ) Q 3 16.4 -0.56=(16.4+Q_(3)-2(24.2))/(Q_(3)-16.4)-0.56 = \frac{16.4 + Q_3 – 2(24.2)}{Q_3 – 16.4}0.56=16.4+Q32(24.2)Q316.4
Simplify the numerator:
0.56 = 16.4 + Q 3 48.4 Q 3 16.4 0.56 = 16.4 + Q 3 48.4 Q 3 16.4 -0.56=(16.4+Q_(3)-48.4)/(Q_(3)-16.4)-0.56 = \frac{16.4 + Q_3 – 48.4}{Q_3 – 16.4}0.56=16.4+Q348.4Q316.4
0.56 = Q 3 32 Q 3 16.4 0.56 = Q 3 32 Q 3 16.4 -0.56=(Q_(3)-32)/(Q_(3)-16.4)-0.56 = \frac{Q_3 – 32}{Q_3 – 16.4}0.56=Q332Q316.4
Multiply both sides by ( Q 3 16.4 ) ( Q 3 16.4 ) (Q_(3)-16.4)(Q_3 – 16.4)(Q316.4):
0.56 ( Q 3 16.4 ) = Q 3 32 0.56 ( Q 3 16.4 ) = Q 3 32 -0.56(Q_(3)-16.4)=Q_(3)-32-0.56(Q_3 – 16.4) = Q_3 – 320.56(Q316.4)=Q332
Distribute 0.56 0.56 -0.56-0.560.56:
0.56 Q 3 + 9.184 = Q 3 32 0.56 Q 3 + 9.184 = Q 3 32 -0.56Q_(3)+9.184=Q_(3)-32-0.56Q_3 + 9.184 = Q_3 – 320.56Q3+9.184=Q332
Combine like terms:
9.184 + 32 = Q 3 + 0.56 Q 3 9.184 + 32 = Q 3 + 0.56 Q 3 9.184+32=Q_(3)+0.56Q_(3)9.184 + 32 = Q_3 + 0.56Q_39.184+32=Q3+0.56Q3
41.184 = 1.56 Q 3 41.184 = 1.56 Q 3 41.184=1.56Q_(3)41.184 = 1.56Q_341.184=1.56Q3
Solve for Q 3 Q 3 Q_(3)Q_3Q3:
Q 3 = 41.184 1.56 Q 3 = 41.184 1.56 Q_(3)=(41.184)/(1.56)Q_3 = \frac{41.184}{1.56}Q3=41.1841.56
Q 3 26.4 Q 3 26.4 Q_(3)~~26.4Q_3 \approx 26.4Q326.4

Step 2: Calculating the Coefficient of Quartile Deviation

The coefficient of quartile deviation is given by:
Coefficient of Quartile Deviation = Q 3 Q 1 Q 3 + Q 1 Coefficient of Quartile Deviation = Q 3 Q 1 Q 3 + Q 1 “Coefficient of Quartile Deviation”=(Q_(3)-Q_(1))/(Q_(3)+Q_(1))\text{Coefficient of Quartile Deviation} = \frac{Q_3 – Q_1}{Q_3 + Q_1}Coefficient of Quartile Deviation=Q3Q1Q3+Q1
Substitute the known values:
Coefficient of Quartile Deviation = 26.4 16.4 26.4 + 16.4 Coefficient of Quartile Deviation = 26.4 16.4 26.4 + 16.4 “Coefficient of Quartile Deviation”=(26.4-16.4)/(26.4+16.4)\text{Coefficient of Quartile Deviation} = \frac{26.4 – 16.4}{26.4 + 16.4}Coefficient of Quartile Deviation=26.416.426.4+16.4
Coefficient of Quartile Deviation = 10 42.8 Coefficient of Quartile Deviation = 10 42.8 “Coefficient of Quartile Deviation”=(10)/(42.8)\text{Coefficient of Quartile Deviation} = \frac{10}{42.8}Coefficient of Quartile Deviation=1042.8
Coefficient of Quartile Deviation 0.2336 Coefficient of Quartile Deviation 0.2336 “Coefficient of Quartile Deviation”~~0.2336\text{Coefficient of Quartile Deviation} \approx 0.2336Coefficient of Quartile Deviation0.2336

Summary:

The coefficient of quartile deviation is approximately 0.2336 0.2336 0.23360.23360.2336.

Question:-06

6.A researcher wants to study the association between temperament of husband and wife. She examined 5120 pairs and made the following contingency tables :
Temperament
of Husband
Temperament of Husband| Temperament | | :—: | | of Husband |
Temperament of Wife
Quiet
Good
Natured
Good Natured| Good | | :—: | | Natured |
Sullen
Quiet 850 571 580
Good Natured 618 593 455
Sullen 540 456 457
“Temperament of Husband” Temperament of Wife Quiet “Good Natured” Sullen Quiet 850 571 580 Good Natured 618 593 455 Sullen 540 456 457| Temperament <br> of Husband | Temperament of Wife | | | | :— | :—: | :—: | :—: | | | Quiet | Good <br> Natured | Sullen | | Quiet | 850 | 571 | 580 | | Good Natured | 618 | 593 | 455 | | Sullen | 540 | 456 | 457 |
Determine and interpret the association between the temperament of husband and wife.

Answer:

To determine the association between the temperament of husband and wife, we can use the chi-square test for independence. This test will help us determine whether there is a significant association between the temperament categories of husbands and wives.

Step 1: Set Up the Contingency Table

The given data is:
Temperament of Husband Quiet Good Natured Sullen Quiet 850 571 580 Good Natured 618 593 455 Sullen 540 456 457 Temperament of Husband Quiet Good Natured Sullen Quiet 850 571 580 Good Natured 618 593 455 Sullen 540 456 457 {:[“Temperament of Husband”,”Quiet”,”Good Natured”,”Sullen”],[“Quiet”,850,571,580],[“Good Natured”,618,593,455],[“Sullen”,540,456,457]:}\begin{array}{|l|c|c|c|} \hline \text{Temperament of Husband} & \text{Quiet} & \text{Good Natured} & \text{Sullen} \\ \hline \text{Quiet} & 850 & 571 & 580 \\ \text{Good Natured} & 618 & 593 & 455 \\ \text{Sullen} & 540 & 456 & 457 \\ \hline \end{array}Temperament of HusbandQuietGood NaturedSullenQuiet850571580Good Natured618593455Sullen540456457

Step 2: Calculate the Row and Column Totals

Temperament of Husband Quiet Good Natured Sullen Row Total Quiet 850 571 580 2001 Good Natured 618 593 455 1666 Sullen 540 456 457 1453 Column Total 2008 1620 1492 5120 Temperament of Husband Quiet Good Natured Sullen Row Total Quiet 850 571 580 2001 Good Natured 618 593 455 1666 Sullen 540 456 457 1453 Column Total 2008 1620 1492 5120 {:[“Temperament of Husband”,”Quiet”,”Good Natured”,”Sullen”,”Row Total”],[“Quiet”,850,571,580,2001],[“Good Natured”,618,593,455,1666],[“Sullen”,540,456,457,1453],[“Column Total”,2008,1620,1492,5120]:}\begin{array}{|l|c|c|c|c|} \hline \text{Temperament of Husband} & \text{Quiet} & \text{Good Natured} & \text{Sullen} & \text{Row Total} \\ \hline \text{Quiet} & 850 & 571 & 580 & 2001 \\ \text{Good Natured} & 618 & 593 & 455 & 1666 \\ \text{Sullen} & 540 & 456 & 457 & 1453 \\ \hline \text{Column Total} & 2008 & 1620 & 1492 & 5120 \\ \hline \end{array}Temperament of HusbandQuietGood NaturedSullenRow TotalQuiet8505715802001Good Natured6185934551666Sullen5404564571453Column Total2008162014925120

Step 3: Calculate the Expected Frequencies

The expected frequency for each cell in a contingency table is calculated using the formula:
E i j = ( Row Total i × Column Total j ) Grand Total E i j = ( Row Total i × Column Total j ) Grand Total E_(ij)=((“Row Total”_(i)xx”Column Total”_(j)))/(“Grand Total”)E_{ij} = \frac{(\text{Row Total}_i \times \text{Column Total}_j)}{\text{Grand Total}}Eij=(Row Totali×Column Totalj)Grand Total
For example, the expected frequency for the cell corresponding to "Quiet Husband" and "Quiet Wife" is:
E 11 = ( 2001 × 2008 ) 5120 784.7 E 11 = ( 2001 × 2008 ) 5120 784.7 E_(11)=((2001 xx2008))/(5120)~~784.7E_{11} = \frac{(2001 \times 2008)}{5120} \approx 784.7E11=(2001×2008)5120784.7
We will calculate the expected frequencies for all cells:
  • Quiet Husband, Quiet Wife: 2001 × 2008 5120 784.7 2001 × 2008 5120 784.7 (2001 xx2008)/(5120)~~784.7\frac{2001 \times 2008}{5120} \approx 784.72001×20085120784.7
  • Quiet Husband, Good Natured Wife: 2001 × 1620 5120 633.5 2001 × 1620 5120 633.5 (2001 xx1620)/(5120)~~633.5\frac{2001 \times 1620}{5120} \approx 633.52001×16205120633.5
  • Quiet Husband, Sullen Wife: 2001 × 1492 5120 583.8 2001 × 1492 5120 583.8 (2001 xx1492)/(5120)~~583.8\frac{2001 \times 1492}{5120} \approx 583.82001×14925120583.8
  • Good Natured Husband, Quiet Wife: 1666 × 2008 5120 653.3 1666 × 2008 5120 653.3 (1666 xx2008)/(5120)~~653.3\frac{1666 \times 2008}{5120} \approx 653.31666×20085120653.3
  • Good Natured Husband, Good Natured Wife: 1666 × 1620 5120 527.1 1666 × 1620 5120 527.1 (1666 xx1620)/(5120)~~527.1\frac{1666 \times 1620}{5120} \approx 527.11666×16205120527.1
  • Good Natured Husband, Sullen Wife: 1666 × 1492 5120 484.6 1666 × 1492 5120 484.6 (1666 xx1492)/(5120)~~484.6\frac{1666 \times 1492}{5120} \approx 484.61666×14925120484.6
  • Sullen Husband, Quiet Wife: 1453 × 2008 5120 570.0 1453 × 2008 5120 570.0 (1453 xx2008)/(5120)~~570.0\frac{1453 \times 2008}{5120} \approx 570.01453×20085120570.0
  • Sullen Husband, Good Natured Wife: 1453 × 1620 5120 462.2 1453 × 1620 5120 462.2 (1453 xx1620)/(5120)~~462.2\frac{1453 \times 1620}{5120} \approx 462.21453×16205120462.2
  • Sullen Husband, Sullen Wife: 1453 × 1492 5120 425.6 1453 × 1492 5120 425.6 (1453 xx1492)/(5120)~~425.6\frac{1453 \times 1492}{5120} \approx 425.61453×14925120425.6

Step 4: Calculate the Chi-Square Statistic

The chi-square statistic is calculated using the formula:
χ 2 = ( O i j E i j ) 2 E i j χ 2 = ( O i j E i j ) 2 E i j chi^(2)=sum((O_(ij)-E_(ij))^(2))/(E_(ij))\chi^2 = \sum \frac{(O_{ij} – E_{ij})^2}{E_{ij}}χ2=(OijEij)2Eij
Where O i j O i j O_(ij)O_{ij}Oij are the observed frequencies and E i j E i j E_(ij)E_{ij}Eij are the expected frequencies.
For each cell, calculate ( O i j E i j ) 2 E i j ( O i j E i j ) 2 E i j ((O_(ij)-E_(ij))^(2))/(E_(ij))\frac{(O_{ij} – E_{ij})^2}{E_{ij}}(OijEij)2Eij:
  • (850 – 784.7)^2 / 784.7 ≈ 5.21
  • (571 – 633.5)^2 / 633.5 ≈ 3.96
  • (580 – 583.8)^2 / 583.8 ≈ 0.02
  • (618 – 653.3)^2 / 653.3 ≈ 1.83
  • (593 – 527.1)^2 / 527.1 ≈ 7.23
  • (455 – 484.6)^2 / 484.6 ≈ 1.89
  • (540 – 570.0)^2 / 570.0 ≈ 1.58
  • (456 – 462.2)^2 / 462.2 ≈ 0.08
  • (457 – 425.6)^2 / 425.6 ≈ 2.42
Sum these values to get the chi-square statistic:
χ 2 5.21 + 3.96 + 0.02 + 1.83 + 7.23 + 1.89 + 1.58 + 0.08 + 2.42 = 24.22 χ 2 5.21 + 3.96 + 0.02 + 1.83 + 7.23 + 1.89 + 1.58 + 0.08 + 2.42 = 24.22 chi^(2)~~5.21+3.96+0.02+1.83+7.23+1.89+1.58+0.08+2.42=24.22\chi^2 \approx 5.21 + 3.96 + 0.02 + 1.83 + 7.23 + 1.89 + 1.58 + 0.08 + 2.42 = 24.22χ25.21+3.96+0.02+1.83+7.23+1.89+1.58+0.08+2.42=24.22

Step 5: Determine the Degrees of Freedom and Critical Value

Degrees of freedom for a contingency table is calculated as:
d f = ( r 1 ) ( c 1 ) d f = ( r 1 ) ( c 1 ) df=(r-1)(c-1)df = (r – 1)(c – 1)df=(r1)(c1)
where r r rrr is the number of rows and c c ccc is the number of columns.
In this case:
d f = ( 3 1 ) ( 3 1 ) = 2 × 2 = 4 d f = ( 3 1 ) ( 3 1 ) = 2 × 2 = 4 df=(3-1)(3-1)=2xx2=4df = (3 – 1)(3 – 1) = 2 \times 2 = 4df=(31)(31)=2×2=4
Using a chi-square distribution table, we find the critical value for α = 0.05 α = 0.05 alpha=0.05\alpha = 0.05α=0.05 and d f = 4 d f = 4 df=4df = 4df=4:
χ c r i t i c a l 2 9.488 χ c r i t i c a l 2 9.488 chi_(critical)^(2)~~9.488\chi^2_{critical} \approx 9.488χcritical29.488

Step 6: Compare the Chi-Square Statistic with the Critical Value

Since χ 2 = 24.22 χ 2 = 24.22 chi^(2)=24.22\chi^2 = 24.22χ2=24.22 is greater than χ c r i t i c a l 2 = 9.488 χ c r i t i c a l 2 = 9.488 chi_(critical)^(2)=9.488\chi^2_{critical} = 9.488χcritical2=9.488, we reject the null hypothesis.

Interpretation

There is a significant association between the temperament of husbands and wives. The observed frequencies differ significantly from the expected frequencies, suggesting that the temperament of one partner is related to the temperament of the other.

Question:-07

7.(a) Suppose a student of PGDAST calculated r 12 = 0.90 , r 13 = 0.30 r 12 = 0.90 , r 13 = 0.30 r_(12)=0.90,r_(13)=0.30r_{12}=0.90, r_{13}=0.30r12=0.90,r13=0.30 and r 23 = 0.70 r 23 = 0.70 r_(23)=0.70r_{23}=0.70r23=0.70 from a data set. Examine whether these computations are error free.

Answer:

To examine whether the computations of the correlation coefficients r 12 = 0.90 r 12 = 0.90 r_(12)=0.90r_{12} = 0.90r12=0.90, r 13 = 0.30 r 13 = 0.30 r_(13)=0.30r_{13} = 0.30r13=0.30, and r 23 = 0.70 r 23 = 0.70 r_(23)=0.70r_{23} = 0.70r23=0.70 are error-free, we can use the property of correlation coefficients and the concept of the determinant of the correlation matrix.

Step 1: Correlation Matrix

Construct the correlation matrix R R RRR:
R = ( 1 r 12 r 13 r 12 1 r 23 r 13 r 23 1 ) R = 1 r 12 r 13 r 12 1 r 23 r 13 r 23 1 R=([1,r_(12),r_(13)],[r_(12),1,r_(23)],[r_(13),r_(23),1])R = \begin{pmatrix} 1 & r_{12} & r_{13} \\ r_{12} & 1 & r_{23} \\ r_{13} & r_{23} & 1 \\ \end{pmatrix}R=(1r12r13r121r23r13r231)
Given values:
R = ( 1 0.90 0.30 0.90 1 0.70 0.30 0.70 1 ) R = 1 0.90 0.30 0.90 1 0.70 0.30 0.70 1 R=([1,0.90,0.30],[0.90,1,0.70],[0.30,0.70,1])R = \begin{pmatrix} 1 & 0.90 & 0.30 \\ 0.90 & 1 & 0.70 \\ 0.30 & 0.70 & 1 \\ \end{pmatrix}R=(10.900.300.9010.700.300.701)

Step 2: Check for Positive Semidefiniteness

The determinant of the correlation matrix should be non-negative for the matrix to be positive semidefinite, which is a requirement for a valid correlation matrix.
Calculate the determinant of R R RRR:
det ( R ) = 1 ( 1 1 0.70 0.70 ) 0.90 ( 0.90 1 0.30 0.70 ) + 0.30 ( 0.90 0.70 0.30 1 ) det ( R ) = 1 1 1 0.70 0.70 0.90 0.90 1 0.30 0.70 + 0.30 0.90 0.70 0.30 1 “det”(R)=1(1*1-0.70*0.70)-0.90(0.90*1-0.30*0.70)+0.30(0.90*0.70-0.30*1)\text{det}(R) = 1 \left(1 \cdot 1 – 0.70 \cdot 0.70 \right) – 0.90 \left(0.90 \cdot 1 – 0.30 \cdot 0.70 \right) + 0.30 \left(0.90 \cdot 0.70 – 0.30 \cdot 1 \right)det(R)=1(110.700.70)0.90(0.9010.300.70)+0.30(0.900.700.301)
Simplify the calculations:
det ( R ) = 1 ( 1 0.49 ) 0.90 ( 0.90 0.21 ) + 0.30 ( 0.63 0.30 ) det ( R ) = 1 ( 1 0.49 ) 0.90 ( 0.90 0.21 ) + 0.30 ( 0.63 0.30 ) “det”(R)=1(1-0.49)-0.90(0.90-0.21)+0.30(0.63-0.30)\text{det}(R) = 1 (1 – 0.49) – 0.90 (0.90 – 0.21) + 0.30 (0.63 – 0.30)det(R)=1(10.49)0.90(0.900.21)+0.30(0.630.30)
det ( R ) = 1 ( 0.51 ) 0.90 ( 0.69 ) + 0.30 ( 0.33 ) det ( R ) = 1 ( 0.51 ) 0.90 ( 0.69 ) + 0.30 ( 0.33 ) “det”(R)=1(0.51)-0.90(0.69)+0.30(0.33)\text{det}(R) = 1 (0.51) – 0.90 (0.69) + 0.30 (0.33)det(R)=1(0.51)0.90(0.69)+0.30(0.33)
det ( R ) = 0.51 0.621 + 0.099 det ( R ) = 0.51 0.621 + 0.099 “det”(R)=0.51-0.621+0.099\text{det}(R) = 0.51 – 0.621 + 0.099det(R)=0.510.621+0.099
det ( R ) = 0.51 0.621 + 0.099 = 0.012 det ( R ) = 0.51 0.621 + 0.099 = 0.012 “det”(R)=0.51-0.621+0.099=-0.012\text{det}(R) = 0.51 – 0.621 + 0.099 = -0.012det(R)=0.510.621+0.099=0.012

Step 3: Interpretation

The determinant of the correlation matrix R R RRR is 0.012 0.012 -0.012-0.0120.012, which is negative. For a valid correlation matrix, the determinant should be non-negative (i.e., 0 0 >= 0\geq 00).

Conclusion

Since the determinant of the correlation matrix is negative, the given correlation coefficients r 12 = 0.90 r 12 = 0.90 r_(12)=0.90r_{12} = 0.90r12=0.90, r 13 = 0.30 r 13 = 0.30 r_(13)=0.30r_{13} = 0.30r13=0.30, and r 23 = 0.70 r 23 = 0.70 r_(23)=0.70r_{23} = 0.70r23=0.70 cannot all be correct simultaneously. Thus, the computations are not error-free. At least one of the given correlation coefficients must be incorrect.

(b) (i) Explain the method of least squares.
(ii) Fit an equation of the form y = a b X y = a b X y=ab^(X)y=a b^{\mathrm{X}}y=abX on the following data using the method of least squares.
X X X\mathrm{X}X Y Y Y\mathrm{Y}Y
2 144
3 172
4 207
5 248
6 298
X Y 2 144 3 172 4 207 5 248 6 298| $\mathrm{X}$ | $\mathrm{Y}$ | | :—: | :—: | | 2 | 144 | | 3 | 172 | | 4 | 207 | | 5 | 248 | | 6 | 298 |

Answer:

(i) Explanation of the Method of Least Squares

The method of least squares is a standard approach in regression analysis to approximate the solution of overdetermined systems (sets of equations in which there are more equations than unknowns). It minimizes the sum of the squares of the residuals (the differences between observed and calculated values).

Steps in the Method of Least Squares:

  1. Model Specification:
    • Define the mathematical form of the relationship between the dependent variable y y yyy and the independent variable x x xxx. Common forms include linear y = a + b x y = a + b x y=a+bxy = a + bxy=a+bx, exponential y = a b x y = a b x y=ab^(x)y = ab^xy=abx, etc.
  2. Formulate the Objective Function:
    • For a given set of data points ( x i , y i ) ( x i , y i ) (x_(i),y_(i))(x_i, y_i)(xi,yi), the residual for each point is the difference between the observed value and the value predicted by the model.
    • The sum of the squares of these residuals is the objective function that needs to be minimized: S = i = 1 n ( y i y ^ i ) 2 S = i = 1 n ( y i y ^ i ) 2 S=sum_(i=1)^(n)(y_(i)- hat(y)_(i))^(2)S = \sum_{i=1}^{n} (y_i – \hat{y}_i)^2S=i=1n(yiy^i)2
  3. Derive the Normal Equations:
    • Calculate the partial derivatives of S S SSS with respect to the model parameters and set them to zero. This yields a system of normal equations.
  4. Solve the Normal Equations:
    • Solve these equations to obtain estimates of the model parameters.

(ii) Fitting an Equation of the Form y = a b X y = a b X y=ab^(X)y = a b^Xy=abX

Given data:
X Y 2 144 3 172 4 207 5 248 6 298 X Y 2 144 3 172 4 207 5 248 6 298 {:[X,Y],[2,144],[3,172],[4,207],[5,248],[6,298]:}\begin{array}{|c|c|} \hline X & Y \\ \hline 2 & 144 \\ 3 & 172 \\ 4 & 207 \\ 5 & 248 \\ 6 & 298 \\ \hline \end{array}XY21443172420752486298
To fit the equation y = a b X y = a b X y=ab^(X)y = ab^Xy=abX using the method of least squares, we first transform it into a linear form by taking the natural logarithm on both sides:
ln ( y ) = ln ( a ) + X ln ( b ) ln ( y ) = ln ( a ) + X ln ( b ) ln(y)=ln(a)+X ln(b)\ln(y) = \ln(a) + X \ln(b)ln(y)=ln(a)+Xln(b)
Let Y = ln ( y ) Y = ln ( y ) Y^(‘)=ln(y)Y’ = \ln(y)Y=ln(y), A = ln ( a ) A = ln ( a ) A=ln(a)A = \ln(a)A=ln(a), and B = ln ( b ) B = ln ( b ) B=ln(b)B = \ln(b)B=ln(b). Then the equation becomes:
Y = A + B X Y = A + B X Y^(‘)=A+BXY’ = A + BXY=A+BX
We can now use the method of least squares to fit a linear equation to the transformed data.

Transform the Data:

X Y Y = ln ( Y ) 2 144 ln ( 144 ) 4.9698 3 172 ln ( 172 ) 5.1475 4 207 ln ( 207 ) 5.3327 5 248 ln ( 248 ) 5.5175 6 298 ln ( 298 ) 5.6971 X Y Y = ln ( Y ) 2 144 ln ( 144 ) 4.9698 3 172 ln ( 172 ) 5.1475 4 207 ln ( 207 ) 5.3327 5 248 ln ( 248 ) 5.5175 6 298 ln ( 298 ) 5.6971 {:[X,Y,Y^(‘)=ln(Y)],[2,144,ln(144)~~4.9698],[3,172,ln(172)~~5.1475],[4,207,ln(207)~~5.3327],[5,248,ln(248)~~5.5175],[6,298,ln(298)~~5.6971]:}\begin{array}{|c|c|c|} \hline X & Y & Y’ = \ln(Y) \\ \hline 2 & 144 & \ln(144) \approx 4.9698 \\ 3 & 172 & \ln(172) \approx 5.1475 \\ 4 & 207 & \ln(207) \approx 5.3327 \\ 5 & 248 & \ln(248) \approx 5.5175 \\ 6 & 298 & \ln(298) \approx 5.6971 \\ \hline \end{array}XYY=ln(Y)2144ln(144)4.96983172ln(172)5.14754207ln(207)5.33275248ln(248)5.51756298ln(298)5.6971

Calculate the Necessary Sums:

X = 2 + 3 + 4 + 5 + 6 = 20 X = 2 + 3 + 4 + 5 + 6 = 20 sum X=2+3+4+5+6=20\sum X = 2 + 3 + 4 + 5 + 6 = 20X=2+3+4+5+6=20
Y = 4.9698 + 5.1475 + 5.3327 + 5.5175 + 5.6971 26.6646 Y = 4.9698 + 5.1475 + 5.3327 + 5.5175 + 5.6971 26.6646 sumY^(‘)=4.9698+5.1475+5.3327+5.5175+5.6971~~26.6646\sum Y’ = 4.9698 + 5.1475 + 5.3327 + 5.5175 + 5.6971 \approx 26.6646Y=4.9698+5.1475+5.3327+5.5175+5.697126.6646
X 2 = 2 2 + 3 2 + 4 2 + 5 2 + 6 2 = 4 + 9 + 16 + 25 + 36 = 90 X 2 = 2 2 + 3 2 + 4 2 + 5 2 + 6 2 = 4 + 9 + 16 + 25 + 36 = 90 sumX^(2)=2^(2)+3^(2)+4^(2)+5^(2)+6^(2)=4+9+16+25+36=90\sum X^2 = 2^2 + 3^2 + 4^2 + 5^2 + 6^2 = 4 + 9 + 16 + 25 + 36 = 90X2=22+32+42+52+62=4+9+16+25+36=90
X Y = 2 × 4.9698 + 3 × 5.1475 + 4 × 5.3327 + 5 × 5.5175 + 6 × 5.6971 X Y = 2 × 4.9698 + 3 × 5.1475 + 4 × 5.3327 + 5 × 5.5175 + 6 × 5.6971 sum XY^(‘)=2xx4.9698+3xx5.1475+4xx5.3327+5xx5.5175+6xx5.6971\sum XY’ = 2 \times 4.9698 + 3 \times 5.1475 + 4 \times 5.3327 + 5 \times 5.5175 + 6 \times 5.6971XY=2×4.9698+3×5.1475+4×5.3327+5×5.5175+6×5.6971
9.9396 + 15.4425 + 21.3308 + 27.5875 + 34.1826 108.483 9.9396 + 15.4425 + 21.3308 + 27.5875 + 34.1826 108.483 ~~9.9396+15.4425+21.3308+27.5875+34.1826~~108.483\approx 9.9396 + 15.4425 + 21.3308 + 27.5875 + 34.1826 \approx 108.4839.9396+15.4425+21.3308+27.5875+34.1826108.483

Solve the Normal Equations:

The normal equations for the least squares fit are:
n A + B X = Y n A + B X = Y nA+B sum X=sumY^(‘)nA + B \sum X = \sum Y’nA+BX=Y
A X + B X 2 = X Y A X + B X 2 = X Y A sum X+B sumX^(2)=sum XY^(‘)A \sum X + B \sum X^2 = \sum XY’AX+BX2=XY
Substituting the sums:
5 A + 20 B = 26.6646 5 A + 20 B = 26.6646 5A+20 B=26.66465A + 20B = 26.66465A+20B=26.6646
20 A + 90 B = 108.483 20 A + 90 B = 108.483 20 A+90 B=108.48320A + 90B = 108.48320A+90B=108.483

Solve for A A AAA and B B BBB:

Multiply the first equation by 4:
20 A + 80 B = 106.6584 20 A + 80 B = 106.6584 20 A+80 B=106.658420A + 80B = 106.658420A+80B=106.6584
Subtract this from the second equation:
( 20 A + 90 B ) ( 20 A + 80 B ) = 108.483 106.6584 ( 20 A + 90 B ) ( 20 A + 80 B ) = 108.483 106.6584 (20 A+90 B)-(20 A+80 B)=108.483-106.6584(20A + 90B) – (20A + 80B) = 108.483 – 106.6584(20A+90B)(20A+80B)=108.483106.6584
10 B = 1.8246 10 B = 1.8246 10 B=1.824610B = 1.824610B=1.8246
B = 0.18246 B = 0.18246 B=0.18246B = 0.18246B=0.18246
Substitute B B BBB back into the first equation:
5 A + 20 × 0.18246 = 26.6646 5 A + 20 × 0.18246 = 26.6646 5A+20 xx0.18246=26.66465A + 20 \times 0.18246 = 26.66465A+20×0.18246=26.6646
5 A + 3.6492 = 26.6646 5 A + 3.6492 = 26.6646 5A+3.6492=26.66465A + 3.6492 = 26.66465A+3.6492=26.6646
5 A = 23.0154 5 A = 23.0154 5A=23.01545A = 23.01545A=23.0154
A = 4.60308 A = 4.60308 A=4.60308A = 4.60308A=4.60308
So, ln ( a ) = A = 4.60308 ln ( a ) = A = 4.60308 ln(a)=A=4.60308\ln(a) = A = 4.60308ln(a)=A=4.60308 and ln ( b ) = B = 0.18246 ln ( b ) = B = 0.18246 ln(b)=B=0.18246\ln(b) = B = 0.18246ln(b)=B=0.18246.

Exponentiate to Get a a aaa and b b bbb:

a = e 4.60308 100.0 a = e 4.60308 100.0 a=e^(4.60308)~~100.0a = e^{4.60308} \approx 100.0a=e4.60308100.0
b = e 0.18246 1.2001 b = e 0.18246 1.2001 b=e^(0.18246)~~1.2001b = e^{0.18246} \approx 1.2001b=e0.182461.2001

The Fitted Equation:

The equation of the form y = a b X y = a b X y=ab^(X)y = ab^Xy=abX fitted to the data is:
y = 100.0 × 1.2001 X y = 100.0 × 1.2001 X y=100.0 xx1.2001 ^(X)y = 100.0 \times 1.2001^Xy=100.0×1.2001X

Scroll to Top
Scroll to Top