Sample Solution

1(a) State whether the following statements are true or false and also give the reason in support of your answer.
(i) We define three indicator variables for an explanatory variable with three categories.
Answer:
The statement is somewhat ambiguous, so I'll assume the question asks whether it is correct to define three indicator (dummy) variables for an explanatory variable that has three categories. Let's clarify this and provide a complete answer.

Statement:

"We define three indicator variables for an explanatory variable with three categories."

Explanation:

In the context of regression analysis, when we have a categorical explanatory variable with $k$ categories, we typically use $k-1$ indicator (dummy) variables. This approach prevents perfect multicollinearity (also known as the dummy variable trap), where the dummy variables are perfectly collinear with the intercept term.
Let's assume we have a categorical variable $X$ with three categories: A, B, and C. Here's how we typically define the dummy variables:
  1. Indicator variable 1 ($D_1$):
    • $D_1 = 1$ if the observation belongs to category A
    • $D_1 = 0$ otherwise
  2. Indicator variable 2 ($D_2$):
    • $D_2 = 1$ if the observation belongs to category B
    • $D_2 = 0$ otherwise
We do not need a third indicator variable for category C because its presence is already implied when $D_1$ and $D_2$ are both 0.
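For reference, here is a minimal sketch of this coding in Python (assuming pandas is available; the sample values of $X$ are hypothetical):

import pandas as pd

# Hypothetical observations of the categorical variable X.
X = pd.Series(["A", "B", "C", "A", "C", "B"], name="X")

# Full one-hot coding would produce three columns (A, B, C); keeping
# only A and B gives the two indicators D1 and D2 defined above, with
# C as the implied reference category (D1 = D2 = 0).
dummies = pd.get_dummies(X)[["A", "B"]].rename(columns={"A": "D1", "B": "D2"})
print(dummies)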

Justification:

If we create three dummy variables for a categorical variable with three categories, we will encounter perfect multicollinearity. Here’s why:
Suppose $X$ has categories A, B, and C, and we create three indicator variables $D_1$, $D_2$, and $D_3$:
  • $D_1$ for category A
  • $D_2$ for category B
  • $D_3$ for category C
In this case, there is a linear relationship among these dummy variables:
$D_1 + D_2 + D_3 = 1$
This relationship implies perfect multicollinearity, which makes the regression coefficients indeterminate because the design matrix becomes rank deficient, so $X^{\top}X$ is singular (not invertible).
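This rank deficiency is easy to verify numerically. The following sketch (assuming NumPy, with a small made-up design) shows that an intercept plus all three dummies is rank deficient, while an intercept plus two dummies has full column rank:

import numpy as np

# Four observations from categories A, B, C, A, one-hot coded as D1, D2, D3.
D = np.array([[1, 0, 0],
              [0, 1, 0],
              [0, 0, 1],
              [1, 0, 0]])
intercept = np.ones((4, 1))

X_full = np.hstack([intercept, D])            # intercept + D1 + D2 + D3
X_reduced = np.hstack([intercept, D[:, :2]])  # intercept + D1 + D2

print(np.linalg.matrix_rank(X_full))     # 3, but there are 4 columns -> X'X singular
print(np.linalg.matrix_rank(X_reduced))  # 3, equal to the number of columns -> full rank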

Correct Approach:

Define only $k-1$ dummy variables for $k$ categories to avoid multicollinearity. Thus, for three categories, we define only two dummy variables.

Conclusion:

The statement "We define three indicator variables for an explanatory variable with three categories" is false. We should define k 1 k 1 k-1k-1k1 indicator variables for k k kkk categories to avoid multicollinearity.

Example:

Let's create an example with three categories:
  • $X$ = A, B, C (categorical variable)
Define two indicator variables:
  • $D_1 = 1$ if $X$ = A, 0 otherwise
  • $D_2 = 1$ if $X$ = B, 0 otherwise
Category $X$ = C is implied when $D_1 = 0$ and $D_2 = 0$.
When we run a regression model with these two dummy variables, we avoid multicollinearity and can interpret the coefficients appropriately.
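As an illustration, here is a sketch of such a regression in Python (assuming statsmodels; the response values y are made up purely for demonstration):

import pandas as pd
import statsmodels.formula.api as smf

df = pd.DataFrame({
    "X": ["A", "B", "C", "A", "B", "C", "A", "B", "C"],
    "y": [2.1, 3.4, 1.8, 2.3, 3.1, 2.0, 2.2, 3.3, 1.9],  # hypothetical response
})

# Treatment(reference="C") makes C the omitted baseline, matching the
# example above where D1 = D2 = 0 identifies category C. The formula
# interface applies k-1 dummy coding automatically.
model = smf.ols("y ~ C(X, Treatment(reference='C'))", data=df).fit()
print(model.params)  # intercept (mean of C) plus coefficients for A and B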
(ii) If the coefficient of determination is 0.833, the number of observations and explanatory variables are 12 and 3, respectively, then the Adjusted $R^2$ will be 0.84.
Answer:
To determine whether the statement is true, we need to calculate the Adjusted $R^2$ and compare it to 0.84.

Definitions and Formulas:

  1. Coefficient of determination ($R^2$):
    $R^2 = 0.833$
  2. Number of observations ($n$):
    $n = 12$
  3. Number of explanatory variables ($k$):
    $k = 3$
  4. Adjusted $R^2$ formula:
    $\text{Adjusted } R^2 = 1 - \dfrac{(1 - R^2)(n - 1)}{n - k - 1}$

Calculation:

  1. Calculate the numerator:
    $1 - R^2 = 1 - 0.833 = 0.167$
  2. Calculate the degrees of freedom adjustment:
    $n - 1 = 12 - 1 = 11$
    $n - k - 1 = 12 - 3 - 1 = 8$
  3. Calculate the fraction:
    $\dfrac{(1 - R^2)(n - 1)}{n - k - 1} = \dfrac{0.167 \times 11}{8} = \dfrac{1.837}{8} = 0.229625$
  4. Calculate the Adjusted $R^2$:
    $\text{Adjusted } R^2 = 1 - 0.229625 = 0.770375$
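The arithmetic can be verified with a few lines of plain Python:

# Adjusted R^2 from the formula above.
r2, n, k = 0.833, 12, 3
adj_r2 = 1 - (1 - r2) * (n - 1) / (n - k - 1)
print(round(adj_r2, 6))  # 0.770375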

Conclusion:

The calculated Adjusted $R^2$ is approximately 0.7704, not 0.84.
Thus, the statement "If the coefficient of determination is 0.833, the number of observations and explanatory variables are 12 and 3, respectively, then the Adjusted $R^2$ will be 0.84" is false.