Multicollinearity Flashcards

1
Q

Perfect Multicollinearity

A

R^2 = 1: an exact (perfect) linear relationship exists among the explanatory variables.
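
A minimal numpy sketch (all values illustrative) of why this breaks OLS: with an exact linear relationship among the columns, the design matrix is rank deficient and X'X cannot be inverted, so no unique least-squares solution exists.

```python
import numpy as np

rng = np.random.default_rng(0)
x1 = rng.normal(size=50)
x2 = 3 * x1 + 2                          # x2 is an exact linear function of x1
X = np.column_stack([np.ones(50), x1, x2])

print(np.linalg.matrix_rank(X))          # 2, not 3: the design matrix is rank deficient
print(np.linalg.cond(X.T @ X))           # huge condition number; X'X is not invertible
```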

2
Q

What happens to the estimators when perfect multicollinearity exists?

A

OLS cannot identify unique estimates for the parameters, and therefore no statistical inference (i.e., hypothesis testing) can be drawn from the sample.

3
Q

Near (Imperfect) Multicollinearity

A

Two or more explanatory variables are highly, but not exactly, linearly related.

4
Q

Inferior Good

A

A good for which demand declines as income increases.

5
Q

r

A

Coefficient of correlation; used to measure the strength or degree of collinearity between two variables. May not be adequate when more than two variables are involved.

6
Q

Ordinary Least Squares (OLS)

A

OLS produces estimators with the smallest variances among all linear unbiased estimators; they are BLUE: Best Linear Unbiased Estimators. OLS estimators remain BLUE even when one or more of the partial regression coefficients is statistically insignificant.

7
Q

Unbiasedness

A

In repeated sampling, the expected value of the estimator equals the true population parameter; unbiasedness is a repeated-sampling property, not a guarantee about any single estimate.
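
A small Monte Carlo sketch of the repeated-sampling idea (true coefficients and sample sizes are arbitrary): individual estimates bounce around, but their average sits on the true parameter values.

```python
import numpy as np

rng = np.random.default_rng(1)
beta_true = np.array([1.0, 2.0])         # true intercept and slope
estimates = []
for _ in range(5000):                    # draw 5000 independent samples
    x = rng.normal(size=100)
    X = np.column_stack([np.ones(100), x])
    y = X @ beta_true + rng.normal(size=100)
    b, *_ = np.linalg.lstsq(X, y, rcond=None)
    estimates.append(b)

print(np.mean(estimates, axis=0))        # approximately [1.0, 2.0]: unbiased
```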

8
Q

Consequences of Multicollinearity

A
  1. OLS retains the minimum-variance property; however, the numerical value of the variance need not be small.
  2. Large variances and standard errors, and therefore wider confidence intervals
  3. Small t values; insignificant t ratios
  4. Failure to reject the null hypothesis, resulting in a Type II error
  5. Cannot reliably estimate each X's individual influence on Y
  6. High R^2 but few statistically significant t ratios (illustrated in the simulation sketch after this list)
  7. OLS estimators and their standard errors become more sensitive to small changes in the data
  8. Wrong signs for regression coefficients
  9. Difficulty in assessing the individual contributions of explanatory variables to the explained sum of squares or R^2, because the variables are so collinear that when one moves the other does also, making their effects impossible to separate
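
A hedged simulation sketch (all numbers illustrative) of consequences 2, 3, and 6: two nearly collinear regressors yield a high overall R^2 while the individual standard errors balloon and the t ratios collapse.

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(2)
x1 = rng.normal(size=100)
x2 = x1 + rng.normal(scale=0.01, size=100)   # nearly a copy of x1
y = 1 + 2 * x1 + 3 * x2 + rng.normal(size=100)

res = sm.OLS(y, sm.add_constant(np.column_stack([x1, x2]))).fit()
print(res.rsquared)                          # high: the model fits well overall
print(res.bse)                               # inflated standard errors on x1 and x2
print(res.tvalues)                           # small, insignificant individual t ratios
```
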
9
Q

What kind of problem is Multicollinearity?

A

It is a sampling (regression) phenomenon: in some samples the X's can be so collinear that regression analysis breaks down, even though the X's may not be linearly related in the population. The problem arises because data are typically nonexperimental, observed as they occur.

10
Q

t critical vs t value

A

The critical t values bound the rejection region of the t distribution; if the computed statistic falls outside them, we reject the null hypothesis at the chosen significance level. The t value is the statistic we compute: the estimated coefficient divided by its standard error.
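
A quick scipy sketch (significance level, degrees of freedom, estimate, and standard error all hypothetical) contrasting the two:

```python
from scipy import stats

dof = 30                                       # residual degrees of freedom
t_crit = stats.t.ppf(1 - 0.05 / 2, dof)        # critical value, two-tailed 5% test
b_hat, se = 0.8, 0.5                           # hypothetical coefficient and standard error
t_value = b_hat / se                           # the computed t statistic
print(t_crit, t_value, abs(t_value) > t_crit)  # reject H0 only if the last is True
```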

11
Q

Sample Specific

A
  1. A matter of degree, not of the mere presence or absence, of multicollinearity
  2. A condition of the X variables (assumed non-stochastic); a feature of the sample, not the population
12
Q

Indicators of Multicollinearity

A
  1. High R^2 but few significant t ratios
  2. High pairwise correlations among explanatory variables; above about 0.8, there is a possibility of multicollinearity
  3. Examination of partial correlations
  4. Subsidiary or auxiliary regressions
  5. Variance Inflation Factor
13
Q

Partial correlation coefficient

A

The correlation between two variables, holding the influence of the other X variables constant.
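
One standard way to compute it, sketched with numpy (function name and data are illustrative): residualize both variables on the controls, then correlate the residuals.

```python
import numpy as np

def partial_corr(x, y, controls):
    """Correlation of x and y after removing the linear influence of the controls."""
    Z = np.column_stack([np.ones(len(x)), controls])
    rx = x - Z @ np.linalg.lstsq(Z, x, rcond=None)[0]   # residual of x on Z
    ry = y - Z @ np.linalg.lstsq(Z, y, rcond=None)[0]   # residual of y on Z
    return np.corrcoef(rx, ry)[0, 1]

rng = np.random.default_rng(3)
x3 = rng.normal(size=200)
x1 = x3 + rng.normal(size=200)
x2 = x3 + rng.normal(size=200)
print(partial_corr(x1, x2, x3.reshape(-1, 1)))  # near 0 once x3 is held constant
```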

14
Q

Auxiliary Regression

A

Regress each X variable on the remaining X's and compute the corresponding R^2; these regressions are "subsidiary" or "auxiliary" to the main regression. The aim is to find each coefficient of determination and then test whether it is statistically significant using an F test. A high auxiliary R^2 is a surface indicator of collinearity.
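
A sketch of the procedure with statsmodels, assuming the explanatory variables sit in a pandas DataFrame `X_df` (names hypothetical):

```python
import statsmodels.api as sm

def auxiliary_r2(X_df):
    """Regress each X on all the other X's; return each auxiliary R^2 and its F-test p-value."""
    results = {}
    for col in X_df.columns:
        others = sm.add_constant(X_df.drop(columns=col))
        fit = sm.OLS(X_df[col], others).fit()
        results[col] = (fit.rsquared, fit.f_pvalue)   # high R^2 + small p-value => collinear
    return results
```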

15
Q

Variance Inflation Factor

A

VIF = 1/(1 - R^2), where R^2 comes from the auxiliary regression of that X on the other explanatory variables. As R^2 increases, the variance and standard error are inflated. VIF is undefined under perfect collinearity (R^2 = 1) and equals 1 when there is no collinearity (R^2 = 0).
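
statsmodels ships a helper for this; a small self-contained sketch (data illustrative):

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor

rng = np.random.default_rng(4)
x1 = rng.normal(size=100)
X_df = pd.DataFrame({"x1": x1, "x2": x1 + rng.normal(scale=0.1, size=100)})

X = sm.add_constant(X_df)                     # VIF expects the constant to be included
for i, name in enumerate(X.columns):
    print(name, variance_inflation_factor(X.values, i))   # a VIF above ~10 flags trouble
```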

16
Q

Additional takeaways

A

A high auxiliary R^2 can be counterbalanced by a low error variance or a large variation in X (the sum of squared deviations of X), though it doesn't have to be.
Multicollinearity by itself need not cause high standard errors.

17
Q

When is Multicollinearity bad/not so bad?

A
  1. Not so bad: when the model is used only as a predictor and the same relationship is expected to continue into the future (a big if)
  2. Bad news: when the objective is prediction plus reliable estimation of the individual parameters of the chosen model
18
Q

Correlation Diagnostics

A
  1. Correlation Matrix
  2. Auxiliary Regression

19
Q

Correlation Matrix

A

Computing the pairwise correlations among all of the explanatory variables.
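
With pandas this is a one-liner on a DataFrame of regressors; a self-contained sketch (data illustrative):

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(5)
x1 = rng.normal(size=100)
X_df = pd.DataFrame({"x1": x1,
                     "x2": x1 + rng.normal(scale=0.2, size=100),
                     "x3": rng.normal(size=100)})
print(X_df.corr().round(2))   # |r| > 0.8 (here x1 vs x2) suggests multicollinearity
```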

20
Q

Issues with Solutions

A

There is no surefire way to cure MC. Moreover, the problem lies with the sample and may not reflect the population, and the OLS estimators retain their BLUE property regardless.

21
Q

Solutions to Multicollinearity

A
  1. Dropping the Variable from the model
  2. New data or new sample
  3. Rethinking the model
  4. Prior Information about Some Parameters
  5. Transformation of Variables
22
Q

Dropping a variable from the model

A

You might want to remove a variable, but always return to theory first, because the regression may be economically appropriate as specified. Dropping a relevant variable can lead to model specification error, which results in biased estimates; don't drop a variable from an economically viable model just because of collinearity.

23
Q

Acquiring Additional Data or a New Sample

A

The new sample might not display as much MC as your original dataset, and a sample with a larger n may reduce some of the MC.

24
Q

Rethinking the model

A
  1. Did you omit any variables?
  2. Is it the correct functional form?
  3. Does it align with theory?
25
Q

Prior Information about some Parameters

A

Use a coefficient obtained in another study that you believe still holds (a tall order) and apply it to the regression: subtract coefficient × variable from y, then regress the result on the remaining variables. This method is also difficult because you must rely on extraneous prior information.
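
A sketch of the restricted regression, assuming a slope of 0.9 on x2 borrowed from a prior study (all values hypothetical):

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(6)
x1 = rng.normal(size=100)
x2 = x1 + rng.normal(scale=0.1, size=100)    # collinear with x1
y = 1 + 2 * x1 + 0.9 * x2 + rng.normal(size=100)

b2_prior = 0.9                    # coefficient believed from a prior study
y_star = y - b2_prior * x2        # subtract the known effect from y
res = sm.OLS(y_star, sm.add_constant(x1)).fit()
print(res.params)                 # intercept and the remaining slope on x1
```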

26
Q

Transformation of Variables

A

E.g., aggregate values are transformed to per-capita values; this can reduce some of the multicollinearity.
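
A pandas sketch of the per-capita transformation (column names hypothetical):

```python
import pandas as pd

df = pd.DataFrame({"gdp": [100.0, 110.0, 121.0],
                   "consumption": [80.0, 86.0, 92.0],
                   "population": [10.0, 10.4, 10.8]})
df["gdp_pc"] = df["gdp"] / df["population"]                   # per-capita GDP
df["consumption_pc"] = df["consumption"] / df["population"]   # per-capita consumption
# regress consumption_pc on gdp_pc instead of the highly collinear aggregates
```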