L5 - Regression Analysis II Flashcards
What is a spurious correlation?
Relationship between income and height, but gender is the
responsible factor
What is a masked relationship?
Example: Positive relationship between happiness and debt, but
relationship reverses when controlling for income (both happiness and debt are positively related to income)
How can spurious correlations and masked relationships be detected?
By doing multiple linear regression
partial correlation
interpretation of the ß
Interpretation of the ß = standardized regression coefficient: A one unit increase in X so a 1 SD increase in X leads to an e.g. .546 units SD in Y.
F-test
The F-test of overall significance indicates whether your linear regression model provides a better fit to the data than a model that contains no independent variables.
How can you evaluate if the value of the regression coefficients b are significantly different from 0?
t-test
How much variance in the outcome variable
is accounted for by the predictor? Which test?
R-squared
What can you do if there is a nonlinear relationship between DV and IV?
add a quadratic component.
What is a lack of multicollinearity?
Overlap among predictors should not be too large (i.e., there should be no perfect linear relationship between two or more predictors)
What happens under multicollinearity?
Under multicollinearity, the regression coefficients may be unstable (i.e., it will be difficult to assess the individual importance of a predictor) and their standard errors large
What is a problematic pearson correlatino for multicollinearity?
r > .8
What is the tolerance in multicollinearity?
What tolerance is a serious problem?
A small tolerance is problematic like .1 (.02 is a potential problem)
What is the VIF
Variance Inflation Factor = 1/ Tolerance
- largest VIF should not be greater than 10
- average should not be much greater than 1