Multiple Regression Flashcards
1
Q
Properties of fitted regression line
A
- exogeneity assumption
- no-correlation between residuals and fitted values
- mean of residuals = 0
- variance decomposition = explained variance + unexplained variance
…
1
Q
Assumptions
A
- Normality of residuals - histogram, plots of residuals, formal tests
- Homoscedasticity - homogeneity of variance of residuals - scatterplots of residual variance, Levene’s test
- Linearity - scatterplot
- Non-correlated errors
2
Q
Cohen’s f values
A
0.02 - small
0.15 - medium
0.35 - large effect
3
Q
Measures of model fit
A
- R2 - might overestimate the explained variance while we add other variables
- adjusted R2 - can decrease
- R2 change
4
Q
Types of Influential Observations
A
- outliers
- leverage points
- influential observations - reinforcing, conflicting, shifting
5
Q
Multicollinearity - impact
A
- Estimation - unique variance of each predictor is reduced. when they’re not correlated - total explained variance = sum of unique explained variances by each predicot
- Explanation - problem in interpretation
sometimes small multicollinearity can be ignored
6
Q
Handling multicollinearity
A
- delete
- combine
- estimation techniques
- do nothing but don’t intepret in a regular way
- increase sample size
- dimensionality reduction
7
Q
Suppression
A
- inflated R^2 - very high
- high correlation between predictors, but only one of them is correlated with dependent
8
Q
Correlations
A
- Zero order - unique + shared -> (b+c)/total
- Partial - x&y but controlled for other variables -> b/total - (a+c)
- semi partial - unique variance of x -> b/total -> R^2 will decrease by this amount if we remove this variable from the model
9
Q
R^2 will decrease by which correlation?
A
squared part/ semi-partial - unique contribution of the variable
10
Q
suppression & correlation
A
when partial is higher that zero-order, we have suppression
11
Q
how can including suppressor improve the model?
A
- it can remove the irrelevant variance in the predictor - decrease the noise
12
Q
Interpretation with interaction
A
- conditional main effects
e.g. the income for females is less than males under the condition that years of education is zero - interaction term
females earn 60 euros multiplied by years of education less compared to men - difference in slope = slope of … is different
13
Q
interaction and follow-up analysis
A
- interaction with metric - simple slope
- interaction between metric and non-metric - simple slope
- interaction between non-metrics - simple effect