WEEK 6 Flashcards
Write a formal multiple linear regression statistical model
yi = b0 + b1xi1 + b2xi2 + … + ei
Define multiple linear regression
Has a continuous response variable like a simple linear regression but it has more than one continuous predictor (x1, x2)
What are the basic assumptions of a multiple linear regression?
Linearity, normally distributed, residuals have homogeneous variances and predictors should not be strongly correlated (no collinearity)
How does the slope parameter differ between the linear and multiple regressions
MLR has 2 slopes.
The effect of the response on the predictor is controlled by the effect of the 2nd, 3rd … predictor
In another words, it explains the change in x2 per unit of Y holding x2 constant
Describe an interaction in MLR
The effect of X1 on Y depends on the level of X2 (vice-versa)
How do we account for interactions in MLR
We add an interaction coefficient to the model, a multiplicative effect
Eg. B3 (X1i * X2i)
How do we detect collinearity
Brainstorm which predictor variables in your model are likely to be correlated before building the model
Plot each variable against each other to check for effects
Calculate the tolerance and VIF
Lower tolerance is not good, T > 0.1 is really bad
High VIF is bad, VIF > 10 is really bad (strong correlation)
In SLR we call the slope a regression slope. What is it called in a MLR?
Partial regression slope.
What problems would we face when interpreting a model that has correlated variables?
Correlated predictors increase uncertainty in the effects of the predictor on the response variable, diminishing the power of the model.