Multiple Linear Regression Flashcards
Write the regression equation.
y = b0 + b1 x + e
Why will the estimate value of y not be perfect?
Because there are residuals or errors, which is what ‘e’ stands for.
What is a regression line used for?
To make predictions about the value of the dependent variable based on a value of the predictor.
What happens to R^2 when more variables are added?
R^2 always increases even when the new variables have no predictive power.
How do we know if two models are nested?
If one model contains all the terms of the other, and at least one additional term.
What is b(0)?
The value of y variable when x variable=0.
What is b(1)?
This is the amount of change in variable y for one unit change of variable x.
Write the multiple regression equation.
y = b(0) + b(1)x(1) + b(2)x(2)… + b(n)x(n) + e
What is a fully specified model?
A model in which we have accounted for all factors that determine variation in the dependent variable (y).
Why can’t we usually have a fully specified model?
We cannot measure all the factors that affect y
What is the relationship between the t and p values?
As the t value increases, the p value increases
What is the t value for significance at 0.05 level of confidence?
+/- 1.96
What does rejecting the null hypothesis mean?
- Our relationship is not likely to have occurred by chance
- Our relationship is likely to be reflected in the population
What do we do when we want to add a categorical variable such as sex to the model?
We create a variable that takes the values “0” and “1” for men and women, respectively.
What do we do if there are multiple categories?
We create multiple dummy variables where one category is a reference and left out of the model.