1.1 Basics of Multiple regression and underlying assumptions Flashcards
What are the assumptions of multiple regression?
a) Linear relationship exists between the dependent and independent variable
b) Residuals are normally distributed
c) Variance of the error terms is constant for all observations (heteroskedasticity)
d) Residual for one observation is not correlated with that of another observation (serial correlation)
e) Independent variables are not random, and there is no exact linear relation between any 2 or more independent variables (multicollinearity)
What are multiple regression models used for?
Multiple regression models are used to
a) Identify relationships between variables
b) Forecast variables
c) Test existing theories
Allows for considerations of multiple underlying influences on the dependent variable
What does multiple regression do
Multiple regression methodology estimates the intercept and slope coefficients such that the sum of the squared error terms is minimized.
What is the residual under multiple regression
The residual is the difference between the observed value and the predicted value
What is the use of the p-value?
- The idea is to use the p-value to evaluate the null hypothesis that a slope coefficient equals zero.
- P-value is the smallest level of significance (ᵅ)for which the null hypothesis can be rejected
- Significance of coefficients by comparing the p-value to the chosen significance level
- If the p-value < significance level, then the null hypothesis can be rejected
- If the p-value > significance level, then the null hypothesis can’t be rejected
How to interpret multiple regression results
- Intercept term is the value of the dependent variable when the independent variables are equal to zero
- Each slope coefficient is the estimated change in the dependent variable for 1 unit change in the independent variable while holding the other independent variables constant
o Slope coefficients in multiple regression are called partial slope coefficients