chapter 14: multiple regression and model building Flashcards
what are multiple regression models?
regression models that employ more than one independent variable
why is it possible for the multiple regression model formula to be like the:
y = B0 + B1x1 + B2x2 + E
why are there 2 xs?
because the mean level (Uy) now is B0 + B1x1 + B2x2
this basically means that there are two different independent variables that can correlate or “Influence” the dependent variable “y”
E still remains the error term that causes y to deviate from the mean level
what is the new name for the mean level:
Uy = B0 + B1x1 + B2x2
the plane of means
its in a three dimensional space
what is B0 in Uy = B0 + B1x1 + B2x2?
it is still the y intercept
what is B1 in Uy = B0 + B1x1 + B2x2?
the regression parameter for the variable x1
the slope of the plane of the x1 direction
what is B2 in Uy = B0 + B1x1 + B2x2?
the regression parameter for the variable x2
the slope of the plane of the x2 direction
what is the error term E in Uy = B0 + B1x1 + B2x2?
the error term
what describes the effects on y other than x1 and x2
what is the formula of the point estimate or prediction of
y = B0 + B1x1 + B2x2 + E
what is the name of such equation
y^ = b0 + b1x1 + b2x2
called the least squared plane, the estimate of the plane of means
what is there no error term when we use
y^ = b0 + b1x1 + b2x2
to predict a point of
y = B0 + B1x1 + B2x2 + E
the error term has a 50% chance of being positive and 50% chance of being negative
what is the residual?
the difference between the observes and predicted values
what is SSE
the unexplained variation
the sum of the squared residuals
what is the multiple coefficient of determination?
the proportion of the Total variation in the n observed values of the dependent variable that is explained by the overall regression model
R^2
R^2 = explained variation / total variation
what is the multiple correlation coefficient
R
what is the adjusted R^2
the adjusted multiple coefficient of determination used to avoid overestimating the importance of independent variables
adjusted R^2 =
(R^2 - (k / (n - 1))) * ((n - 1) / (n - (k + 1)))
n is the number of observations
k the number of independent variables in the model
what are the four assumptions of the error term values in the multiple regression model?
- at any given combination of x1, x2, …, xk, the population of potential error terms has a mean value of 0
- constant variance assumption
- normality assumption
- independence assumption