Lecture 10 - Multiple Regression Flashcards
What is regression analysis?
Techniques that allow us to asses the relationship between one dependent variable and several independent variables
What does regression allows us to do?
Predict using multiple independent predictors (IV), the effect that each IV (predictor) has on the DV
What can we not assume about regression analysis?
Causality
What is multiple linear regression?
It’s an extension of simple linear regression and has similar underlying assumptions
What does the standard error of estimate do?
Provides an index of the general error that you are making with you predictions
Is it or is it not possible to calculate confidence intervals around your prediction?
It is
What is the standard error of estimate?
For every point on the regression line you can calculate the residual error - the standard error of estimate is the standard deviation of the errors of estimate
Why is SEE useful?
Indicates the amount of error to expect, on average, in your predictions
Gets a sense of accuracy of your predictive model
Assumes random error
What does the residual mean in the residual analysis?
It’s the part of the data that cannot be explained by the statistical model
What does it mean if there is a pattern in the residual error?
If there’s a pattern in the residual then it is the action of a systematic error?
What does homoscedascity mean?
The same spread of data
What are the different variables in multiple linear regression?
Predictor variables, criteria variables, predictor variable
Multiple Linear Regression retains what squares approach?
The least squares - using the line that has the lowest sum of squared residuals
What are some limitations and assumptions of regression analysis?
Ratio of cases to predictor - the cases to predictor must be high enough to make the regression model stable and useful
Outliers among the predictors and criteria variables - the least squares approach means that the residuals can have a large impact
What are limiting factors for MLR?
Multicollinearity (high correlations between variables r