16 - Regression And Prediction Flashcards
Regression
Allows the researcher to make predictions of the likely values of the dependent variable (Y) from know values of the independent variable (X)
F ratio should be significant.
Regression assumptions
- minimum requirement: at least 15*(nb IV)
- identify and remove outliers
- normality of residuals
- homoscedasticity
- linear dispersion of points
- avoid multicollinearity (multiple regression)
Line of best fit
Regression equation will minimise the sum of square of the vertical distances between the actual data point and the line and therefore make error as small as possible.
Y= a +bX + error
Y= criterion X= predictor
Standard error of the estimate (regression)
SE= SD*sqrd(1-r^2)
r^2 and R^2
Simple
Multiple regression
VIF
The variance inflation factor (VIF) measures the impact of collinearity among the IVs in a multiple regression model on the estimation. It expresses the degree to which collinearity among the predictors dégradés the precision of an estimate. Concern if VIF>10
Hierarchical multiple regression
The researcher determines the order of entry of the IVs into the equation.
Stepwise multiple regression
- forward entry: one at a time
- backward entry: all at the same time