LECTURE 3 AND 4 multiple regression Flashcards
what is multiple regression
using linear regression model to predict the value on one variable from several predictor variables - hypothetical relationship between several variables
what equation does the multiple regression use
expansion of straight line equation
y = b0X1 + b2X2 ….+error
forced entry
all predictors entered simultaneously
hierachial
experimenter decides input of variables based on theoretical background
aloows to observe unique predictive influence of a new variable on the outcome as known predictors held constant
stepwise
predictors selected using semi partial correlation with outcome
r value
correlation between observed value and predicted value
r square
proportion of variance accounted for by the model
adjusted r square
estimate of r square in popualtion (shrinkage) - estimates the change in r square when generalise from sample to the population
beta values
chnage in outcome associated with unit change in the predictor - also has SD value (as increase value by 1SD have a ___ SD effect on second variable)
t test in ANOVA
tells whether each IV make a significant contribution to predicting the DV (coeff sig diff from 0) (t= p=)
how to interpret SD beta values
when predictor value increase by 1SD, predicted value increase by ___ (beta) of a SD
r square change (hierachial)
hows how the second variable has accounted for more of the models variance
ie.e r square of model 1 is how much variance accounted for by 1st vairable
r square of model 2 is how much variance accounted for by both vairable - r square change is the specific variance accounted for by the addition of the second variable
how to report anova
F (df, residual df)=__, p= __)
how can you assess accuracy of the multiple regression model
standardised residuals and influential cases (cookes distance)
describe standardised residuals for accuracy of the multiple regression model
average sample - 95% STDresiduals between +- 2
99% between +- 2.5
outliers of >3