Week 9 Flashcards
What are the assumptions of regression models?
- use correct variables for the technique (for linear, we want interval level data, equal distances between each point of the scale)
- independence of data (and error terms)
- sample size and normality (of variables and residuals)
- linearity
- homoscedacity (of residuals)
In regressions models, the ____ the sample size the better.
Larger
You can check for non linearity using a:
plot. Select statistics for correlation values.
What is residuals another way of saying?
Pretty much saying how crappy your data is.
What does checking the pattern of model mis-fit check in linear regression?
Scedacity
Under assumption checks, what do you tick if you are wanting to look at if your model is good at all values in a linear regression?
Tick residual plots, if your data is equally dispersed and shows no real pattern, this is good at predicting scores across the spectrum of scores of the predictor variable.
What is homoscedacity?
It means that the error variance should be the same at each level of the predictor variable.
Rather than eyeballing plots, how else can we test heteroscedacity?
Under assumption checks. If we have a significant value, this is BAD. WE have violated the assumption of homoscedacity.
What type of designs do even a small amount of outliers have a huge effect for?
Linear regression models
Can we ever remove unusual scores (outliers)?
people have strong opinions about this.
how can we test to see if we have any unusual outliers? (2)
- Using cooks distance (you can save this to the data set)
2. any residual value >3 SD’s from the mean manually
Simultaneous multiple regression examines the ____ ability of a set of predictor variables in accounting for variability in a response variable
combined
In a hierarchical multiple regression, what is in model 1?
Everything that was in block one.
In hierarchical multiple regression, what is included in model 2?
Everything that was in block one, plus that which was i block two.
What explains the percentage of the variation explained by the models in hierarchical multiple regression?
Adjusted R squared.