Multiple Regression Flashcards
What is collinearity?
Collinearity is when two or more predictors are highly similar. They CORRELATE highly with each other, meaning they may produce the same or very similar results
What is covariance?
Covariance is when the change in one variable is associated with the change in another one. For example, if a variable changes that has an effect on the other variable
Why is multiple regression based on correlations data?
Because there is no direct manipulation of any predictor variables
What data does multiple regression use?
Scale data: either ratio, interval or ordinal data
Why do we base building a multiple regression model off previous research?
We need a reason for including the predictors we are choosing; do we think they will have a direct influence on our criterion variable?
What are assumptions of multiple regression? (Hint: there are 4 and they revolve around the data)
1) no outliers
2) normality of data
3) linearity of data
4 reliability of data
How is normality of data assessed in MR?
By looking at the skew and kurtosis. Normal distribution is between -2 and +2.
What does MR do it there is not a linear relationship between the criterion variable and the predictors?
It will underestimate the relationship. This increases the type 2 error.
Why does having lots of predictor variables potentially violate an assumption of MR?
Having lots of variables can make the relationship between criterion and predictors non-linear
How do we measure reliability in MR?
By using Cronback Alpha
What is Cronbacks Alpha? What does it measure?
It is a measure of internal consistency: it measures how closely related a group of items are in a set. It measures reliability.
What is homoscedacity? What is it called if you do not have it?
Where the variance is the same for all the predictor variables.
Hetroscedacity is what occurs if you do not have homodecsity.
Why is hetrosecdasity bad for MR results? How do we test for hetrodecsity?
It is bad as it can lead to a distortion in our results, which leads to a type-1 error.
We check for it by looking at residuals
What will the correlation be it multicolinesrity is at a moderate level?
Between 0.3 and 0.8
How many variables (predictors) do you need to be able to do a multiple regression analysis?
Two or more independent variables