Multiple Regression Flashcards
What is the difference between univariate analysis and multivariate analysis?
Multivariate analysis looks at determining the effect of multiple x variables upon the variance of Y whereas univariate just looks at one
What is the ‘model’?
The name given to the structure of different predictors that all together go in to predicting the variance in Y.
What does the linear ‘line of best fit’ become when you are looking at multivariate analysis?
‘plane of best fit’
What statistic do we look to in multiple regression for the ‘coefficient of explanation’?
Adjusted R squared
What is the process of standardisation in multiple regression and why do we do it?
Because the different predictors that go in to a model may be measured with different units, we are unable to compare their relative influence upon the variance of Y. This warrants us to standardise them so that they are comparable and we can determine the most influential factor.
In SPSS, what symbols denote the unstandardised and standardised coefficients for the different predictors??
Unstandardised = alpha Standardised = beta
What is the F statistic?
this denotes the regression model’s level of significance in explaining the y variance
What is multicollinearity?
When different predictors correlate with each other
What is multicollinearity and an why is it a problem?
Multicollinearity is when two or more predictors are related/correlated with each other. Problem because if we are conducting multiple regression analysis then we are aiming to determine the impact of each individual predictor upon the y variance and so if two predictors correlate with each other then it distorts our interpretation of that result as there may be an unidentified relationship
What test do we use ot test for multicollinearity?
Variance Inflation Factor (VIF)
If multicollinearity is not present between the predictors then what value would the VIF take?
VIF < 5
What can we do to remove multicollinearity?
1) remove predictor and re-run the test
2) create an interaction term where there is logical observable relationship
3) conduct a factor analysis
What is the F-test?
this test identifies the effect taking a specific predictor out of the model has upon the model’s ability to explain variance in Y.
What would be the result of the F-test if the predictor taken out of the model was really important?
There would be a big change in value relative to the initial value
What is the p-value of the p-test?
this value denotes the associated level of significance of the change to the F-test