Multiple Regression Flashcards

1
Q

What is the difference between univariate analysis and multivariate analysis?

A

Multivariate analysis looks at determining the effect of multiple x variables upon the variance of Y whereas univariate just looks at one

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the ‘model’?

A

The name given to the structure of different predictors that all together go in to predicting the variance in Y.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does the linear ‘line of best fit’ become when you are looking at multivariate analysis?

A

‘plane of best fit’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What statistic do we look to in multiple regression for the ‘coefficient of explanation’?

A

Adjusted R squared

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the process of standardisation in multiple regression and why do we do it?

A

Because the different predictors that go in to a model may be measured with different units, we are unable to compare their relative influence upon the variance of Y. This warrants us to standardise them so that they are comparable and we can determine the most influential factor.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

In SPSS, what symbols denote the unstandardised and standardised coefficients for the different predictors??

A
Unstandardised = alpha
Standardised = beta
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the F statistic?

A

this denotes the regression model’s level of significance in explaining the y variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is multicollinearity?

A

When different predictors correlate with each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is multicollinearity and an why is it a problem?

A

Multicollinearity is when two or more predictors are related/correlated with each other. Problem because if we are conducting multiple regression analysis then we are aiming to determine the impact of each individual predictor upon the y variance and so if two predictors correlate with each other then it distorts our interpretation of that result as there may be an unidentified relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What test do we use ot test for multicollinearity?

A

Variance Inflation Factor (VIF)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

If multicollinearity is not present between the predictors then what value would the VIF take?

A

VIF < 5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What can we do to remove multicollinearity?

A

1) remove predictor and re-run the test
2) create an interaction term where there is logical observable relationship
3) conduct a factor analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the F-test?

A

this test identifies the effect taking a specific predictor out of the model has upon the model’s ability to explain variance in Y.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What would be the result of the F-test if the predictor taken out of the model was really important?

A

There would be a big change in value relative to the initial value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the p-value of the p-test?

A

this value denotes the associated level of significance of the change to the F-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the 3 methods for constructing a multiple regression model?

A
  1. hierarchical
  2. step-wise
  3. simultaneous
17
Q

What is the hierarchical method for constructing a multiple regression model?

A

Tell SPSS to input different predictors based on their presumed coefficient of explanation (r squared).

18
Q

What is required in order to validly construct a multiple regression model using the hierarchical construction method?

A

Prior knowledge about which predictors will highly likely be the most influential

19
Q

What is the step-wise method for constructing a multiple regression model?

A

SPSS would determine the statistical significance of the different predictors then add them in based off that information.

20
Q

What is the simultaneous method for constructing a multiple regression model?

A

Inputting all predictors at the same time via a ‘cauldron’ approach

21
Q

What are 3 top tips for constructing a multiple regression model?

A
  1. do not ‘scattergun’
  2. use multiple statistics to inform judgement of different predictors
  3. do not overcomplicate model construction
22
Q

What is the tolerance statistic in SPSS?

A

This outlines whether or not a given factor is classed as providing enough explanation to the overall y variance

23
Q

What value does the tolerance need to be for each factor in order to be classed as a factor that contributes enough to the variance of Y?

A

They need to have a value above 0.2. This means they need to explain at least 20% of the variance in Y.