multiple regression Flashcards

1
Q

what does a vif score of 5 or above mean

A

multicolinearity (intercorrelations between the independent variables)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what does a vif score of 10 or above mean

A

severe multicolinearity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

multiple regression is an analysis of ___

A

dependence - one variable is examined by its dependence on another

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what are independent variables referred to as?

A

predictors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

the __ of each X variable describes its relationship with Y

A

coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

quadratic equation

A

y = cx2 + bx + c

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

multi regression equation

A

y = b1x1 + b2x2 + … + bnxn + c

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

in the multiple regression equation what does b stand for

A

the predictor values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what will the results from SPSS come out as (eg what is the name of the variable)

A

the R value - this is the measure of association between the observed and predicted value of the criterion variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is r2

A

simple linear regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what does r2 adj account for

A

accounts for the number of predictor variables in multiple regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

should you assess the relative importance of the predictor by the size of the coefficient? if not then what should you do??

A

NO – STANDARDISE THE COEFFICIENTS WITH BETA WEIGHTS (looks at the response of y to each independent variable)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is f?

A

the significance of the model being able to explain variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what does f = 0 mean?

A

the model does not explain variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is B

A

the regression coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what is t?

A

the significance of the coefficient in explaining the variance

17
Q

what is assumed in multiple regression about the distribution?

A

it is normally distributed - must be for it to work

18
Q

homoscedasticity is always assumed in multiple regression, what is this?

A

the variance is constant across all levels of the predicted variable. eg there is very little variance from the line of best fit for all variables.

19
Q

what should you look at to see if x values are correlating with eachother?

A

vif factor (variance inflation factor)

20
Q

is it better to have more or less values in your graphs and analysis

A

less - more values is not necessarily good, need around 2 or 3 as most regressors are likely to be significant

21
Q

what is the equation showing: s^2y/s^2e

A

used to determine r2 value - simple linear regression

22
Q

what does this represent: s^2y

A

variance of original simulation model output

23
Q

what does this represent: s^2e

A

variance of regression residuals

24
Q

what is the difference between Homoscedasticity and Heteroscedasticity

A
homo = everything is the same variance, normally distributed, okay
hetero = violation of homo, variation of variance, errors in independent variable
25
Q

what does it mean if the f ratio is greater than 1

A

explained variance is higher than unexplained variance

26
Q

what is ‘tolerance’?

A

gives you the unique variance associated with each variable

27
Q

what does a tolerance of 0.34 mean

A

means 34% of variance for that predictor is not accounted for by other predictors

28
Q

a tolerance of less than 0.2 means?

A

that that predictor does not add anything new to the model - not good

29
Q

3 ways to identify homoscedasticity - common question

A

in a scatter plot there should be no discernable change in distribution (should be randomly scattered)
data in histograms and p-p plots should also remain normally distributed

30
Q

why do you need to standardize coefficients - common question

A

to reduce multicollinearity

to compare different stats effectively

31
Q

what is stepwise multiple regression

A

it finds the independent variable that has the largest significant Pearson’s correlation with Y. It then returns to the matrix and finds the next most significant correlation and onwards

32
Q

how does stepwise mutliple regression deal with insignificant predictors?

A

Stepwise regression involves adding predictors to the model one by one until it finds a non-significant predictor, at which point it stops building the model.

33
Q

what is hierarchal multiple regression

A

This uses the system of ‘blocks’ in the ‘linear regression’ window to allow the user to define the order in which variables should be regressed. This approach is very logical and selection of predictors is based on underlying theoretical considerations

34
Q

how does hierarchal multiple regression deal with insignificant predictors?

A

the system uses a method of ‘blocks’ in the linear regression window to allow the user to choose what variables to regress. this allows certain predictors to be controlled and better understood

35
Q

how does forward stepwise regression work?

A

model selects from the group of predictors based on the predictor which makes the largest contribution to r2. adds the rest of the predictors but stops once the remaining variables cannot make a significant contribution.

36
Q

how does backward stepwise regression work?

A

opposite of forward. dependent variable is regressed against all predictors and the weakest is taken out (contributes the least). this continues until only the statistically significant variables remain.

37
Q

what is the equation for standardising values

A

zx (variable) = (x (each observation) + x mean) / standard deviation of variable

38
Q

how to calculate f value

A

explained variance / unexplained variance