Multiple Regression Flashcards
• demonstrate an understanding of the similarities and differences between Simple and Multiple regression
• demonstrate an understanding of the key statistical elements of Forced Entry Multiple Regression
• demonstrate an understanding of the key statistical elements of Hierarchical Multiple Regression
• complete and interpret Multiple Regression analyses on SPSS
what is the basis of multiple regression?
one outcome, multiple predictors
-> multiple variables (predictors) predict one outcome
what is R-squared
- amount of variance explained by the regression/model
- correlation coefficient squared
what is simple linear regression
- builds a model to explain the variance using an equation with one predictor
- test how well variability of the scores is explained by the model (R^2)
- significance of F: variance explained significant (not zero)
- B1: slope, B0: intercept (constant)
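As a sketch of what SPSS computes here, the least-squares slope (b1) and intercept (b0) for one predictor can be written in plain Python (the data values are made up for illustration):

```python
def simple_regression(xs, ys):
    """Least-squares fit for one predictor:
    b1 = cov(x, y) / var(x), b0 = mean(y) - b1 * mean(x)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    ss_xy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    ss_xx = sum((x - mx) ** 2 for x in xs)
    b1 = ss_xy / ss_xx  # slope
    b0 = my - b1 * mx   # intercept (constant)
    return b0, b1

b0, b1 = simple_regression([1, 2, 3, 4], [2, 4, 6, 8])
print(b0, b1)  # 0.0 2.0
```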
how is multiple regression similar to simple regression?
- builds a model to explain the variance using linear equation
- test how well the variability of the scores is explained by the model
- R^2: how much of the variance is explained by our model
- significance of F: is the variance explained significant (not zero)
- the usual assumptions, including homoscedasticity and normally distributed residuals, apply
BUT what is new for multiple regression?
- using an equation with more than one predictor
- examine how much each predictor contributes to predicting the variability of the outcome measure (forced entry and hierarchical regression)
- compare different models predicting the same outcome (hierarchical regression) and see which model predicts most of the variance
R^2
tells us the estimate for our sample
-> will naturally overestimate the ‘real’ R^2 (in the population)
Adjusted R^2
estimate for the population (probably a more accurate measure -> more likely to be accurate because it takes sample size into account)
why is R^2 adjusted?
- adjusted down to allow for the overestimation of R^2
-> better reflection of the ‘real’ R^2
what does the adjustment relate to?
sample size
-> generally the bigger the sample size, the less need for adjustment
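The adjustment is the standard formula Adjusted R^2 = 1 - (1 - R^2)(n - 1)/(n - k - 1); a quick sketch (the R^2 and sample sizes are made-up values) shows how the shrinkage depends on sample size n and number of predictors k:

```python
def adjusted_r2(r2, n, k):
    """Adjusted R^2 = 1 - (1 - R^2) * (n - 1) / (n - k - 1);
    shrinks the sample R^2 toward the likely population value.
    n = sample size, k = number of predictors."""
    return 1 - (1 - r2) * (n - 1) / (n - k - 1)

# Same R^2, same predictors: the bigger sample needs less adjustment.
print(round(adjusted_r2(0.50, 30, 3), 3))   # 0.442
print(round(adjusted_r2(0.50, 300, 3), 3))  # 0.495
```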
should you report R^2 or adjusted?
report both
-> for simple regression as well
what is the F ratio?
- we can test if our model accounts for a significant amount of the variance as we did before
- it is the variance predicted by the model with all predictors
In multiple regression, a significant R squared tells us…
- our model accounts for a significant amount of variance in the outcome
-> the ratio of explained to unexplained variance is high
Unlike multiple regression, in simple regression
you know which variable predicts the outcome from a significant R-squared (there is only one predictor)
The return of the B (characterises the relationship of a predictor)
- get individual b’s for each of our predictors
- the b's relate to each other, because the other variables/predictors are taken into consideration as controls
what does B do?
- estimate of contribution while ‘controlling’ for other variables
- have an estimate of how much each variable contributes on its own with the others held constant -> similar to partial correlation
- estimate of the individual contribution of each predictor
Multiple Regression
how much variance does the overall model, with all its predictors, account for?
components in multiple regression
- b0
- more than one predictor i.e. b1(x1) [regression coefficient for predictor 1] + b2(x2) [regression coefficient for predictor 2] + bn(xn) [regression coefficient for predictor nth variable]..
what is the issue with normal b’s?
affected by the distributions and type of score
-> you can use them in an equation, but you can't compare them, especially if the predictors are measured on different scales
what is the solution to the B issue?
standardise them (turn b into beta weights) by expressing each b in standard deviation units
-> a standardised score is simply the number of standard deviations above or below the mean of the scores
-> you can compare how much each predictor is contributing to the prediction
-> by standardising b, it allows us to compare the analysis and contribution of each variable to the outcome in terms of standard deviations
what does b1 = 0.594 mean if it is a beta weight?
as the predictor increases by one SD, the outcome increases by 0.594 of a standard deviation
-> Slope we can compare across different predictors
-> Beta telling us about the contribution of each individual predictor to the model - and usually they’re quite variable
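The standardisation itself is just a rescaling; a minimal sketch, where the b and SD values are hypothetical:

```python
def beta_weight(b, sd_predictor, sd_outcome):
    """Standardised beta = unstandardised b * (SD of predictor / SD of outcome),
    i.e. the slope re-expressed in standard-deviation units so it can be
    compared across predictors."""
    return b * sd_predictor / sd_outcome

# Hypothetical values: b = 0.5, predictor SD = 3, outcome SD = 5
print(beta_weight(0.5, 3, 5))  # 0.3
```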
how can we test whether each predictor is significant from zero or not?
a T-test
what is the output of a multiple regression?
- each variable has an unstandardised (b) and standardised coefficient (beta or β)
- t value derived from b
-> the associated p-value tells you if the coefficient estimate is significantly different from zero (i.e. whether it is a significant predictor)
what does the unstandardised value allow you to do?
it can be used within any equation (predictions come out in the original units of measurement)
what does the standardised value allow us to do?
make comparisons across predictors
what does negative β indicate?
negative relationship (even if it’s not significant)
how to output your interpretations?
- Extraversion: b = 1.40, β = .594, t = 6.95, p < .001 (extraversion is a significant predictor of wellbeing)
- Agreeableness: b = -0.48, β = -.018, t = -0.222, p = .83 (agreeableness is not a significant predictor of wellbeing)
AND so on…
* telling us about individual contribution of predictors from the model
what are b and β better at:
estimates of the contributions of individual predictors
why can’t you trust correlations?
they are just estimates of a two-variable relationship without the other variables taken into consideration
-> they are uncontrolled, as it were (just an estimate of the two-variable relationship)
-> always run a regression, because correlations won’t give you the answer you need
We are looking at how IQ, Age and working memory predict reading scores. This means:
We are looking for 3 beta weights from the analysis, and the df for SSregression in the overall ANOVA is 3.
Reading Score = 22 + (.03)IQ + (.06)Age + (-0.3)WM. What is the predicted reading score for a 19-year-old with an IQ of 110 and a WM score of 30?
17.4
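Checking that arithmetic directly:

```python
# Plug the given values into the regression equation:
# Reading Score = 22 + (.03)IQ + (.06)Age + (-0.3)WM
b0, b_iq, b_age, b_wm = 22, 0.03, 0.06, -0.3
iq, age, wm = 110, 19, 30
score = b0 + b_iq * iq + b_age * age + b_wm * wm  # 22 + 3.3 + 1.14 - 9
print(round(score, 1))  # 17.4
```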
A beta of -0.58 means what?
for every SD the predictor increases, Y decreases by .58 of an SD
What is Hierarchical Regression?
When we need to control for a third/important variable (i.e. controlling for age while seeing if personality predicts wellbeing)
How do we conduct Hierarchical Regression
- add the variables into the equation in steps
1. add the control variable(s) in first -> making sure you account for any variance that might be explained by them - examine R^2 and its significance
2. add the variables we are interested in -> the control variable(s) stay in the model; run the analysis again.
Giving you 2 models
What are the two models and what are we looking for?
- Age on its own (Model 1)
- Age and Personality Traits (Model 2)
We are interested in the change in predictive power from Model 1 to Model 2 -> want to see if there’s a change from step 1 to step 2
-> We’re looking at the predictive power of model 2 to see if it is significantly better than the predictive power of model 1
How can we compare both models?
using F ratio changes
- enter the first set of variables into the analysis for the first model -> get the R^2 and F ratio telling us how much variance is accounted for by the model
- next batch of variables are added in a second model
-> R^2 and F ratio telling us how much variance is accounted for by the model (which includes both sets of predictors) - want to compare the models
* We’re going to make a call on the F ratio and whether there’s a significant change -> i.e. the F change compares the models and tells us whether there is a significant improvement in variance explained in model 2 (the model that explains significantly more variance)
* We can see if more variance is explained in the second model compared to the first model
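The R^2-change test SPSS reports can be sketched with the standard F-change formula for nested models (the R^2 values below are like those in the hierarchical example reported later, with n = 86; the function name is just illustrative):

```python
def f_change(r2_reduced, r2_full, k_reduced, k_full, n):
    """F for the change in R^2 between nested models:
    F = (delta R^2 / delta k) / ((1 - R^2_full) / (n - k_full - 1)),
    on (delta k, n - k_full - 1) degrees of freedom."""
    num = (r2_full - r2_reduced) / (k_full - k_reduced)
    den = (1 - r2_full) / (n - k_full - 1)
    return num / den

# Model 1: 1 predictor, R^2 = .073; Model 2: 6 predictors, R^2 = .526
print(round(f_change(0.073, 0.526, 1, 6, 86), 1))  # 15.1
```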
ANOVA tables for hierarchical regression
- each model has a separate ANOVA table which tells us if the variance explained is different from zero for each separate model
- does not compare the models; instead it is what you report for each individual model (whether the amount of variance accounted for differs from zero)
An example of a Hierarchical Regression
“A hierarchical regression was carried out with Age in step 1, and mean scores of the different personality scales; Extraversion, Agreeableness, Openness, Conscientiousness, and Neuroticism in step 2. A significant model was found at step 1, F(1,84) = 6.57, p <.05 and explained 7.3% of the variance. The inclusion of the five personality traits significantly increased the amount of variance explained to 52.6% (p<.001) and was a significant model, F(6,79)=14.61, p <.001…”
* Model 1 and 2 are significant
* Age predicts wellbeing
* The age and personality scales (as a group) predicts wellbeing
* Model 2 accounts for significantly more variance
* The addition of personality improves our ability to explain the variance
what coefficients should you focus on?
- if model 2 sig, focus your interpretation on this
- when a predictor predicts the outcome in model 1 but stops being significant in model 2, this is something to focus on
what are dummy variables?
- you can introduce categorical variables into regression using dummy coding (0’s and 1’s)
cases where you’d use dummy variables
- traditional gender or experimental conditions
what about the outcome numbers in dummy variables?
the outcome has to be continuous but your predictors do not have to be (they can be 0’s and 1’s, allowing us to code for 2 different categories in our analysis -> each case gets a score which helps predict the outcome)
If the b/beta value is positive
the category coded as ‘1’ is higher on the outcome variable than the category coded as ‘0’ (e.g. males score higher than females)
if the b/beta value is negative
the category coded as ‘0’ is higher on the outcome variable (e.g. females score higher than males)
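A sketch of the coding itself (the group labels are hypothetical):

```python
# Dummy-code a two-level categorical predictor as 0/1 so it can
# enter the regression equation like any numeric predictor.
groups = ["male", "female", "male", "female", "female"]
dummy = [1 if g == "male" else 0 for g in groups]
print(dummy)  # [1, 0, 1, 0, 0]
```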
what is another word for multiple regression?
forced entry regression (all predictors entered at once)
where can you find whether the F change is significant?
in the ANOVA table
Male coded as 1 and Female coded as 0 in this analysis. A positive coefficient of 0.86, what would this mean?
men are scoring higher on this measure than females
* positive coefficient means males are scoring higher on Wellbeing (but it is not significant)
A score of -.45
would mean that year one students (0) are happier than year 2 students (1)
what are the assumptions with multiple regression?
- Variable Type: Outcome must be continuous (Predictors can be continuous or discrete e.g. dummy variables).
- Non-Zero Variance: Predictors must not have zero variance.
- Independence: All values of the outcome should come from a different person or item.
- Linearity: The relationship we model is, in reality, linear
- Homoscedasticity: For each value of the predictors the variance of the error term should be constant.
- Normally-distributed Errors: The residuals must be normally distributed
What is a type of bias we need to be cautious of?
Multicollinearity
Multicollinearity
- exists when predictors are highly correlated with each other
-> look for medium-to-strong correlations between predictors
what are some issues with multicollinearity?
it can undermine your findings:
* b’s can be unstable (vary across samples)
* difficult to say which predictor is important
* artificially limits R^2 -> predictors that are all correlated together add little unique variance individually
how can multicollinearity be checked?
collinearity diagnostics
collinearity diagnostics
- VIF is a measure of each predictor’s relationship with the other predictors
- want it to be as low as possible (tells you that your predictors are reasonably independent of one another)
- anything close to 10 is an issue/problematic
how to calculate tolerance?
1 divided by the VIF
-> should be above 0.2
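Since tolerance is just the reciprocal of VIF, the two rules of thumb line up; a quick check (the VIF values are made up):

```python
def tolerance(vif):
    """Tolerance = 1 / VIF; values below 0.2 flag collinearity,
    and a VIF close to 10 is clearly problematic."""
    return 1 / vif

print(tolerance(10))    # 0.1  -> below 0.2, problematic
print(tolerance(1.25))  # 0.8  -> fine
```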
what else can affect our results?
extreme outliers
how can we check for extreme outliers
standardised residuals
issue with very high standardised residuals?
actual score is very different from predicted
how do we check for outliers?
- just look/check for high standardised residuals (looking for min and max)
- only about 5% should be over 2 SD (more than that suggests outliers)
- look in the residuals statistics table to see what the minimum and maximum residual are
how many residuals should be over 2SD
no more than about 5%
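A minimal sketch of that check in plain Python (the residual values are made up, and the function name is illustrative):

```python
import statistics

def flag_large_residuals(residuals, cutoff=2.0):
    """Standardise the residuals and return those more than `cutoff`
    SDs from the mean; much more than ~5% flagged suggests outliers."""
    mean = statistics.mean(residuals)
    sd = statistics.pstdev(residuals)  # population SD of the residuals
    return [r for r in residuals if abs((r - mean) / sd) > cutoff]

# One case's actual score is far from its predicted score:
print(flag_large_residuals([0.1, -0.2, 0.15, -0.1, 0.05, 8.0]))  # [8.0]
```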
how can we measure the influence of outliers?
cook’s distance
cook’s distance
- measures the influence each case has on the model
-> most if not all Cook’s distances should be below 1 (want Cook’s distance to be as low as possible) - look at the maximum Cook’s distance in the residuals statistics table
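Screening the Cook’s distance column can be sketched as follows (the distance values and function name are illustrative):

```python
def flag_influential_cases(cooks_distances, threshold=1.0):
    """Return indices of cases whose Cook's distance exceeds the
    conventional cutoff of 1, i.e. cases with undue influence."""
    return [i for i, d in enumerate(cooks_distances) if d > threshold]

# Case 2 would warrant a closer look:
print(flag_influential_cases([0.02, 0.10, 1.35, 0.07]))  # [2]
```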