Week 12 Flashcards
Regression is?
- More Fiddly than other methods
- Has more assumptions
Why do Linear Regression
- Not looking at differences
- Looking at relationships
- Regression goes further than correlation - Allows us to make predictions
- Produces a model that allows for sophisticated exploration of relationships between variables
In Second Year Stats
- Looked at relationships - Correlation
- Differences Between Groups and Within-Groups
- Used t-tests and ANOVAs
- Variation in Dependent Variable
Correlation
Allows us to estimate direction and strength of a linear relationship
Why do Linear Regression
- How well will a set of variables predict an outcome?
- Which variable in a set of variables is the best predictor of an outcome?
- Does a particular predictor variable predict an outcome if another variable is controlled for?
Predictor Variable
Same as Independent Variable in Regression
Outcome Variable
The same as the Dependent Variable in Regression
What is a Model?
- An approximation to the actual data
- simple summary of data
- Makes data easier to interpret and communicate
- Allows us to predict data
What is a Regression Model
Mathematically Describes the linear relationship
* Y = b(X) + C
* Y = Predicted values of the DV
* b = The slope of the line
* X = Scores on the Predictor (IV)
* C = The Intercept
The Intercept
- Point where the function crosses the y-axis.
- Sometimes the regression model only becomes significant when we remove the intercept, and the regression line reduces to:
- Y = b(X) + error
Standardized beta (β)
- Compares the strength of the effect of each IV on the DV
- The higher the absolute value of the beta coefficient, the stronger the effect
How Does Regression Work?
- The DV is modelled as a linear combination of other variables
- These predictors don't always have to be continuous
- Can have a combination of variables
- Need to find the Line of Best Fit
Line of Best Fit
- Many different lines could be produced by the regression formula
- How do we know which line is best?
- The best line minimises the difference between observed values and the values predicted by the line (see the sketch after this card)
- This is called error
- In regression also called residuals
* Y = b(X) + C + error
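A minimal Python sketch of how the line of best fit is found, using made-up attendance/grades numbers (the lecture itself works in SPSS): the slope b and intercept C are the values that minimise the sum of squared residuals.

```python
import numpy as np

# Hypothetical data: classes attended (X) and grades (Y)
x = np.array([2, 4, 5, 7, 8, 10], dtype=float)
y = np.array([50, 55, 62, 70, 74, 85], dtype=float)

# Least-squares slope and intercept: the values that minimise the
# sum of squared residuals (observed minus predicted)
b = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
c = y.mean() - b * x.mean()

residuals = y - (b * x + c)
sse = np.sum(residuals ** 2)  # the quantity the line of best fit minimises

print(f"Y = {b:.2f}(X) + {c:.2f}, SSE = {sse:.2f}")
```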
N (Cases): k (Predictors) Ratio
- Assumption about sample size
- Need a certain number of participants to trust the validity of the results
- Simple Linear Regression assumption
- The ratio of the number of cases (N) to the number of predictors (k)
- The more predictors we have, the more cases we need for the study
Checking Linearity
- Checking for linearity requires scatterplots
- Need scatterplots between the DV and each IV
- Looking for evidence of non-linearity
Check for Normality
- Kolmogorov-Smirnov/Shapiro-Wilk: p > .05
- Skewness & Kurtosis: if the z-score is within ±1.96 (|z| < 1.96), it is normal
- Histogram follows a bell curve.
- Detrended Q-Q Plots: Equal amounts of dots above and below the line.
- Normal QQ Plots: Normal if dots hugging the line.
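A quick Python sketch of the numeric checks above on a hypothetical sample (the skewness/kurtosis z-scores use the common approximate standard errors √(6/N) and √(24/N)):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
scores = rng.normal(loc=70, scale=10, size=60)  # hypothetical sample

# Kolmogorov-Smirnov (against a normal with the sample's mean/SD)
# and Shapiro-Wilk: p > .05 means no significant departure from normality
ks = stats.kstest(scores, "norm", args=(scores.mean(), scores.std(ddof=1)))
sw = stats.shapiro(scores)

# Skewness/kurtosis z-scores: |z| < 1.96 is acceptably normal
z_skew = stats.skew(scores) / np.sqrt(6 / len(scores))
z_kurt = stats.kurtosis(scores) / np.sqrt(24 / len(scores))

print(f"K-S p = {ks.pvalue:.3f}, S-W p = {sw.pvalue:.3f}")
print(f"z skewness = {z_skew:.2f}, z kurtosis = {z_kurt:.2f}")
```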
Check for Univariate Outliers
- Identified on Box & Whisker Plots
- Dots indicate outliers
- Asterisk indicates extreme cases
- Number tells you which case is the issue
Reason Univariate Outliers are Problematic
- Regression Analysis gives formula for a straight line
- A data point that stands outside other data points can change the slope of your straight line
- This makes the line a poor predictor of the value of other data points
How to deal with Outliers
- Check if Outlier is a data entry error and fix it
- Check if outlier is from different population - Justifies removing their data
- Separate outliers and run different analysis
- Run Analysis with and without outliers and report both models
- Winsorization - Change values so they’re not Outliers anymore
- Use transformations or Bootstrapping
Winsorization
- Change the score of an outlier at the low end to the value of the 5th percentile
- Change the score of an outlier at the high end to the value of the 95th percentile
- Slightly problematic because it changes the data
- But it retains the extremeness of the case without removing its data
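A short sketch of winsorization in Python on made-up scores, using scipy's winsorize helper:

```python
import numpy as np
from scipy.stats.mstats import winsorize

# 20 hypothetical scores with one low and one high outlier
scores = np.array([3, 50, 52, 54, 55, 56, 57, 58, 59, 60,
                   61, 62, 63, 64, 65, 66, 67, 68, 70, 120], dtype=float)

# Clip the bottom and top 5% of cases to the next most extreme values:
# the outliers stay the most extreme cases but no longer distort the slope
adjusted = winsorize(scores, limits=(0.05, 0.05))
print(adjusted.min(), adjusted.max())  # 3 -> 50, 120 -> 70
```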
Bootstrapping
- An alternative to transformations for dealing with outliers
- Creates new samples by resampling, with replacement, from your own sample
- Does this repeatedly (typically thousands of times)
- Builds an empirical sampling distribution in which extreme values carry less weight
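A minimal sketch of the resampling idea, assuming a hypothetical sample with one extreme case and bootstrapping the mean:

```python
import numpy as np

rng = np.random.default_rng(7)
scores = np.array([52, 55, 58, 60, 61, 63, 65, 68, 70, 120], dtype=float)

# Resample with replacement from the observed sample, many times,
# to build an empirical sampling distribution of the mean
boot_means = np.array([
    rng.choice(scores, size=len(scores), replace=True).mean()
    for _ in range(5000)
])

# Percentile confidence interval that doesn't lean on normality assumptions
ci_low, ci_high = np.percentile(boot_means, [2.5, 97.5])
print(f"Bootstrapped 95% CI for the mean: [{ci_low:.1f}, {ci_high:.1f}]")
```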
Homoscedasticity
- Means "same scatter" or same variance
- The variance of the residuals is equal across all predicted scores on the outcome variable
Check for Normality, Linearity and Homoscedasticity
- We need the residuals to behave in a certain way
- Residuals are the difference between the predicted scores and the observed scores on the outcome variable
- SPSS generates a histogram and Q-Q plots of the residuals
Dealing with Heteroscedasticity
- Check the residual scatterplot in SPSS
- Check the plot for any patterns
- If the dots are scattered randomly, then we are all good (see the sketch after this card)
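A Python sketch of the same check on simulated data, plotting residuals against predicted values the way SPSS does:

```python
import numpy as np
import matplotlib.pyplot as plt

# Simulated data with a linear relationship and constant error variance
rng = np.random.default_rng(3)
x = rng.uniform(0, 10, 80)
y = 2.0 * x + 5 + rng.normal(0, 2, 80)

b, c = np.polyfit(x, y, 1)       # slope and intercept of the fitted line
residuals = y - (b * x + c)

# Residuals vs predicted: a random band around 0 suggests homoscedasticity;
# a funnel or curve suggests heteroscedasticity or non-linearity
plt.scatter(b * x + c, residuals)
plt.axhline(0, linestyle="--")
plt.xlabel("Predicted values")
plt.ylabel("Residuals")
plt.show()
```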
If Regression Assumptions are violated
- Check the normality of the predictors; if you fix these, heteroscedasticity can disappear
- Use a transformation on the Outcome Variable
- Consider using a different method like Weighted Least Squares Regression
- Use some kind of Non-Linear Regression
Null Hypothesis for Regression
- Slope of Regression line will be equal to 0
- β = 0
Alternative Hypothesis
- Slope of the Regression line will not be Zero
- β ≠ 0
Running Linear Regression
1. Analyse
2. Regression
3. Linear
4. Move your DV into the dependent box.
5. Move your independent variable into the independent box.
6. Ok
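The same analysis can be sketched in Python with statsmodels on hypothetical attendance/grades data (not the lecture's actual dataset); the summary reproduces the pieces the next few cards describe:

```python
import numpy as np
import statsmodels.api as sm

# Hypothetical data echoing the lecture example: attendance predicting grades
attendance = np.array([2, 4, 5, 6, 7, 8, 9, 10, 11, 12], dtype=float)
grades = np.array([48, 55, 52, 60, 63, 61, 70, 68, 75, 72], dtype=float)

X = sm.add_constant(attendance)   # adds the intercept (C) to the model
model = sm.OLS(grades, X).fit()

# The summary reports R², adjusted R², the ANOVA F test and p-value,
# and the unstandardised coefficients with their t-tests
print(model.summary())
```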
Linear Regression - R value
- Same as Pearson's correlation (r) in simple regression
- Tells us strength and direction of relationship
Linear Regression R Square Value
- Tells us amount of variance in DV explained by IV
- Proportion of Variance that can be explained by the variable
- 23% of variability in grades explained by attendance in this example
- Known to overestimate the explained variance
Linear Regression - R Square Adjusted
- R² is adjusted downward, so Adjusted R² is slightly smaller than R² (see the sketch after this card)
- Corrects bias of overestimated explained variance
- Useful as Goodness of Fit Statistic
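The standard correction is Adjusted R² = 1 − (1 − R²)(n − 1)/(n − k − 1); a quick sketch using the R² = .23 attendance example, with a made-up sample size of 50:

```python
def adjusted_r_squared(r_squared: float, n: int, k: int) -> float:
    """Correct R-squared for its positive bias (n = cases, k = predictors)."""
    return 1 - (1 - r_squared) * (n - 1) / (n - k - 1)

# R² = .23 from the attendance example, hypothetical 50 cases, 1 predictor
print(f"{adjusted_r_squared(0.23, 50, 1):.3f}")  # ~.214, slightly below .23
```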
Goodness of Fit Statistic
- Determines how well sample data fit a given distribution (e.g. one from a normal population)
- Determines whether a sample is skewed or normal in the actual population
Regression ANOVA
- Uses the df, F value and the p value
- Compares the error of the line of best fit with the error of the baseline model (slope = 0)
- The ANOVA is significant if the model is "better" than the baseline
Unstandardised Coefficient
- The Slope of the Regression Equation
- Amount of change in the Dependent Variable for a one-unit change in an Independent Variable
- This is the b coefficient
e.g. each unit of attendance is associated with a 1.88 unit increase in grades
Coefficient t-tests
- Check if IV is a significant predictor of the DV
- Become more relevant when we start adding more predictors
Standardised Coefficients
- A measure of the effect size
- Useful for multiple Regression
- Important when we have more than one Predictor
- Predictors often measured in different scales
- e.g., IQ points, classes attended, additional study time
Dealing with Multiple Predictors
- Most commonly found in Research Projects
- Allows us to predict the outcome variable from more than one predictor
- Answers how well a combination of predictors predicts the outcome (see the sketch after the equation below)
Y = b1(X1) + b2(X2) + C + error
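A sketch of a two-predictor model in Python on simulated data, showing both the unstandardised b coefficients and the standardised betas obtained by z-scoring everything first (all variable names and values here are made up):

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(11)
n = 100
x1 = rng.normal(50, 10, n)          # hypothetical predictor 1
x2 = rng.normal(20, 5, n)           # hypothetical predictor 2, different scale
y = 1.2 * x1 + 0.8 * x2 + 10 + rng.normal(0, 5, n)

# Unstandardised model: Y = b1(X1) + b2(X2) + C + error
X = sm.add_constant(np.column_stack([x1, x2]))
print(sm.OLS(y, X).fit().params)    # [C, b1, b2]

# Standardised betas: z-score everything first so the slopes are comparable
z = lambda a: (a - a.mean()) / a.std(ddof=1)
Z = np.column_stack([z(x1), z(x2)])
print(sm.OLS(z(y), Z).fit().params) # [beta1, beta2], no constant after centring
```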
Univariate Outliers
Outlier on one variable
Multivariate Outlier
Outlier on a combination of variables
Assumptions with Regression
- Normality
- Univariate Outliers
- Multivariate Outliers
- Multicollinearity
- Normality, Linearity & Homoscedasticity of residuals
Multicollinearity
- Two or more IVs highly correlated in a regression
- One IV can be predicted from another IV in the regression model
How to check for Multivariate Outliers
Mahalanobis Distance
Mahalanobis Distance
- Largest value should not be greater than the critical χ² value for df = k at α = .001
- Where k = the number of predictors
- Use same table as Cook’s Distance
- For simplicity use the table below:
- df = 1: χ² = 10.828
- df = 2: χ² = 13.816
- df = 3: χ² = 16.266
- df = 4: χ² = 18.467
- df = 5: χ² = 20.515
- df = 6: χ² = 22.458
Cook's Distance
- Tells you if there are cases that influence the regression line
- Use same table as Mahalanobis Distance
- Rule of thumb: if Cook's D is > 1 you have influential cases
- Dealt with in the same way as Univariate Outliers
Check for Multicollinearity
- Pearson's correlations between IVs
- If r > .85 then there is multicollinearity
- Tolerance: Values < .1 are multicollinear; < .2 warrant a closer look
- VIF: Values > 10 are clearly Multicollinear; > 5 warrant a closer look
- If you find a problem, then remove the offending variable
- If two IVs are so closely related, they are basically the same thing: treat them as one variable (see the VIF sketch after this card)
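A sketch of the Tolerance/VIF check in Python, with one deliberately collinear predictor in simulated data:

```python
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor

rng = np.random.default_rng(5)
x1 = rng.normal(size=100)
x2 = 0.9 * x1 + rng.normal(scale=0.3, size=100)  # deliberately collinear with x1
x3 = rng.normal(size=100)

X = sm.add_constant(np.column_stack([x1, x2, x3]))
for i, name in enumerate(["x1", "x2", "x3"], start=1):  # skip the constant column
    vif = variance_inflation_factor(X, i)
    print(f"{name}: VIF = {vif:.2f}, Tolerance = {1 / vif:.2f}")
```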
Check for Multivariate Outliers
- Use Residual Statistics Table
- First, we find the critical χ² for a model with 4 predictors: χ² = 18.467
- Check the Mahalanobis Distance table
- Use Mahal. Distance Maximum (13.803 here)
- 13.803 < 18.467 Therefore there are no multivariate outliers.
- Cook's D is < 1, so there are no influential cases
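The same comparison can be sketched in Python on simulated predictor data, computing each case's Mahalanobis distance and the critical χ² for df = 4 at α = .001:

```python
import numpy as np
from scipy.stats import chi2

rng = np.random.default_rng(9)
X = rng.normal(size=(60, 4))          # 60 cases on 4 hypothetical predictors

# Squared Mahalanobis distance of each case from the predictor centroid
diff = X - X.mean(axis=0)
inv_cov = np.linalg.inv(np.cov(X, rowvar=False))
d2 = np.einsum("ij,jk,ik->i", diff, inv_cov, diff)

# Critical chi-square for df = k predictors at alpha = .001
critical = chi2.ppf(1 - 0.001, df=4)  # 18.467 for k = 4
print(f"Max Mahalanobis D² = {d2.max():.3f} vs critical {critical:.3f}")
print("Multivariate outliers present:", bool(d2.max() > critical))
```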
Interpreting Multiple Regression
- Use the Variables Entered/Removed table
- Tells you how many predictors are in the model (4 in this example)
- Then Model Summary Table
- R is not just Pearson's r anymore
- It is the correlation between the actual scores and the predictions from the regression equation
- R square = proportion of variance in the DV accounted for by the combined predictors
- Again R square Adjusted is a corrected version of R square that accounts for the positive bias.
Interpreting Multiple Regression ANOVA
- Now tests the combination of predictors
- e.g. whether the combination is a significant predictor of GHQ
- The table has the df, the F value, and the p-value.
Interpreting Multiple Regression Coefficients
- Unstandardized coefficient is the slope of the regression
- Shows that each unit increase in one of the independent variables is associated with a b-unit increase in GHQ
- All other IVs are held constant
- Beta values = Standardised regression coefficients
- Allow direct comparison of regression coefficients.
- Displayed in units of standard deviation.
Interpreting Multiple Regression Standardised Coefficients
- t-values and p-values test the significance of the unique contribution of each predictor
- Changes depending on predictors included in the model.
Multiple Regression Tolerance & VIF
- Tolerance: values < .1 are multicollinear; < .2 warrant closer inspection.
- VIF: values > 10 are clearly multicollinear; > 5 warrant closer inspection.
Remove Non-Significant Predictors
- If you have a predictor that is not contributing anything, it makes the model worse
- This changes the numbers slightly
- Only have significant predictors in the model
Applied look at Regression Equation
- Our general form for the regression is:
Y = b1(X1) + b2(X2) + b3(X3) + C + error
- And if we take this equation and substitute in our variables we get:
GHQ = b1(neuroticism) + b2(state-anxiety) + b3(trait-anxiety) + C + error
GHQ = .555(Neuroticism) + .318(state-anxiety) + .471(trait-anxiety) + 13.552 + error
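Plugging scores into this fitted equation gives a prediction; a tiny Python sketch using the lecture's coefficients (the input scores below are made up purely for illustration):

```python
# Fitted equation from the lecture:
# GHQ = .555(Neuroticism) + .318(state-anxiety) + .471(trait-anxiety) + 13.552
def predict_ghq(neuroticism: float, state_anxiety: float, trait_anxiety: float) -> float:
    return (0.555 * neuroticism
            + 0.318 * state_anxiety
            + 0.471 * trait_anxiety
            + 13.552)

print(f"Predicted GHQ: {predict_ghq(12, 35, 40):.2f}")  # 50.18
```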
What is the value for R?
- Correlation between the DV & IVs
- A value greater than 0.4 is taken for further analysis
What does R tell us?
The strength & direction of the relationship
What does the value of R2 Adjusted tell the researcher
- Tells you the percentage of variation explained by only the independent variables that actually affect the dependent variable.