L6 - Linear Regression Flashcards
What is the purpose of a simple linear regression?
Simple linear regression involves fitting a line of best fit to a scatterplot of data points representing two linearly related variables, X and Y.
What are the 6 features of a simple linear regression?
- Line of best fit
- Method of least squares
- Variance partitioning and residuals
- Coefficient of determination (R-squared)
- Regression equations/interpretation
- Coefficients and beta values
How do we develop a ‘line of best fit’?
Find the slope and the Y-intercept
Y = mX + c
What is the difference between a multiple regression and a linear regression?
Multiple regression has multiple predictors
Linear regression has a single predictor
What is the total variance in a regression?
How far the actual scores are from Ybar (the mean of Y)
Total variance is based on Y - Ybar (Y = actual score, Ybar = the mean of Y, the original estimate)
- Y (actual score) = top*
- Y1 (regression line prediction) = middle*
- Ybar (original prediction, i.e. the class average)*
- Y1 is an improvement on the original predictor (Ybar)*
- That improvement is the explained variance, the “regression component”: the bit you have explained*
What is the difference between Y and Y1 in this example called?
The Residual (Error) of the model
How far off the model is from the actual number
If you take Y1 (predicted) away from Y (actual) for every person, square the differences and then add them up, what do you get?
The Error Sum of Squares
Amount of variance we have not been able to explain with the variables
“Error variance”
What is the total sum of squares?
Tot SS = difference between actual scores and the mean value of Y.
For every person in the study, we subtract the mean of Y from their actual score, square the difference, and then add the numbers up.
What is the Regression Sum of Squares (RSS)?
The Explained Variance.
The difference between the original estimate (the mean of Y) and the closer estimate obtained once the regression has been done
Calculation: sum of (predicted value - mean of Y), squared
What is the calculation of R2 (R-squared)?
RSS/TSS
(TSS = total sum of squares)
(RSS = regression sum of squares)
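As a rough sketch on made-up data, the three sums of squares and R-squared can be computed directly (the X/Y values here are invented for illustration):

```python
import numpy as np

# Hypothetical data: X = age, Y = some outcome score
X = np.array([20, 25, 30, 35, 40, 45, 50], dtype=float)
Y = np.array([58, 60, 65, 63, 70, 74, 78], dtype=float)

# Fit the least-squares line Y' = c + m*X
m, c = np.polyfit(X, Y, 1)
Y_pred = c + m * X

TSS = np.sum((Y - Y.mean()) ** 2)        # total sum of squares
ESS = np.sum((Y - Y_pred) ** 2)          # error (residual) sum of squares
RSS = np.sum((Y_pred - Y.mean()) ** 2)   # regression (explained) sum of squares

r_squared = RSS / TSS
# RSS + ESS equals TSS (up to floating-point error)
```

Note the partition: the explained and unexplained pieces add back up to the total, which is why RSS/TSS is a proportion between 0 and 1.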
What is the multiple coefficient of determination (R2)?
Coefficient of determination (R2) is the proportion of total variance explained by the model
- When it is high, the regression line is close to the actual points: the variables are capturing the variance around the rough estimate (the mean), so the model is doing well at explaining the variance. When it is low, it is not.*
- Tells us how strongly the Y variable is related to the X variable.*
How do you know if (R2) is meaningful (significant)? (e.g. is 20% of the variance meaningful?)
F test
F = ratio of systematic (explained) variance to error variance
i.e. the regression (between) component vs the error (within) component, each divided by its degrees of freedom
If the explained component is sufficiently larger than the error component, the model is significant
What does a significant F ratio mean?
Your model is significant in explaining a meaningful amount of variance in the results (Y)
When is a small F ratio still likely to be significant?
When the sample size is large (large degrees of freedom lower the critical F value)
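A sketch of why sample size matters, using scipy with hypothetical sample sizes: the critical F value needed for significance shrinks as the error degrees of freedom grow.

```python
from scipy.stats import f

# Critical F value (alpha = .05) for a model with 2 predictors (df_reg = 2).
# df_error = N - k - 1, so it grows with sample size N.
crit_small_n = f.ppf(0.95, 2, 20 - 2 - 1)     # N = 20 (hypothetical)
crit_large_n = f.ppf(0.95, 2, 1000 - 2 - 1)   # N = 1000 (hypothetical)

# crit_large_n is smaller, so a modest F ratio that falls short in a small
# sample can still be significant in a large one.
```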
How do you interpret a regression equation?
Example: Y = 45.67 + 0.67Age
(There is always an exam question on this.)
This means that each 1-unit increase in age is associated with a 0.67-unit increase in Y.
What does the coefficient (.67) mean?
Each unit increase in age is associated with a .67 increase in Y
What is a Beta Value?
Standardised values that can be compared.
What happens when you have a strong beta value?
The stronger the beta value, the greater the relative importance of that predictor
How do you calculate a Beta Value (standardise the coefficients)?
Multiply each coefficient by the SD of its predictor divided by the SD of the dependent measure (beta = b x Sx/Sy).
What is the main issue with a regression coefficient?
It doesn’t tell you which variable is most important, there’s no effect size
The size of the coefficient will differ depending on the measure
Comparing regression coefficients tells you nothing about the importance of the variable.
How do you understand its importance?
You have to standardise the coefficients.
Multiply each coefficient by the SD of its predictor divided by the SD of the dependent measure.
This is a standardised beta.
What are residuals in regression?
The variance you haven’t explained
The actual score minus the predicted score
The name for (R2) is…
Coefficient of determination
What are the two “principles” of multiple regression?
Explanation vs prediction
Prediction: Doesn’t care about theory, just what group you belong to
(e.g. gathering data online to predict behaviour, google)
More practical, which people are most likely to be e.g. problem gamblers
Explanation: Explaining what variable is the best predictor
E.g. Bronfenbrenner's model: what are the variables that are most likely to impact child behaviour?
Which of the levels of influence is MOST influential when I test them against each other, how much variance is attributed to each predictor variable
What is Multicollinearity?
Means that many of your predictor variables are correlated with each other
It means that when you explain variance, x1, x2 and x3 are all related, so each individually correlates with Y, but when you put them together they all eat up each other's variance since they are related.
What is the function for a simple linear regression?
Y = mX + c
Where m is the slope
and c= the Y-intercept
X is the independent variable
Y is the dependent variable.
How do we obtain a “line of best fit”?
method of least squares
involves the minimization of the squared deviations between the actual scores of Y vs. those predicted by the resultant regression equation (Y’).
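A minimal sketch of the closed-form least-squares estimates, on made-up data:

```python
import numpy as np

# Hypothetical data for Y' = c + m*X
X = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
Y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

# Closed-form least-squares estimates: the slope is the ratio of the
# X-Y co-deviation to the X deviation, and the line passes through the means.
m = np.sum((X - X.mean()) * (Y - Y.mean())) / np.sum((X - X.mean()) ** 2)
c = Y.mean() - m * X.mean()
Y_pred = c + m * X

# These estimates minimise sum((Y - Y')**2); any other slope/intercept
# gives a larger error sum of squares.
```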
Regression involves calculating three sums of squares
What are they?
Total sum of squares, Error sum of squares, Regression sum of squares.
What is the total sum of squares?
Total SS = difference between actual scores and the mean value of Y.
What is the error sum of squares?
Error SS (unexplained) = difference between Y’ and the actual values of Y.
What is the regression sum of squares?
Reg SS (explained) = difference between Y’ (predicted) and the mean of Y.
In linear regression, what is the coefficient of determination?
What is the value equal to?
The R-squared value.
The value is equal to:
regression sum of squares (variance explained) / total sum of squares
What is the F-value in a linear regression?
This is the ratio of the Regression mean square to the Error mean square.
What is the formula for getting the F-value?
Regression mean square = RSS/ DFregg
ie. divide RSS by the degrees of freedom (of that component)
* RSS = regression sum of squares*
* DF = degrees of freedom*
How do we obtain degrees of freedom for regression mean square?
DFregg = No. coefficients estimated (including the constant) – 1.
What is the formula for the error mean square (for obtaining the F-value)?
Error mean square = ESS / DFerror
ie. the error sum of squares divided by its degrees of freedom, where DFerror = Total observations (N) – No. coefficients (k) – 1.
The F-value is then Regression mean square / Error mean square.
How do we obtain the degrees of freedom for the error sum of squares?
DFerror = Total observations – No. of predictor coefficients – 1
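Putting the pieces together on made-up data (a single predictor, so the degrees of freedom for the regression component is 1):

```python
import numpy as np

# Hypothetical single-predictor example: F = (RSS/df_reg) / (ESS/df_error)
X = np.array([20, 25, 30, 35, 40, 45, 50, 55], dtype=float)
Y = np.array([50, 54, 55, 60, 59, 66, 68, 71], dtype=float)

m, c = np.polyfit(X, Y, 1)
Y_pred = c + m * X

RSS = np.sum((Y_pred - Y.mean()) ** 2)   # regression sum of squares
ESS = np.sum((Y - Y_pred) ** 2)          # error sum of squares

k = 1                      # number of predictors
df_reg = k                 # (coefficients incl. constant) - 1
df_error = len(Y) - k - 1  # N - k - 1

reg_ms = RSS / df_reg      # regression mean square
err_ms = ESS / df_error    # error mean square
F = reg_ms / err_ms
```

For a simple regression this F equals (r-squared / (1 - r-squared)) x df_error, which is a handy sanity check.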
Interpret the following linear regression example
Y= 45.67 + 0.67Age
(Important: there is always an exam question on this.)
For each 1-unit increase in age, Y increases by 0.67 units.
When we are using a linear regression equation, do we use standardised or unstandardised values?
Unstandardised
We only standardise afterwards to obtain some measure of the relative importance of variables
In linear regression, we standardise coefficients to understand the relative importance of each variable.
How do we standardise coefficients?
What is this called?
By multiplying them by the ratio of the standard deviation of the predictor to the standard deviation of the dependent measure
This is called the Beta value.
ie. beta = b x (Sx / Sy)
Sy = SD of dependent measure
Sx = SD of particular predictor
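A sketch on made-up data; for a single-predictor regression the resulting beta equals the Pearson correlation, which is a handy check:

```python
import numpy as np

# Hypothetical predictor and dependent measure
X = np.array([20, 25, 30, 35, 40, 45, 50], dtype=float)  # predictor
Y = np.array([58, 60, 65, 63, 70, 74, 78], dtype=float)  # dependent measure

b, c = np.polyfit(X, Y, 1)                 # unstandardised coefficient b
beta = b * (X.std(ddof=1) / Y.std(ddof=1)) # standardised beta = b * Sx/Sy

# With only one predictor, beta is identical to the Pearson r between X and Y.
r = np.corrcoef(X, Y)[0, 1]
```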
Once we obtain beta values for the coefficients of a linear regression, what test do we use to determine the relative significance of a coefficient in comparison to others?
t-test
What is the difference between a simple linear regression and a multiple regression?
Simple linear regression: 1 predictor variable
Multiple linear regression: 2 or more predictor variables
Exactly the same coefficients, statistical tests etc. apply to a multiple regression as a simple linear regression. The main difference lies in the selection of different methods for entering variables into the equation.
What are the two classifications to consider in multiple regression?
hierarchical vs. statistical
What order do we add the variables in with hierarchical regression?
Variables are entered in the order that is theoretically important.
What order do we add the variables in with statistical regression?
Variables are entered according to a specific statistical criterion
e.g., the one with the next highest correlation with the dependent measure.
Which is considered more robust, hierarchical or statistical regression?
Hierarchical
You are controlling for what goes in and in which order, based on a systematic or theory-driven model.
There are two types of adding variables to a multiple regression analysis.
What are those two types?
Standard vs. Stepwise
Standard: In what is called ‘ordinary least squares regression’, all variables are entered at once.
Stepwise: In stepwise procedures, including hierarchical or theory-driven entry procedures, the variables go in step by step.
What are the 3 types of stepwise regression?
- Forwards method: variables go in according to the highest first-order and then partial correlation.
- Backwards method: all variables go in, then the one with the lowest partial correlation gets removed, until there is no significant change in R-squared.
- Stepwise: combination of backwards and forwards.
Stepwise is considered “dodgy”, why?
Requires a lot more power + atheoretical + open to wild goose chases if a correlation is a Type 1 error.
It is influenced by Type 1 errors.
- Things can be significant by chance.*
- This creates model inconsistency or unreliability.*
Why is hierarchical regression not considered “dodgy”?
It is based on theory and not on chance.
No chance of the model being based on type 1 errors.
What do squared semi-partial correlations tell us?
How much variation in the dependent measure that particular predictor uniquely explains.
As squared semi-partial correlations tell us how much variation in the dependent measure that particular predictor uniquely explains, adding up all the squared part correlations will add up to R-squared (variance explained)
True or False?
False
There might be small amounts of variation explained by non-included variables (not in the equation).
A lot of the variance might be shared by 2 or more predictors.
What are the two types of relationships that can be found in regression models?
Mediators
Moderators
What is a mediated relationship?
Where a variable mediates the relationship between two variables.
e.g. B mediates the relationship between A and C. Variable B carries the relationship between A and C.
A only correlates with C because A gives rise to B, which in turn, gives rise to C.
Is this example a mediated or a moderated relationship?
Example: No. of work hours (A) might correlate with decreases in work satisfaction (C). However, this may only occur because increases in work hours lead to increases in stress levels (B), which is what decreases work satisfaction.
Mediated.
A and C are only correlated when B exists
When can you test for mediation?
An analysis of mediation only makes sense if all 3 variables are correlated at least moderately.
If this weren’t the case, then the effect probably wasn’t there.
The Baron and Kenny (1986) Method is a test of…
Mediation
How would the Baron and Kenny (1986) method operate in this example?
We run 2 regressions
R1: Run a regression with the number of work hours as a predictor of Work satisfaction
R2: Run a regression with both variables in the equation.
Then, compare the beta coefficients for No. hours between R1 and R2.
Usually, the beta value for No. hours will be higher in R1; if it has gone down in R2, partial mediation has occurred. If it is fully reduced to 0, full mediation has occurred.
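The two-regression comparison can be sketched with simulated data (the variable names and the data-generating numbers here are made up):

```python
import numpy as np

# Simulated example: A = work hours, B = stress (mediator),
# C = work satisfaction. C depends on A only through B.
rng = np.random.default_rng(0)
A = rng.normal(40, 5, 200)
B = 0.8 * A + rng.normal(0, 2, 200)    # A drives stress
C = -0.6 * B + rng.normal(0, 2, 200)   # stress drives satisfaction down

def standardised_coefs(y, *xs):
    # z-score everything so the coefficients come out as betas
    z = lambda v: (v - v.mean()) / v.std()
    X = np.column_stack([z(x) for x in xs])
    coefs, *_ = np.linalg.lstsq(X, z(y), rcond=None)
    return coefs

beta_A_alone = standardised_coefs(C, A)[0]       # R1: A only
beta_A_with_B = standardised_coefs(C, A, B)[0]   # R2: A and B together
```

Because the data were generated so that A affects C only through B, the beta for A should collapse toward zero once B enters the equation, i.e. (near-)full mediation.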
What is the difference between a partial mediation and full mediation
When doing a mediation analysis (Baron and Kenny Method), if, in the second equation, the beta coefficient has reduced in size, it is partial mediation. If it has reduced to 0, it is full mediation.
If there has been a partial mediation, how can we tell if the mediation is meaningful?
Sobel Test
This test can be used to test differences in the magnitude of beta coefficients between the first and second equations.
When is using a Sobel Test appropriate?
When you have a very large sample and where you can assume normality in the product term used to capture the indirect effect.
(i.e., product term of the coefficients corresponding to pathways between the predictor-mediator- outcome variable).
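A sketch of the Sobel statistic with hypothetical path coefficients and standard errors (the a, b and se values below are invented for illustration):

```python
import math

# a  = coefficient for predictor -> mediator, with standard error se_a
# b  = coefficient for mediator -> outcome (controlling for the predictor),
#      with standard error se_b  (all values hypothetical)
a, se_a = 0.50, 0.10
b, se_b = -0.40, 0.12

# Sobel z for the indirect effect a*b
z = (a * b) / math.sqrt(b**2 * se_a**2 + a**2 * se_b**2)
# |z| > 1.96 would suggest the indirect (mediated) effect is significant
```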
What should you do if there is more than one mediator in your regression model?
A lot of people test these individually.
If you want to determine which one is the best mediator (based on the assumption that the two are correlated), then it is better to run them all in the same model.
What is a moderator in a regression model?
A moderator is a third variable which influences the nature or magnitude of the relationship between two other variables.
How do we test for moderation in a regression model?
By testing for a significant A x B interaction.
- We can obtain an interaction term simply by multiplying A and B’s scores together to give a product term.*
- We then conduct a hierarchical analysis. On Step 1, we enter the main effects (A) and (B), and then the product term is entered on Step 2.*
- The idea is to show how much additional variation can be explained, or whether the interaction term, shows anything above and beyond what is already explainable in terms of main effects.*
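These steps can be sketched with simulated data (the variables and effect sizes are made up):

```python
import numpy as np

# Simulated moderation: the slope of A on Y differs by group B.
rng = np.random.default_rng(1)
n = 300
A = rng.normal(0, 1, n)                          # e.g. a centred predictor
B = rng.integers(0, 2, n).astype(float)          # e.g. a 0/1 group variable
Y = 0.4 * A - 0.5 * A * B + rng.normal(0, 1, n)  # slope depends on B

def r_squared(y, predictors):
    # OLS R-squared with an intercept column
    X = np.column_stack([np.ones(len(y))] + list(predictors))
    coefs, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ coefs
    return 1 - (resid @ resid) / ((y - y.mean()) @ (y - y.mean()))

r2_step1 = r_squared(Y, [A, B])             # Step 1: main effects only
r2_step2 = r_squared(Y, [A, B, A * B])      # Step 2: add the product term
r2_change = r2_step2 - r2_step1             # variance the interaction adds
```

The R-squared change at Step 2 is the additional variation explained by the interaction term beyond the main effects.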
What is a significant interaction in a regression?
A significant interaction means that the relationship between two variables, as expressed in the standardised slope coefficient (beta), is not consistent across the levels of the other factor.
- For example, the coefficient for Age might be 0.40 for females (i.e., increases in age increase predicted memory) and –0.10 for males (i.e., increases in age for males slightly decrease predicted memory function).*
- The main thing is that the two coefficients vary significantly.*
What is the best way to analyse a regression interaction?
- Explain how you would run a regression.*
- e.g. Memory as predicted by age comparing males and females*
Break it down into simple linear effects.
We select males only and then run a simple linear regression (Memory as predicted by age). This gives us an equation.
Then we do the same with females only.
We thus have two equations of the form Memory = Constant + b x Age.
By slotting in some made-up values for age (e.g., 20, 25, 30, up to 80), we can get predicted memory scores for males and females separately. We then plot these two functions, which gives a clear depiction of the nature of the interaction.
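A sketch of that final step with two hypothetical within-group equations (the constants and slopes here are invented):

```python
import numpy as np

# Two hypothetical within-group equations: Memory = constant + b * Age,
# fitted separately for females and males.
ages = np.arange(20, 81, 5, dtype=float)   # made-up ages 20, 25, ..., 80

memory_female = 60.0 + 0.40 * ages   # hypothetical female equation
memory_male = 75.0 - 0.10 * ages     # hypothetical male equation

# Plotting these two predicted lines (e.g. with matplotlib) depicts the
# interaction: the female line rises with age while the male line falls.
```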
What are the 4 assumptions of linear regression?
1. Homoscedasticity (The variance of residuals is the same for any value of X)
2. Normality (For any fixed value of X, Y is normally distributed)
3. Linearity (The relationship between X and the mean of Y is linear)
4. Independence or Non-serial error dependence (Observations are independent of each other)