Week 9 (Simple Linear Regression II) Flashcards

1
Q

R squared

A

The proportion of variance in the outcome variable accounted for by the predictor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

F-ratio

A

The ratio of model variance to error variance, whether the regression model is significant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Intercept

A

The value of the outcome variable when the predictor = 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Slope

A

The rate of change in the outcome variable in relation the change in the predictor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Unstandardised beta (b)

A

The change in Y for a one unit change in X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Standardised beta (β)

A

The standardised change in Y for one standard deviation change in X

-Another measure of slopes
-B0 (intercept) is always 0
-B1 (the slope): as X increases by one standard deviation, Y changes by B1 of a standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does an output of Standard Coefficients Beta mean (e.g, -.344)

A

In this example: As X increases by one standard, Y decreases by .34 standard deviation.

Look at standard deviation readings to see what this means in context

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When to use standardised OR unstandardised beta

A

Unstandardised b
-When you want coefficients to refer to meaningful units
-When you want a regression equation to predict values of Y

Standardised β (independent of units)
-When you want an effect size measure, e.g., small/medium/large β is equivalent to small/medium/large r(.1/ .3/ .5)
-When you want to compare the strength of relationship between the predictor and the outcome (across predictors measured in different ways)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Covariance

A

-The extent to which variables co-vary (change together)
-High covariance: means there is a large overlap between the patterns of change (variance) observed in each variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Outliers influence of regression

A

-Affect the model’s ability to predict all cases
How influential an outlier is depends on
-Distance between Yobs and Ypred (Residual)
-Leverage (unusual value on predictor)

Cases with standardised residuals or predictors in excess of +- 3.29 (p <0.001) have the potential to be influential outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How to check for outliers

A

Standardised residuals (Std. Residual) should have no extreme values (be smaller than 3.29)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Assumptions of linear regression

A
  1. Linearity
    -The outcomes (continuous variable) is linearly related to the predictors
  2. Independence
    -Observations are randomly and independently chosen from the population
  3. Normality of residuals
    -The residuals are normally distributed
  4. Homogeneity of variance (homoscedasticity)
    -The variability of the residuals is the same for all levels for the predictors
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Independence assumption (linear regression)

A

-This assumption means that the residuals in your model are not related to each other.
-The residuals are NOT independent of each other in cases such as repeated observations on the same subject, or observations from related subjects (e.g twins, students in the same class)
-If this assumption is violated then the model standard errors (SEs) will be invalid, as will the conference intervals (CIs) and significance tests based upon them
-Ensure independent sampling in your design - subjects randomly and independently chosen from the populations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Residual plot

A

X = ZPRED (standardised predicted value)
Y= ZRESID (standardised residual)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Normality assumption (Linear regression)

A

-The residuals (not IV or DV) should be normally distributed, can be done using histogram and Normal probability plot
-In small samples a lack of normality invalidates confidence intervals and significance tests , whereas in large samples it will not because of the central limit theorem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Central limit theorem

A

The distribution of sample means approximate a normal distribution as the sample size gets larger, regardless of the population’s distribution.

17
Q

The homogeneity of variance assumption

A

-The variability of the residuals should be the same for all values of Ypred
-Violating this assumption invalidates CIs and significance tests