8: Regression Flashcards

1
Q

When is linear regression used?

A

When the relationship between x and y can be described with a straight line. Allows us to estimate how much y will change as a result of a given change in x.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the variable being predicted known as (y)?

A

Outcome, dependent, criterion.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the variable being used to predict known as (x)?

A

Predictor, independent, explanatory.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the assumptions of linear regression?

A

Normal distribution, linear relationship, no outliers, and sensitive to range restrictions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the three stages of linear regression?

A
  1. Analysing the relationship.
  2. Proposing a model to explain the relationship.
  3. Evaluating the model.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Regression equation

A
y = bx + a.
a = intercept, b = slope.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the two models, in relation to goodness of fit?

A

The simple model and the improved model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is total variance and how is it calculated?

A

Variance not explained by the mean of y (simple model). Calculate difference between each data point and the mean, square them, and add them together (SST).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is residual variance and how is it calculated?

A

Variance not explained by the regression model. Calculate difference between predicted value of y and actual value, square them, and add them together (SSR).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does the difference between SST and SSR represent?

A

The improvement in prediction using the regression model compared to the simplest model. An ANOVA can be used to evaluate this.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

F =

A

MSM / MSR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does R represent?

A

The strength of the relationship between x and y.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is adjusted R^2?

A

R^2 adjusted to account for degrees of freedom.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is multiple regression and when is it used?

A

Allows us to assess the influence of several predictor variables on the outcome variable (y). The slopes of each predictor variable are combined.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the regression equation for multiple regression?

A

y = (b1 x x1) + (b2 x x2) + (b3 x x3) + a.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What assumptions are made for multiple regression?

A

Sufficient sample size, outcome variable not normally distributed, predictor variables linearly related to the outcome variable.

17
Q

What is multicollinearity?

A

Predictor values which are highly correlated with one another, may be an issue.