Multiple Regression Flashcards

1
Q

What does a confounder affect?

A

The exposure-outcome association because it is a common cause the of exposure and outcome.

Controlling for the confounder gives the adjusted exposure-outcome association.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does a mediator affect?

A

The E-O associations because it is caused by the exposure which in turn affects the outcome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What do mediators address?

A

How or why the exposure (E) causes outcome (O).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

When is a value classified as a moderator?

A

If E-O association varies depending on the values/levels of Z.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What do Moderators address?

A

When or for whom the exposure (E) causes outcome (O).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What can bias in an estimated E-O association arise due to?

A

Many reasons, one common reason is not taking into account the effects of confounders in estimating the E-O association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is an unadjusted association?

A

E-O association is estimated ignoring the possible effect of C

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is an adjusted association?

A

E-O association is estimated by taking account of the possible effect of C

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are possible ways to adjust for confounders?

A

Effects of confounders can be dealt with:

at the design stage, or

at the analysis stage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How can confounders be dealt with at the study design stage?

A

Matching - observational studies

Randomisation of exposure/treatment – clinical trials

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How can confounders be dealt with at the analysis stage?

A

Stratified analysis - works for one/few categorical confounders

Statistical modelling (multiple regression analysis)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does a multiple regression model investigate?

A

The relationship between an outcome variable and more than one predictor variables simultaneously.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What must an outcome variable be for linear regression?

A

A continuous variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What must a Predictor/exposure variable be?

A

Continuous or categorical.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the multiple regression formula?

A

Y = B0 + B1x1+ ….. +BkXk+ E

Coefficients (β’s) - partial regression coefficients

βj represents the change in average y for one unit change in xj (holding all other x’s fixed)

In other words, βj is the effect of the predictor xj adjusted for the effects of all other predictors in the model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How can you calculate the adjusted effect of an effect and outcome relationship?

A

2 options:

Option 1 (multiple regression) is more practical and almost universally used.

Option 2 (regression of residuals) is rarely used in practice, but is useful to understand the underlying mechanism of adjustment.

17
Q

How can we adjust the E-O association for C?

What is the resulting model?

A

Include C in the regression model of O on E as an additional predictor, which will automatically adjust the E-O association for C.

The resulting model is a multiple linear regression model (more than one predictors):
O = B0 + B1E + B2C + E

The coefficient of E (i.e., B1) in this case will represent the adjusted E-O association controlling for the effect of C .

18
Q

For multiple linear regression, the fitted model is what instead of a line?

A

Hyperplane

19
Q

What can a multiple regression deal with?

A

Any number of confounders.

20
Q

How can all confounders be adjusted for in a multiple regression model?

A

Including them simultaneously as additional predictors.

21
Q

What can restrict the number of variables that can be included in a regression model?

What is a way to resolve this?

A

Sample size

A rule of thumb is to ensure that there are more than 10 observations (data points) per predictor variable. A sample of size 100, for example, will allow us to consider up to 10 predictors.

22
Q

What does R-squared (R2) measure?

A

Amount of variance in outcome (y) that can be explained by differences in predictor variables.

23
Q

What is R2 an indicator of?

A

Goodness-of-fit or prediction power of a fitted regression model.

24
Q

What is R2 generally expressed as?

A

Percentage and ranges between 0 and 100 (perfect prediction).

25
Q

What are assumptions of a linear regression?

A

Normality: The error terms are normally distributed.

Variance Homogeneity: The error terms have the same variance irrespective of the values of X (i.e., variance does not depend on X),

Linearity (continuous X): There is a linear relationship between X and Y.

Independence: The observations Y are independent from each other (thus, conditional on X, the error terms are also independent).

26
Q

How can we check for assumptions of a linear regression?

A

There are a number of diagnostics that use the residuals (differences between observed and predicted values):

Normality: Histogram, Normal probability plot of residuals.

Homogeneity of variance: Plot residuals vs. predicted values (called Residual plot).

Linearity: Partial residual plot –

Shows dependent variable plotted against each predictor variable, both adjusted for all other predictor variables.

27
Q

What do partial residual plots show?

A

The net relationship where the influence of other variables is partialled out.

Plots suggest a linear relationship between each predictor variable and the outcome variable