Lecture 9: Effect modification Flashcards

1
Q

What is a modifier?

A

A modifier alters the relationship between the independent and dependent variable

The level of influence of the independent variable varies with different values of the modifier

E.g. the effect of exercise on weight loss - will be different for men/women

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the interaction term?

A

The cross product of X1 and Z (the modifier)

This new product is added to the MLR

Y = b0 + b1x1 + b2z + b3 (x1 X z) + e

reject null hypothesis if b3 is significantly different to 0
- significant interaction effect or significant effect modification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How do you calculate an interaction effect in SPSS?

A

To work out an interatction effect in SPSS - compute new variable - include in a MLR

Check if B3 is significantly different to 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How does the MLR interpretation vary if an interaction effect exists?

A

B1 and B2 - main effects

If B3 is significant there is an interaction effect - there is a different in effect of X1 on Y with different values of Z

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How are effects estimated in the presence of an interaction?

A

No longer can just consider the main effects

  • Only need to consider p values for main effects if interaction coefficient = 0
  • therefore only use interaction term and the variable you want to predict (ignore the other main effect)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do we assess interaction if there is more than two levels on the categorical variable?

A

Code them into dummy variables

- Transform and recode into different variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Can interaction terms also be calculated for two continuous independent variables?

A

Yes
- as before compute a new variable by creating a cross-product term and include this in your MLR

Y = B0 + B1X1 + B2X2 + B3 (X1 * X2)

If an interaction existed

To work out the effect of X1 on Y - do the calculation:

Effect of B1 on Y = B1 + B3 x X2

where value of X2 would be provided (i.e work out effect of X1 on Y if X2 = 5)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How can Continuous x Continuous interactions be presented?

A

Continuous x Continuous interactions can be presented

  • Tabular format using Q1, Q2, Q3
  • Graphical format
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is an outlier?

A

An outlier is an observation that lies an abnormal distance from other values in a random sample from the population

Outliers can be identified by sorting cases, running descriptives or graphing data (boxplot, scatterplot and a regression variable plot (scatterplot with histogram ontop)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Are all outliers harmful?

A

No some may be more influential than others

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Describe Tukey’s method for identifying outliers?

A

Uses the IQR (middle 50% of the dataset)

  • Lower outer, lower inner, upper inner & upper outer
  • Extreme outliers are more than 3 x above/below the 1st/3rd quartile
  • Mild outliers are between 1.5-3 x above/below the 1st/3rd quartile

Lower Outer = Q1 - 3 x IQR
Lower inner = Q1 - 1.5 x IQR

Upper inner = Q3 + 1.5 x IQR
Upper outer = Q3 + 3 x IQR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How can outliers be identified using standard deviation?

A
  1. 27% data lies within -1/+1 SD from mean
  2. 45% data lies within -2/+2 SD from mean
  3. 73% of data lies within -3/+3 SD from mean
  • -> Data would need to be normally distributed for this to happen
  • -> Values larger than 2-3 more or less standard deviations from mean would be considered an outlier
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What value of standardised residual would need to be considered an outlier?

A

If standardised residual is larger than 3 (+/-)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are DFBeta and DFFIT?

A

DFBeta measures the change in the estimated coefficient Bj due to changing that observations

Standardised DFBeta = DFBeta / SE (est Bj)

DFFIT is the change in predicted value (i) due to deleting that observation = DFFIT / SE (i)

Outliers are considered if their absolute values DFFIT and DFBeta > 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How is DFBeta calculated?Wh

A

Run two MLR with and without outlier

Work out the difference in the two Beta / SE (for the coefficient with outlier excluded)

Can also be calculated using SPSS - click SAVE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What can outliers do?

A

Increase error variance and make not normally distributed

Increase chance of T1 or T2 errors

Contribute to the assumptions of MLR being violated

17
Q

How should outliers be handled?

A

If can be corrected - re-measured to fix

If not in a variable of interest - can be excluded

If important for the population it shouldn’t be removed - can run two analysis with and without and document the findings