Multiple Imputation Flashcards

1
Q

What does multiple imputation allow that single imputation doesn’t?

A

Allows investigator to obtain valid assessments of uncertainty

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Basic idea of multiple imputation?

A

Impute each missing value several times, thus creating M>1 complete data sets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Draw the schematic for multiple imputation

A

see notes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Outline the three steps in multiple imputation in as much detail as possible

A

1)
- create M copies of incomplete data set
- use an appropriate method to impute missing values in each copy (same method for each copy)
- imposed data sets are composed of fixed proportion (observed data) and a missing proportion (imputed values)
- each copy will be different
2)
- for each complete copy of data, carry out statistical analysis as you would if no missing data
- store parameter estimates and variances (or variance-covariance matrix if more than one parameter)
- estimate of θ obtained by m-th complete data set is θhat(m) and estimated variance by U(m)
3)
- results of M analyses are combined into single analysis that takes into account the imputation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Give the combined estimate of θ

A

θhat(MI) = 1/M * sum θhat(m)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Give the between imputation variability

A

B = 1/(M-1) ( sum [ (θhat(m) - θhat(MI) ) ^2 ] )

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Give the within imputation variability

A

Wbar = 1/M * sum [ U(m) ]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Give overall variability in multiple imputation

A

Vmi = Wbar + B + B/M

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How do you find the (1-α)100% confidence interval for multiple imputation

A

θhatMI ± tv(α/2) * sqrt(Vmi)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Summarise where the variability in θhatMI comes from

A
  1. Wbar, variance since we’re taking a sample
  2. B, extra variance since missing values in the sample
    3, B/M, extra simulation variance caused by the fact θhatMI itself is estimated for finite M
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are traditional choices for M?

A

3, 5 or 10

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the relative efficiency of using M samples?

A

M / (M + λ), where λ is the fraction of missing data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly