Single Imputation Methods Flashcards

1
Q

Describe complete case analysis (note not an imputation method)

A
  • excludes data for any case that has one or more missing value
  • data treated as if cases with missing values simply aren’t there
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When is CCA good?

A

Can produce unbiased estimates of regression coefficients provided missingness is a function of a prediction variable and not a response variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Describe available case analysis (note not an imputation method)

A
  • (X1, X2) data matrix where only X2 is subject to missingness
  • all cases would be used for mean and variance of X1, but only complete cases contribute to estimates for X2 and covariance of X1 and X2
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Problems with ACA?

A
  • not clear which sample size should be used to calculate standard errors
  • estimates can be biased if data not MCAR
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the main issue with single imputation methods?

A

Standard errors of estimates tend to be too low, hence single imputation is not recommended

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Describe mean imputation

A
  • each missing value is imputed by overall mean of observed values for that variable
  • can use mode for categorical data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Why is mean imputation worse than CCA?

A

Reduces variability of the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Describe conditional mean imputation

A
  • replaces missing value with a predicted conditional mean from a regression equation
  • first step is to fit a regression model, where the variable with the missing values is regressed on the observed ones, then find predictions for missing values
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Issue with conditional mean imputation?

A

There is no random error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Describe stochastic regression imputation

A
  • regression model gets added error term, which is a random value from N(0 , σhat^2)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Does stochastic regression imputation fix the issue of attenuating standard errors?

A

No

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Describe hot deck imputation

A
  • imputes values from “similar” individuals
  • replaces the missing value with a random draw from a subsample of individuals that have similar values on a set of matching variables
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the issue with hot deck imputation

A

Can produce substantially biased estimates of correlations and regression coefficients, and under estimates standard errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Describe last observation carried forward

A
  • in longitudinal studies, imputes missing value with its value at a previous time point
  • assumes that values do not change after the last observed measurement or during the intermittent period where scores are missing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Issues with last observation carried forward?

A

May produce biased results even when MCAR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Describe iterative regression imputation

A
  1. impute all missing values in Y1, Y2, …, Yk with some crude imputation method
  2. impute Y1 given Y2, …, Yk and Yobs using stochastic regression imputation
  3. impute Y2 given Y1, Y3, …, Yk and Yobs using newly imputed values for Y1
  4. ….
  5. impute Yk given Y1, …, Y(k-1) and Yobs using newly imputed values for Y1, …, Y(k-1)
  6. loop until convergence
17
Q

When does iterative regression imputation work?

A

only <5% missing values