Lecture 2 - Missing Data Flashcards
What is missing data?
What is the criteria for an inadequately sampled participant or variable?
The absence of data points in a dataset, which can pose significant challenges in data analysis.
Over 50% of their data is missing.
What is selection bias?
Selection bias occurs when the sample of participants included in a study is not representative of the population intended to be analyzed, leading to spurious effects.
What are the types of missing data?
Missing Completely at Random (MCAR), Missing at Random (MAR), and Missing Not at Random (MNAR).
What is MCAR?
MCAR means the probability of a data point being missing is independent of any observed or unobserved data.
What is MAR?
MAR means the probability of a data point being missing is related to some observed data but not to the missing data itself.
What is MNAR?
MNAR means the probability of a data point being missing is related to the value of the missing data itself.
How can missing data reduce power?
Missing data can reduce statistical power due to a smaller effective sample size.
How can missing data cause biased estimates?
Systematic missing data can lead to biased parameter estimates.
What is Little’s MCAR test?
A statistical test used to determine if data are missing completely at random.
What are separate variance t-tests used for?
Used to follow up significant results from Little’s MCAR test to further investigate the nature of missing data.
What is estimation maximization (EM)?
A method for imputing missing data based on maximum likelihood estimation.
What is multiple imputation (MI)?
A method that involves creating multiple datasets with imputed values to account for the uncertainty in the imputations.
How should missing data be handled after a study?
Be transparent about the extent and nature of missing data.
What are SPSS checks for missing data?
Checking participant sampling via the number and proportion of missing data, and assessing the adequacy of variable assessment.
How can inadequately sampled participants be dealt with?
Inadequately sampled participants can be handled by:
1. Deletion: Removing them from the analysis if their missing data is not systematic and does not introduce bias.
2. Imputation: Use imputation methods to estimate missing data, provided the assumptions for imputation are met.
3. Sensitivity Analysis: Conduct sensitivity analysis to assess the impact of including or excluding these participants on the study’s conclusions.