Lecture 2 - Missing Data Flashcards
What is missing data?
What is the criteria for an inadequately sampled participant or variable?
The absence of data points in a dataset, which can pose significant challenges in data analysis.
Over 50% of their data is missing.
What is selection bias?
Selection bias occurs when the sample of participants included in a study is not representative of the population intended to be analyzed, leading to spurious effects.
What are the types of missing data?
Missing Completely at Random (MCAR), Missing at Random (MAR), and Missing Not at Random (MNAR).
What is MCAR?
MCAR means the probability of a data point being missing is independent of any observed or unobserved data.
What is MAR?
MAR means the probability of a data point being missing is related to some observed data but not to the missing data itself.
What is MNAR?
MNAR means the probability of a data point being missing is related to the value of the missing data itself.
How can missing data reduce power?
Missing data can reduce statistical power due to a smaller effective sample size.
How can missing data cause biased estimates?
Systematic missing data can lead to biased parameter estimates.
What is Little’s MCAR test?
A statistical test used to determine if data are missing completely at random.
What are separate variance t-tests used for?
Used to follow up significant results from Little’s MCAR test to further investigate the nature of missing data.
What is estimation maximization (EM)?
A method for imputing missing data based on maximum likelihood estimation.
What is multiple imputation (MI)?
A method that involves creating multiple datasets with imputed values to account for the uncertainty in the imputations.
How should missing data be handled after a study?
Be transparent about the extent and nature of missing data.
What are SPSS checks for missing data?
Checking participant sampling via the number and proportion of missing data, and assessing the adequacy of variable assessment.
How can inadequately sampled participants be dealt with?
Inadequately sampled participants can be handled by:
1. Deletion: Removing them from the analysis if their missing data is not systematic and does not introduce bias.
2. Imputation: Use imputation methods to estimate missing data, provided the assumptions for imputation are met.
3. Sensitivity Analysis: Conduct sensitivity analysis to assess the impact of including or excluding these participants on the study’s conclusions.
What is the difference between selection bias and missing data?
Selection bias concerns who participates in the study, while missing data concerns the data provided by participants.
How can inadequately assessed variables be dealt with?
Inadequately assessed variables can be handled by:
Deletion: Remove the variable from the analysis if it is not crucial to the research question and if its removal does not undermine the study’s validity.
Imputation: If the variable is crucial, use imputation methods to estimate the missing data, ensuring that the assumptions for imputation are met and the imputation method is appropriate for the type of data.
Sensitivity Analysis: Conduct a sensitivity analysis to understand the impact of including or excluding the variable on the study’s conclusions.
What is the impact of systematic missing data?
It can lead to biased estimates and erroneous conclusions.
What is the goal of missing data analysis?
To understand whether missing data will cause problems for analyses and inferences, and to estimate missing data if possible.