Concerns with Sampling and Observational Studies (Exam 1) Flashcards
selection bias
- occurs when individuals in the target population are not sampled at an equal rate
- even when a random sampling design is used, there are other potential sources of bias in the sampling method
- biases can happen at other steps… not just when we’re selecting our sampling
perfect coverage
- when the sampling frame lists everyone in the target population (no missing or extra people)
- only sometimes achievable
undercoverage
occurs when a certain group of individuals in the target population is not included in the sampling frame and therefore given no chance of selection in the sample
overcoverage
occurs when a certain group of individuals not in the target population is included in the sampling frame
nonresponse
- occurs when individuals selected for the sample can’t be contacted/refuse to participate in the study
- important to consider WHY these individuals don’t respond… classify nonresponse into 3 explanations
- depending on the nonresponse mechanism we assume to be true, there may (not) be selection bias introduced to the sample, and we may (not) be able to correct this bias
missing completely at random (MCAR)
- when the probability of nonresponse is the same for all individuals in the sample
- does NOT introduce selection bias (missing respondents are random)
- since we assumes that the nonrespondents are not different in any way from the respondents, the sample of respondents should still be representative of the target population, so we can remove the nonrespondents from our sample
missing at random (MAR)
- when the probability of nonresponse is associated with KNOWN and measured factors
- there IS something special about you about why you’re missing
- assumes that certain groups in the population are more likely to produce nonresponse than others (if these groups are known prior to sampling, then we can oversample them to guarantee proper representation or after sampling, we can adjust the weights assigned to individuals in the sample to guarantee proper representation
missing not at random (MNAR)
- when the probability of nonresponse varies for reasons that are UNKNOWN to us, particularly the level of the variable of interest
- no way to verify why these people are missing
- since we don’t know the cause of nonresponse, very difficult to make adjustments/corrections for this source of missing data
weighting
- procedure that allows us to place a higher/lower emphasis on certain individuals in the sample
- we want the weighted individuals to add up proportionally
what are 3 ways in which selection bias can be introduced into the data collection process?
1) from the construction of the sampling frame (undercoverage & overcoverage)
2) from the methods of selecting individuals from the sampling frame (convenience sampling & voluntary response sampling)
3) from nonresponse
experimental study
deliberately imposes some treatment on individuals in order to observe their responses
observational study
- observes individuals and measures variables of interest, without controlling any factors that might influence the response
- ex: surveys/polls/questionnaires, interviews, examinations
response bias
- occurs when people answer falsely/inaccurately, sometimes due to unintentional behavior by the interviewer
- ex: respondents may lie, especially about illegal/unpopular behavior