Chapters 1-6.2 Flashcards
cluster sample
- first divide population into smaller groups (ideally, these clusters should mirror the characteristics of the population i.e. by location)
- then choose an SRS of the clusters
- all individuals in the chosen clusters are included in the sample
stratified sample
- first classify the population into groups of similar individuals called strata (ex. undergraduate and graduate)
- choose a separate SRS in each stratum and combine these SRSs to form the sample
stratified or SRS
stratified samples give more precise info about population than SRS
what is an SRS
consists of individuals chosen in such a way that every set of individuals has an equal chance of getting selected
what is undercoverage
occurs when members of the population are left out of the process of choosing a sample (don’t participate) ex. calling people’s houses during weekday working hours
what is non-response
occurs when an individual chosen for the sample can’t be contacted or refuses to participate
what is wording of questions
confusing or leading questions can introduce strong bias, and changes in wording can greatly change a survey’s outcome
what are the two main types of bias
- convenience
- voluntary
what is convenience bias
a sample selected by taking the members of the population that are easiest to reach
ex. first people to arrive at a game
what is voluntary response bias
people decided whether to join a sample based on open invitation
what is response bias
systematic pattern of incorrect answers in a sample survey
what is an observational study
observes individuals and measures the variables of interest but does not attempt to influence the responses
what is an experiment
deliberately imposes some treatment on individuals to measure their responses
-only source of fully convincing data to understand cause and effect
what is a lurking variable
a variable that is not among the explanatory or response variables in a study but that may influence the response variable (ex. how long you study for a test)
what is a confounding variable
- when two variables are associated in such a way that their effects on a response variable cannot be distinguished from each other
(ex. as the sales of ice cream go up so do murder rates confounding variable: temperature)