intro to data collection Flashcards
observational study
data collected only by monitoring
researchers collect data on individuals in a way that doesn’t directly interfere with how the data arises and doesn’t attempt to influence the responses
cannot draw causal conclusions from them
experimental study
researchers directly influence the process by which the data arises, usually involving the assigning of participants to one or more treatments
can draw causal conclusions from them
what is the best way to ensure a sample is representative of the population it came from?
to ensure that all the observations that comprise it were selected randomly
explanatory and response variables
explanatory variables (independent variables) are the variables that might affect or give rise to the response variable (dependent variable)
sampling design
describes exactly how the sample is chosen from the population
what do good sampling designs allow us to do
generalize from the sample to the population
(make an inference about the population from the sample data)
simple random sampling
a sample where each possible sample of size n has the same chance of being the sample that is selected
every member in the population has an equal chance of being included and there is no implied connection between members in the sample
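A minimal sketch of simple random sampling, assuming the population can be listed in full. The function name `simple_random_sample` and the population of 100 member IDs are hypothetical, made up for illustration.

```python
import random

def simple_random_sample(population, n, seed=None):
    """Draw n distinct members so that every size-n subset is equally likely."""
    rng = random.Random(seed)  # seed is only here to make the example repeatable
    return rng.sample(list(population), n)

population = list(range(1, 101))  # hypothetical population: member IDs 1..100
sample = simple_random_sample(population, 10, seed=0)
print(sample)
```

`random.sample` draws without replacement, so no member can appear twice in the sample.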
stratified sampling
population is divided into non-overlapping groups called strata, and a random sample is taken from each stratum
works best when there is a lot of variability between strata, but not much variability within each stratum
individuals are divided into groups with similar characteristics
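A minimal sketch of stratified sampling, assuming each individual carries a label saying which stratum it belongs to. The names `stratified_sample` and `stratum_of`, and the student data, are hypothetical.

```python
import random
from collections import defaultdict

def stratified_sample(units, stratum_of, n_per_stratum, seed=None):
    """Split units into strata, then take a simple random sample inside each one."""
    rng = random.Random(seed)
    strata = defaultdict(list)
    for unit in units:
        strata[stratum_of(unit)].append(unit)
    # Sample separately within every stratum so each group is represented.
    return {
        name: rng.sample(members, min(n_per_stratum, len(members)))
        for name, members in strata.items()
    }

# Hypothetical population: 60 freshmen and 40 seniors.
students = [(f"s{i}", "freshman" if i < 60 else "senior") for i in range(100)]
sample = stratified_sample(students, lambda s: s[1], 5, seed=1)
print(sample)
```

Taking the same number from each stratum, as here, oversamples the smaller group relative to its share of the population; sampling in proportion to stratum size is another common choice.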
multistage sampling
involves taking simple random samples within simple random samples
big sample, smaller sample, even smaller sample
some subgroups are oversampled to ensure representation in the sample
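A minimal sketch of multistage sampling with two stages, assuming the population is already grouped (here, students grouped by school). The name `two_stage_sample` and the school data are hypothetical.

```python
import random

def two_stage_sample(groups, n_groups, n_per_group, seed=None):
    """Stage 1: randomly pick groups. Stage 2: randomly pick members inside each picked group."""
    rng = random.Random(seed)
    chosen = rng.sample(sorted(groups), n_groups)  # big sample: whole groups
    return {
        name: rng.sample(groups[name], min(n_per_group, len(groups[name])))  # smaller sample
        for name in chosen
    }

# Hypothetical population: 8 schools with 30 students each.
schools = {
    f"school_{k}": [f"school_{k}_student_{i}" for i in range(30)]
    for k in range(8)
}
sample = two_stage_sample(schools, n_groups=3, n_per_group=5, seed=2)
print(sample)
```

A third stage would repeat the same idea once more inside each selected subgroup.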
bad sampling examples
a convenience sample —> obtained by measuring whatever or whoever is available to be measured
voluntary response sample —> individuals decide whether to participate in study
what survey challenges result from bad sampling?
- undercoverage : leaves out some groups in the population
- nonresponse: individuals who are selected for the sample can’t be contacted or refuse to participate
- response bias : participants respond differently from how they truly feel
confounding variable
variables associated with both the explanatory and response variables
get in the way of being able to make causal conclusions about the relationship between the two variables
provide an alternative explanation for why response occurs
^^ reason why we can’t make causal statements from observational studies
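A small simulation sketch of confounding, under made-up assumptions: a hidden variable `z` raises both the chance of exposure `x` and the chance of outcome `y`, while `x` itself has no effect on `y` at all. All names and probabilities are hypothetical.

```python
import random

def mean_difference(n, randomized, seed=0):
    """Return mean(y | x=1) - mean(y | x=0) in a world where x never affects y."""
    rng = random.Random(seed)
    totals = {0: [0, 0], 1: [0, 0]}  # x -> [sum of y, count]
    for _ in range(n):
        z = rng.random() < 0.5  # confounding variable
        if randomized:
            x = int(rng.random() < 0.5)  # experiment: x assigned by chance, unrelated to z
        else:
            x = int(rng.random() < (0.8 if z else 0.2))  # observational: z drives x
        y = int(rng.random() < (0.7 if z else 0.3))  # y depends on z only
        totals[x][0] += y
        totals[x][1] += 1
    return totals[1][0] / totals[1][1] - totals[0][0] / totals[0][1]

observational_gap = mean_difference(20000, randomized=False)
randomized_gap = mean_difference(20000, randomized=True)
print(observational_gap, randomized_gap)
```

With these probabilities the observational gap should land near 0.24 even though `x` does nothing, while the randomized gap should land near 0, because random assignment breaks the link between `z` and `x`.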
randomized controlled experiment
includes comparison of 2 or more treatments
uses impersonal chance to assign participants to the treatments (random assignment)
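A minimal sketch of random assignment, assuming equal-sized groups are wanted. The name `randomly_assign` and the participant IDs are hypothetical.

```python
import random

def randomly_assign(participants, treatments, seed=None):
    """Shuffle participants, then deal them out to the treatments in turn."""
    rng = random.Random(seed)
    shuffled = list(participants)
    rng.shuffle(shuffled)  # chance, not the researcher, decides the order
    groups = {t: [] for t in treatments}
    for i, person in enumerate(shuffled):
        groups[treatments[i % len(treatments)]].append(person)
    return groups

participants = [f"p{i}" for i in range(20)]  # hypothetical participant IDs
groups = randomly_assign(participants, ["treatment", "control"], seed=3)
print(groups)
```

Dealing from a shuffled list keeps the group sizes balanced; flipping a coin per participant would also be random assignment but could give unequal groups.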
single blind
when the participants don’t know which group they are in
double blind
when the researchers also don’t know which participants are in which group, which controls for researcher expectations