data science - statistics II - sampling Flashcards
sample
a subset from a larger set of data
population
the larger data set or idea of a data set
N and n
Variable for population (N) and sample (n)
random sampling
drawing elements into a sample at random - each member of the population has an equal opportunity of being selected
stratified sampling
dividing the population into strata and randomly sampling from each strata
simple random sampling
sample that results from random sampling w/o stratifying the population - each member of the population has an equal opportunity of being selected
sample bias
a sample that misrepresents the population
when does bias occur
when measurements or observations are systematically in error bc they are not representative of the full population
vast search effect
bias or non-reproducibility resulting from repeated data modeling, or modeling data with large numbers of predictor variables
sample statistic
metric calculated for a sample of data drawn from a larger population