14 - Sampling & Surveys Flashcards
how to estimate unknown population mean µ
use sample mean xbar
N vs. n
N = population size
n = sample size
Good samples
probability samples for which each unit in the pop has a known probability of being in the sample
- SRS = Simple Random Sample = equal probability sample = each unit has the same chance of being in the sample
Sampling error
the error in the estimate due to the fact that we did not see the entire population (did not take a census)
- can be driven to 0 by taking a bigger and bigger sample
Sampling variability
different samples would give slightly different estimates
Non-sampling error
- do not get smaller as the sample size gets larger
- attributable to a systematic lack of representativeness in the sampling mechanism
- best dealt with by avoiding them
Sample size
So long as the sample size is small w/respect to the population size, then the sample size required to reach a certain level of precision is independent of the population size
Ex. n/N <= 0.1
Ex. do you need to take a bigger sample in CA than in WY bc CA has more people?
-no - take the same size sample in each state
Stratified sample
group the population into homogeneous strata, then take an SRS w/in each stratum
- helpful when a few observations may dominate the population total
Cluster samples
good when the pop naturally falls into groups (Ex. houses on a street, children in classrooms)
- can cut time & costs
Sequential/Skip sampling
sample every ith observation
- helpful when data is flowing in a stream (Ex. visitors on a website, containers at a packaging plant)