Chapter 12: Sample Surveys Flashcards
What are the 3 ideas of sampling?
- Examine a part of the whole: A sample can give information about the population.
- Randomize to make the sample representative.
- The sample size is what matters. It’s the size of the sample - and not its fraction of the larger population - that determines the precision of the statistics it yields.
What are some sampling methods?
- Simple random sample (SRS)
- Stratified samples
- Cluster samples
- Systematic samples
- Multistage samples
What are some causes of bias?
- Voluntary response
- Convenience samples
- Bad sampling frames
- Undercover age
- Nonresponse bias
- Response bias
Define ‘Population’.
The entire group of individuals or instances about whom we hope to learn.
Define ‘Sample’.
A (representative) subset of a population, examines in hope of learning about the population.
Define ‘Sample survey’.
A study that asks questions of a sample drawn from some population in the hope of learning something about the entire population. Polls taken to assess voter preferences are common sample surveys.
Define ‘Bias’.
Any systematic failure of a sampling method to represent its population. It is almost impossible to recover from bias, so efforts to avoid it are well spent. Common errors include relying on voluntary response, undercoverage of the population, nonresponse bias and response bias.
Define ‘Randomization’.
The best defense against bias is randomization, in which each individual is given a fair, random chance of selection.
Define ‘Sample size’.
The number of individuals in a sample. The sample size determines how well the sample represents the population, not the fraction of the population sampled.
Define ‘Census’.
A sample that consists of the entire population.
Define ‘Population parameter’.
A numerically values attribute of a model for a population. We rarely expect to know the true value of a population parameter, but we do hope to estimate it from sampled data. For example, the mean income of all employed people in the country is a population parameter.
Define ‘Statistic, sample statistic’.
Values calculated for samples data. Those that correspond to, and thus estimate, a population parameter are of particular interest. For example, the mean income of all employed people in a representative sample can provide a good estimate of the corresponding population parameter.
Define ‘Representative’.
A sample is said to be representative if the statistics computed from it accurately reflect the corresponding population parameters.
Define ‘Simple random sample (SRS)’.
A simple random sample of sample size n is a sample in which each set of n elements in the population has an equal chance of selection.
Define ‘Sampling frame’.
A list of individuals from who the sample is drawn. Individuals who may be in the population of interest, but who are not in the sampling frame, cannot be included in any sample.
Define ‘Sampling variability’.
The natural tendency of randomly drawn samples to differ, one from another. Sometimes - unfortunately - called sampling error, sampling variability is no error at all, but just the natural result of random sampling.
Define ‘Stratified random sample’.
A sampling design in which the population is divided into several subpopulations, or strata, and random samples are then drawn from each stratum. If the strata are homogenous, but are different from each other, a stratified sample may yield more consistent results.
Define ‘Cluster sample’.
A sampling design in which entire groups, or clusters, are chosen at random. Cluster sampling is usually selected as a matter of convenience, practicality or cost.