Part 3 vocab Flashcards
Population
the entire group of individuals or instances about whom we hope to learn
Sample
A (representative) subset of a population, examined in hope of learning about the population
sample survey
a study that asks questions of a sample drawn from some population in the hope of learning something about the entire population. Polls taken to assess voter preferences are common sample surveys
Bias
Any systematic failure of a sampling method to represent its population is bias. Biased sampling method tend to over or underestimate parameters. It is almost impossible to recover from bias, so efforts to avoid it are well spent. Common errors include: Relying on voluntary response, undercoverage of the population, nonresponse bias, and reponse bias.
Randomization
The best defense against bias is randomization, in which each individual is given a fair, random chance of selection.
Sample size
The number of individuals in a sample. The sample size determines how well the sample represents the population, not the fraction of the population sampled.
Census
A sample that consists of the entire population is called a census
Population parameter
A numerically valued attribute of a model for a population. We rarely expect to know the true value of a population parameter, but we do hope to estimate it from sampled data. For example, the mean income of all employed people in the country is a population parameter.
Statistic, sample statistic
Statistics are values calculated for sampled data. Those that correspond to, and thus estimate, a population parameter, are of particular interest. For example, the mean income of all employed people in a representative sample can provide a good estimate of the corresponding population parameters. The term “sample statistic” is sometimes used, usually to parallel the corresponding term “population parameter”
Representative
A sample is said to be representative if the statistics computed from it accurately reflect the corresponding population parameters.
SRS (Simple random sample)
A simple random sample of sample size N is a sample in which each set of N elements in the population has an equal chance of selection.
Sampling Frame
A list of individuals from whom the sample is drawn is called the sampling frame. Individuals who may be in the population of interest, but who are not in the sampling frame, cannot be included in any sample.
Sampling variability
The natural tendency of randomly drawn samples to differ, one from another. Sometimes, unfortunately, called sampling error, sampling variability is no error at all, but just the natural result of random sampling
Stratified random sample
A sampling design in which the population is divided into several subpopulations, or “strata”. Random individuals are then drawn from each stratum so that the sample includes individuals from each, often in a representative proportion. If the strata are homogeneous, but are different from each other, stratified sampling can reduce variability in results
Cluster Sample
A sampling design in which entire group, or “clusters”, are chosen at random. Cluster sampling is usually selected as a matter of convenience, practicality, or cost. Clusters are heterogeneous, and a random sample of cluster should be representative of the population.