Targets 3A-3G Flashcards
Population
The larger group we hope to learn something about
Census
Collect data from all members of a population
Sample
Subset of the population that actually gets examined or measured
Inference
Drawing a conclusion about a population based on data from a sample
Sampling Variability
We don’t expect the statistic from a sample to be the same as if we calculated it from the population; each sample produces a different result.
Random Sampling
typically produces a sample that is representative of the population
allows us to use the laws of probability to construct a margin of error
Larger ones give better information about a population than smaller ones
Sample survey
A study that uses an organized plan to choose a sample that represents some specific population
Good sample
representative of the population and will provide a good estimate of the value of interest
Biased sample
Systematically over or under represents a portion of the population and would consistently overestimate or underestimate the value of interest
Random Sampling Process
Use a chance process to determine which members of a population are included in the sample
Sampling without replacement: a member of the population can only be selected once
Sampling with replacement: a member of the population can be selected more than once
Sampling frame
the list of all the potential subjects in a population
Chance processes
flipping a coin, rolling dice, drawing names out of a well-mixed hat, random number generator, random digit table
Random Sampling Benefits
is the number 1 way to fight bias. It avoids favoritism by the sampler and self-selection by the respondents.
Simple Random Sampling (SRS) Characteristics
Every group of n individuals in the population has an equal chance of being selected for the sample.
Equivalent to throwing all members of a population into a hat, mixing them up, and then drawing without replacement
NOT the only legitimate method of random sampling
Simple Random Sampling (SRS) Process
Choosing a sample of size n from a population of size N
- Assign random numbers between 1 and N to each member of the population
- Use randomness (calculator or table) to select n unique numbers. Be sure to state: ignore unused numbers; ignore repeats (for sampling without replacement)
- The subjects corresponding to the chosen numbers are in the sample.
Stratified Random Sampling
Split the sampling frame into homogeneous groups, then pick a random sample from each group.
Benefit :1 Reduces variability in the sample statistic (Field of Dreams Lab)
Benefit 2: Guarantees that all subgroups are represented in the sample; allows for subgroup comparisons
Cluster Random Sampling
Split the sampling frame into heterogeneous clusters (members that are in close proximity to each
other), then randomly choose several clusters.
Sample ALL subjects in the randomly chosen clusters.
Benefit 1: Can make sampling less expensive by reducing travel time
Systematic Random Sampling
Begin with a randomly selected individual; every nth!” member after that will be in the sample.
Larger values of n generate smaller samples. Smaller values of n generate larger samples.
Benefit: Can be easier to use than an SRS especially if the population is lined up (or listed) already
If we want a sample of size 3, we’ll choose every 5th member of the population. (15 ÷ 3 = 5)
To begin, we choose a random number between 1 and 5. Our random number was 4, so the 4th
member is the first one for the sample. We also include every 5th member after that.
Biased Sample
Systematically over or under represents a portion of the population and would consistently overestimate or underestimate the value of interest.
Convenience Sample
select individuals from the population who are easy to reach
Example: A farmer brings a juice company several crates of oranges each week. A company inspector looks at
10 oranges from the top of each crate before deciding whether to buy all the oranges.
Voluntary Response Bias
happens when the sample is made up of subjects who chose to participate
Example: In 2016, Britain’s Natural Environment Research Council used an online poll to choose the name of
its new $300 million ship. The winning name “Boaty McBoatface” received 124,000 votes, far more than more
serious candidates “Shackleton”, “Endeavor”, and “Falcon”.
Undercoverage Bias
happens when a subgroup of the population has little to no chance of being chosen
Example: In 1936 legitimate polling organizations such as Gallup were just getting started. That same year the
magazine Literary Digest, used the phone book to send out a presidential poll. 2.4 million out of 10 million
mailed ballots were returned. 57% said they would vote for Alf Landon (Rep) and 43% said they would vote
for Franklin Roosevelt (Dem). In the actual election, Landon only won electoral votes from two states.