4a: 3.1-3.4 Sampling Flashcards
In Stats, what does random mean?
Equally likely to be chosen
What is the main goal of taking a sample?
Getting a group that represents the whole population well
What does it mean for a sample to be biased?
Any data gathered from the sample will not accurately represent the whole population. If you repeated the method many times, you would consistently either over or underestimate the real proportion/mean.
What is a simple random sample (SRS)?
Every possible group of n individuals is equally likely to be chosen.
Both split the population into groups first and then use some kind of random process, so what is the difference between stratified and cluster samples?
Stratified is when you take some from each group. Cluster is when you randomly pick some groups and then take everyone from those groups. Ideally, stratified is used when the groups are different from each other (you want to make sure to get some old people and some young people in your sample).
What is voluntary response and why/how does it produce bias?
Rather than selecting people using a random process, you let people choose to respond by calling or going to a website or something. Generally, only people who are especially passionate (and usually in the same direction) will take the time to respond.
What do you need to include in your answer when describing bias?
- Give a possible reason for how the sample or experiment does not represent the population. 2. State whether that statistic obtained will be an over or underestimate of the true value
How would you take a stratified sample in the cafeteria?
Randomly select 2 (or however many you need) students from each table
How would you take a cluster sample in the cafeteria?
Randomly select 2 (or however many you need) tables and sample all the students at those tables
Bias: how do you tell if a sample will be an under or overestimate?
Think: what “number” will I get from this sample? Will it be higher than the real number or lower? Example 1: I think that the percentage of my sample of students in detention will agree that the tardy policy is fair will be smaller than the real percentage of all students at the school. So this sample will underestimate the real percentage. Example 2: If I sample heights during break, the average I will get will clearly be a bigger number than the real average of people in the building. So, this sample would be an overestimate of the real height.