Statistical Reports Flashcards
What is sampling error?
Variation in survey percentages due to sampling
What is the Margin of Error?
A number used to give an indication of the amount of uncertainty due to sampling error
Why do we use the Margin of Error?
It’s a way for accounting for the variation in survey percentages due to sampling
What is the “rule of thumb” method for calculating the Margin of Error?
MoE = 1 / (√n)
where n = sample size
x 100% to convert into a percentage
When can you apply the “rule of thumb” method?
When you are calculating the MoE of a survey percentage that is between 30% and 70%
(note: sometimes if they ask you to calculate the MoE for a whole survey then you’ll use the rule of thumb but then only use that number for survey percentages between 30% and 70%)
How can you use the Margin of Error to calculate a Confidence Interval for a single group?
X% +/- MoE
e.g. 45% with an MoE of 5%
45% +/- 5%
Confidence interval (40%, 50%)
How can you interpret a confidence interval of a single group? e.g. (40%,50%)
- If there is a claim being made, base your interpretation of that e.g. if it says over 45% of NZers watch cricket, then you would say “because the confidence interval contains 45%, then we cannot make the claim that over 45% of NZers watch cricket”
- If there is no claim being made, make a statement like “With 95% confidence, I estimate that the percentage of New Zealanders who watch cricket is somewhere
between 40% and 50%”
How do you calculate the MoE of the difference between two groups when the two groups are from the same survey?
MoE (difference) = 2 x MoE
How do you calculate the MoE of the difference between two groups when the two groups are from different surveys?
MoE (difference) = 1.5 x ((MoE1 + MoE2) / 2)
((MoE1 + MoE2) / 2) is the average MoE
How do you construct a confidence interval when the two groups are from two different surveys?
(X2-X1) +/- MoE(difference)
e.g. if two groups are 23% and 28% and the MoE(difference) is 5%:
(29%-23%) +/- 5%
6% +/- 5%
(1% , 11%)
How do you interpret a confidence interval when the two groups are from two different surveys?
If the confidence interval contains 0, then we cannot make the claim that the two groups are different
If the confidence interval does NOT contain 0, then we make the claim (with a 95% confidence) that the two groups are different
What is non-sampling error?
The error that arises in a data collection process as a result of factors other than taking a sample.
When thinking about non-sampling error, what kind of things can you think about?
- Who did they sample? Is the sample representative of the population that the claim is being made about?
- How did they sample? e.g. Telephone, Cell Phone, Email?
- When did they sample?
What is a self-selecting sample? What is wrong with it?
When the sampling process allows individuals to select themselves. Individuals with strong opinions about the survey issues or those with substantial knowledge will tend to be over-represented, creating bias
What is an “under representative sample”? What is wrong with it?
When the sampling process is such that a specific group is excluded or under-represented in the sample, deliberately or inadvertently. If the excluded or under-represented group is different, with respect to survey issues, then bias will occur.