Module 5 - Manley Ch 6 - Generalizations Flashcards
What is a statistical inference?
Using specific observations as evidence for general claims about a larger group, or vice versa.
What is the difference between a statistical generalization and a statistical instantiation?
Generalization: Moves from a sample to conclusions about the whole population.
Instantiation: Moves from known facts about the population to conclusions about a sample.
Why is forming appropriate generalizations difficult?
It requires careful sampling, avoiding biases, and understanding probability, which is why the field of statistics exists.
When does a sample provide strong evidence for a hypothesis?
When the likelihood of observing the sample is much greater if the hypothesis is true than if it is false.
What factors weaken the reliability of a sample?
Small sample size
Sampling bias (e.g., non-random or convenience sampling)
What is sampling bias?
A selection effect where the method of choosing the sample skews the results, making them unrepresentative of the population.
How can experiments be designed to reduce sampling bias?
By ensuring the sample is random and representative of the population.
Why is a larger sample size important?
It increases the likelihood that the sample’s proportions closely resemble those of the whole population.
What is the law of large numbers?
The principle that larger samples tend to reflect the true distribution of the population more accurately.
Why do smaller samples often have more extreme proportions?
Because random variation has a larger effect on smaller groups, leading to greater deviations from the population average.
Why might small counties show extreme rates of a disease compared to larger ones?
Small samples are more prone to random fluctuations, leading to unusually high or low rates.
Why are the hospitals or schools with the “best” or “worst” rates often small ones?
Small sample sizes magnify the effects of random variation, making extreme outcomes more likely.
What should you consider when interpreting extreme results in small samples?
Whether the results could be explained by random variation rather than true differences.
How do you test the strength of evidence?
By comparing the likelihood of observing the evidence if the hypothesis is true versus if it is false.
Why is it important to design experiments where evidence strongly distinguishes between hypotheses?
To ensure that the observations are far more likely under one hypothesis than the other, making the evidence meaningful.
What is stratified sampling?
A method of sampling where the sample is divided into subgroups (strata) that match the population’s proportions for specific characteristics.
Why does sample size matter in statistical generalizations?
Larger samples are more likely to reflect the true characteristics of the population and reduce the margin of error.
What does a 95% confidence interval mean?
If the true population value lies outside the interval, the observed result would occur only 5% of the time.
Why is random sampling important?
It minimizes selection effects and ensures that the sample is representative of the population.
What are some common issues with non-random sampling methods?
Oversampling certain groups (e.g., cars on low-traffic roads).
Failing to account for relevant subgroups.
Bias from convenience sampling.