unit 10: inference for proportions Flashcards
For what values of n and p does the normal approximation to the distribution of pˆ work best? For
what values of n and p is the normal approximation very poor?
The normal approximation is best when n is large and p = 0.5. The normal approximation is worst
when n is small and p is close to 0 or 1.
In words, what is the meaning of SE(ˆp1 − pˆ2)?
- SE(ˆp1 − pˆ2) is the standard error of the difference in sample proportions, which is an estimate of
the standard deviation of the sampling distribution of pˆ1 − pˆ2. - It is a measure of the variability in pˆ1 − pˆ2 if samples were to be repeatedly drawn from the populations.
What is the difference in meaning of the symbols pˆ and p?
- pˆ is the sample proportion, whereas p represents the true proportion for the entire population.
- Once the sample is drawn, we will know the value of pˆ, but p is typically an unknown value that we are trying to estimate.
Would it ever make sense to test H0: pˆ = 0.25?
No, this would never make sense. We test hypotheses about parameters, and not about statistics
pˆ is an unbiased estimator of p.
True, since E(ˆp) = p.
When n < 30 we should use the t distribution when calculating confidence intervals for p
False.
The t distribution never arises in inference for proportions.
The true distribution of pˆ is based on the binomial distribution
True. pˆ =X/n where X has a
binomial distribution with parameters n and p
The true standard deviation of the sampling distribution of pˆ depends on the value of p.
True.
The sampling distribution of pˆ is perfectly normal for large sample sizes.
False. It’s an approximation, but it can be a very good approximation.
The normal approximation to the sampling distribution of pˆ works best when we have a large
sample size and p = 0.5.
true
The sampling distribution of pˆ becomes more normal as p tends to 1.
false. the sampling list becomes strongly skull as p approaches 1
The sampling distribution of p is approximately normal for large sample sizes
False. p is a
parameter, not a statistic, and as such does not have a sampling distribution.
All else being equal, the value of SE(ˆp) decreases as the sample size increases.
true
In repeated sampling, exactly 95% of 95% confidence intervals for p will capture p.
False, since we are using a normal approximation and not an exact procedure. (But if the sample size is very large then the true percentage will be very close to 95%.)