16 - Confidence Intervals Flashcards
What is a confidence interval?
- range of feasible values for an unknown population parameter
- µ (pop mean), p (pop proportion)
- statement conveying the confidence that the range of feasible values really does include the unknown population value
Because proportions are averages, the CLT implies…
a normal model for the sampling distribution of p^ if the sample size n is large enough:
p^ ~ N[p, p(1-p)/n]
SE(p^) =
sqrt[p(1-p)/n]
If we use the upper 2.5% point of the standard normal distribution, z0.025, then
z0.025 = 1.96, and
P(-1.96 SE(p^) <= p^ - p <= 1.96 SE(p^)) = 0.95
p^ lies within 1.96 standard errors of p in 95% of samples
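The 1.96 figure can be checked numerically. The following is a minimal sketch using only Python's standard-library statistics.NormalDist; the variable names are illustrative and not part of the original cards.

```python
from statistics import NormalDist

# Standard normal distribution: mean 0, standard deviation 1.
std_normal = NormalDist(mu=0.0, sigma=1.0)

# Upper 2.5% point: the value with 97.5% of the distribution below it.
z_025 = std_normal.inv_cdf(0.975)

# Probability that a standard normal value lies within +/- 1.96.
central_prob = std_normal.cdf(1.96) - std_normal.cdf(-1.96)

print(round(z_025, 2))         # 1.96
print(round(central_prob, 2))  # 0.95
```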
se(p^) =
sqrt[p^(1-p^)/n]
CI for p
The 100(1-a)% z-interval for p is the interval from…
p^ - za/2 sqrt[p^(1-p^)/n] to p^ + za/2 sqrt[p^(1-p^)/n]
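A minimal sketch of the z-interval for p in Python, using only the standard library. The function name z_interval_for_proportion and the counts (140 successes in 350 trials) are made up for illustration; the sketch assumes n is large enough for the normal model to be reasonable.

```python
from math import sqrt
from statistics import NormalDist


def z_interval_for_proportion(successes, n, confidence=0.95):
    """Return (lower, upper) for the interval p^ +/- z_{a/2} * se(p^)."""
    if n <= 0 or not 0 <= successes <= n:
        raise ValueError("need n > 0 and 0 <= successes <= n")
    p_hat = successes / n
    alpha = 1.0 - confidence
    # z_{a/2}: the point with probability a/2 in the upper tail.
    z = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    # Estimated standard error: p^ stands in for the unknown p.
    se = sqrt(p_hat * (1.0 - p_hat) / n)
    return p_hat - z * se, p_hat + z * se


# Hypothetical data: 140 successes in 350 trials, so p^ = 0.4.
lower, upper = z_interval_for_proportion(140, 350)
print(f"95% CI for p: {lower:.3f} to {upper:.3f}")
```

Raising the confidence argument (for example to 0.99) gives a larger z and therefore a wider interval.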
Xbar =
mean of a randomly chosen sample
xbar =
mean of the observed sample
SE(Xbar) =
σ/sqrt(n)
se(Xbar) =
s/sqrt(n)
Student’s t-distribution
very similar to the normal distribution, but the t has fatter tails
incorporates excess variability
as sample size n gets larger, the t-distribution converges to the standard normal distribution
defining the t-distribution
for the mean of a sample from a normal population, the Z-score is:
(Xbar - µ)/(σ/sqrt(n)) = Z ~ N(0, 1)
replace σ with s (sample SD) to get the t-statistic:
(Xbar - µ)/(s/sqrt(n)) = T ~ Tn-1
- Tn-1 → a Student's t random variable with n-1 degrees of freedom
Student’s t-distribution compensates for…
substituting s for σ in the standard error.
Exact sampling distribution of random variable Tn-1…
When the population is normal, Student’s t with n-1 degrees of freedom is the exact sampling distribution of the random variable Tn-1 = (Xbar - µ)/(S/sqrt(n))
degrees of freedom
n-1; the larger n is, the closer the t-distribution is to the standard normal distribution
Degrees of freedom are necessary because…
they track sample size:
s varies more in small samples than in large samples
Confidence interval for µ
The 100(1-a)% confidence t-interval for µ is
xbar - ta/2,n-1 s/sqrt(n) to xbar + ta/2,n-1 s/sqrt(n)
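A minimal sketch of the t-interval for µ in Python. The standard library has no t quantile function, so the critical value is passed in as an argument; the value 2.262 used here is t0.025,9 as read from a standard t-table. The function name t_interval_for_mean and the ten data values are made up for illustration.

```python
from math import sqrt
from statistics import mean, stdev


def t_interval_for_mean(sample, t_crit):
    """Return (lower, upper) for xbar +/- t_crit * s / sqrt(n).

    t_crit must be t_{a/2, n-1} for the chosen confidence level,
    looked up by the caller (for example in a t-table).
    """
    n = len(sample)
    if n < 2:
        raise ValueError("need at least two observations to compute s")
    xbar = mean(sample)
    s = stdev(sample)  # sample SD, which divides by n-1
    margin = t_crit * s / sqrt(n)
    return xbar - margin, xbar + margin


# Hypothetical sample with n = 10, so degrees of freedom = 9.
sample = [12.1, 11.8, 12.4, 12.0, 11.9, 12.3, 12.2, 11.7, 12.5, 12.1]
lower, upper = t_interval_for_mean(sample, t_crit=2.262)
print(f"95% CI for mu: {lower:.3f} to {upper:.3f}")
```

Using 1.96 in place of 2.262 would give a narrower interval that ignores the extra variability from estimating σ with s.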
Interpreting CIs
95% of intervals created according to this procedure are expected to contain μ.
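The "95% of intervals" statement can be illustrated by simulation. The following is a minimal sketch: it draws many samples from a normal population with a known µ, builds a 95% t-interval from each, and counts how often µ falls inside. The population values (µ = 50, σ = 10), the sample size, the number of repetitions, and the seed are arbitrary assumptions; 2.262 is t0.025,9 from a standard t-table, so the sample size is fixed at 10.

```python
import random
from math import sqrt
from statistics import mean, stdev

N = 10              # sample size; must match the degrees of freedom below
T_CRIT_DF9 = 2.262  # t_{0.025, 9} from a t-table (valid only for N = 10)


def coverage_rate(mu=50.0, sigma=10.0, reps=4000, seed=1):
    """Fraction of simulated 95% t-intervals that contain the true mu."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(reps):
        sample = [rng.gauss(mu, sigma) for _ in range(N)]
        xbar = mean(sample)
        margin = T_CRIT_DF9 * stdev(sample) / sqrt(N)
        if xbar - margin <= mu <= xbar + margin:
            hits += 1
    return hits / reps


rate = coverage_rate()
print(f"fraction of intervals containing mu: {rate:.3f}")
```

The printed fraction should land close to 0.95. Any single interval either contains µ or does not; the 95% describes the procedure over repeated samples.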