5. Confidence Intervals Flashcards
confidence intervals for the mean: normally distributed known variance, normally distributed unknown variance, confidence intervals for a proportion
Confidence Intervals
Description
- confidence intervals provide parameter estimates where a range of values is provided instead of a single number
- they serve a similar purpose to estimators but instead of just one ‘plausible’ value of the parameter they determine a range of possible values
- this way the true parameter value falls inside the range with high probability
Confidence Interval
Definition
-let X1,…,Xn be generated from a model with an unknown parameter θ∈R
-consider an interval [U,V] where U=U(X1,…,Xn) and V=V(X1,…,Xn) are statistics
-the interval [U,V] is a confidence interval with confidence level α if:
P(θ∈[U,V]) ≥ 1-α
-for every possible value of θ
Confidence Interval Observations
Significance Level
- a confidence interval with confidence level 1-α is sometimes called a (1-α)-confidence interval
- as for tests, a typical value of α is 5% corresponding to 95%-confidence intervals
Confidence Interval Observations
θ
- the symbol θ in the definition of the confidence interval stands for a generic parameter
- this could be the mean μ, a proportion p, etc.
Confidence Interval Observations
Randomness
-in the equation:
P(θ∈[U,V]) ≥ 1-α
-the interval [U,V] is random since it depends on the random sample X1,…,Xn
-but the value θ is not
Confidence Interval Observations
Usefullness
-the usefulness of confidence intervals lies in the fact that the condition:
P(θ∈[U,V]) ≥ 1-α
-holds true for all values of θ simultaneously
-thus without even knowing the true value of θ we can be certain that the condition holds
-and for given data x1,…,xn, we can use [U(x1,..,xn),V(x1,..,xn)] as an interval estimate for θ
Confidence Intervals for the Mean
Normally Distributed Data, Known Variance Lemma
-let X1,…,Xn~N(μ,σ²) with known variance σ²
-define the interval:
[U,V] = [X~-(q_α/2σ/√n) , X~+(q_α/2σ/√n)]
-where α∈(0,1) and q_α/2 is the (1-α/2)-quantile of the standard normal distribution
-then [U,V] is the (1-α)-confidence interval for the mean μ
-X~ indicates X bar, the sample mean
Confidence Intervals for the Mean
Normally Distributed Data, Known Variance Lemma Proof
-prove that: P(μ<u>V) = α/2 -then the probability that μ falls outside of the interval [U,V] is α -thus: P(μ∈[U.V]) = 1 - α</u>
Confidence Intervals for the Mean
Normally Distributed Data, Unknown Variance Lemma
-let X1,…,Xn~N(μ,σ²) where the variance σ² is unknown
-define:
[U,V] = [X~-t_n-1(α/2)σx^/√n ,X~+t_n-1(α/2)σx^/√n]
-where α∈(0,1), t_n-1(α/2) is the (1-α/2)-quantile of the t(n-1)-distribution and σx^ is the sample standard deviation of x1,…,xn
-then [U,V] is a (1-α)-confidence interval for μ
Confidence Intervals for the Mean
Normally Distributed Data, Unknown Variance Lemma Proof
-similar to the proof for known variance, consider the two cases where the confidence interval can fail to cover the mean μ
-we end up with:
P(U≤μ≤V) = 1 - α
Confidence Intervals for the Mean Observations
Width
- width of the interval is proportional to 1/√n
- i.e. if we use more observations to construct a confidence interval, the resulting interval will be narrower
Confidence Intervals for the Mean Observations
Confidence Level
- the confidence level 1-α affects the width of the confidence interval
- the smaller α, i.e. the higher the confidence level, the wider the confidence interval gets
- while we can adjust α, there is a trade-off to be made here
- increasing α has the advantage of reducing the width of the confidence interval but it also increases the probability that the interval doesn’t contain the true value
Confidence Intervals for the Mean Observations
Standard Deviation
- if the data is spread out, σ and σx^ will be large
- the resulting confidence interval has width proportional to σ or σx^ respectively
Confidence Intervals for the Mean Observations
Variance
-all other things being equal, confidence intervals for unknown variance (in particular at small sample size) are wider than confidence intervals for known variance
Confidence Intervals for a Proportion
Description
- assume that we have observed attribute data x1,…,xn∈{1,…,K} and we want to obtain a confidence interval for the proportion p of individuals in the population which have class x=1
- to formalise this estimation problem, we introduce a statistical model:
- consider random variables X1,…,Xn∈{1,…,K} i.i.d. with P(Xi=k)=pk for all i∈(1,…,n} and all k∈{1,…,K}
- we know that we can use the proportion of observed values in a class k as a point estimate for pk