16 - Confidence Intervals Flashcards

1
Q

What is a confidence interval?

A
  1. range of feasible values for an unknown population parameter
    • µ (pop mean), p (pop proportion)
  2. statement conveying the confidence that the range of feasible values really does include the unknown population value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Because proportions are averages, the CLT implies…

A

a normal model for the sampling distribution of p^ if the sample size n is large enough:

p^ ~ N[p, p(1-p)/n]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

SE (p^)

A

sqr[p(1-p)/n]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

If we use the percentile of the normal distribution, z0.025, then

A

z0.025 = 1.9, and

P(-1.96 SE(p^) <= p^ - p <= 1.96 SE (p^)) = 0.95

p^ lies within 1.96 standard errors of p in 95% of samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

se(p^) =

A

sqr[p^(1-p^)/n]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

CI for p

The 100(1-a)% z-interval for p is the interval from…

A

p^-za/2sqr[p^(1-p^)/2] to p^+za/2sqr[p^(1-p^)/2]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Xbar =

xbar =

A

mean of a randomly chosen sample

mean of the observed sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

SE(Xbar) =

se(Xbar) =

A

σ/sqr(n)

s/sqr(n)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Student’s t-distribution

A

very similar to the normal distribution, but the t has fatter tails

incorporates excess variability

as sample size n gets larger, the t-distribution convergs to the standard normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

defining the t-distribution

A

any normal random variable, its Z-score:

(Xbar - µ)/(σ/sqr(n)) = Z ~ N(0, 1)

replace σ with s (sample SD), its Z-score:

(Xbar - µ)/(s/sqr(n)) = T ~ Tn-1

  • Tn-1 → a random variable with n-1 degrees of freedom
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Student’s t-distribution compensates for…

Exact sampling distribution of random variable Tn-1

A

substituting s for σ in the standard error.

The exact sampling distribution of the random variable Tn-1 = (Xbar - µ)/(S/sqr(n))

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

degrees of freedom

A

n-1; larger n = better estimate of a standard normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Degrees of freedom is necessary because…

A

mimics sample size,

there will be more variability in s for small sample sizes than for large sample sizes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Confidence interval for µ

The 100(1-a)% confidence t-interval for µ is

A

xbar - ta/2,n-1 s/sqr(n) to xbar + ta/2,n-1 s/sqr(n)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Interpreting CI’s

A

95% of intervals created according to this procedure are expected to contain μ.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

From the CLT, we know:

E(Xbar) =

Var(Xbar) =

SD(Xbar) = SE(Xbar) =

A

sample mean:

μ

σ2/n

σ/sqr(n)

17
Q

Manipulating CI’s

A

you can transform the ends of the CI to obtain a new CI for the transformed parameter

Ex. MPG → L/100km

Ex. probabilities → odds ratios

18
Q

_________ provides a 95% confidence interval for µ

A

Xbar +/- 1.96 σ/sqr(n)

or

Xbar +/- 2 σ/sqr(n)

19
Q

sample mean of 0/1 variables

A

sample proportion of 1’s

this means we can still apply CLT to sample proportion

20
Q

p =

p^ =

A

population proportion

sample proportion

21
Q

Sampling distribution of p^

A

p^ ~ N(p, p(1-p)/n)

22
Q

key assumptions of sampling distribution of p^

A
  1. we have an independent sample from the pop
  2. sample size is large enough for CLT to be applied
  3. sample size rule: np^ and n(1-p^) > 10
23
Q

Rule for the sample size

A

both np^ and n(1-p^) > 10

*#of successes and failures each have to be greater than 10*

24
Q

_______ provides a 95% confidence interval for the population proportion

A

p^ +/- 2se(p^) = p^ +/- 2sqr[p(1-p)/n]

but since we don’t know p, we replace it with p^:

p^ +/- 2sqr[p^(1-p^)/n]

25
Q

What is MoE?

A

Margin of Error is the distance from the center to the edge of the interval

MoE = 2 s/sqr(n) ⇒ critical value * standard error

Ex. if you want a 95% confidence level, MoE = 1.96 σ/sqr(n)

26
Q

Sample size formula for a population mean (µ) with 95% z-interval

A

n= (critical value * σ/MoE)2

27
Q

sample size for estimating a population proportion

A

n = (1/MoE)2

use only if:

  • you want a 95% CI
  • p (pop proportion) lies between 0.25 & 0.75
28
Q

Ex. a company has estimated the proportion of doctors who say they’ll prescribe a new drug as 30%. They used a sample size of 100, but didn’t provide a CI.

What was the MoE for a 95% CI?

A

n= (1/MoE)2 ⇔ MoE = 1/sqr(n)

MoE = 1/10 = 0.1

CI is approx (30% +/- 10% ) = (20%, 40%)