Confidence intervals Flashcards

Module 12: Handout 23, 24, 25

1
Q

Why do we even need the sample proportion?

A

We need it because we might not know the data for the population proportion. So, we must have an unbiased estimator sample proportion to best estimate the population proportion.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a confidence interval? (in easy terms)

A

It tells us how close the sample proportion is to the population proportion when we do not know the population proportion.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Confidence interval definition

A

Simply, a range of values over which a population statistic occurs with a certain probability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the first thing you need to construct a confidence interval?

A

A confidence level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a confidence level?

A

The percentage by which we are confident that the confidence interval will contain the population proportion.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is another word for population?

A

true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Confidence level example

A

If i choose a 90% confidence level, then 90% of the time our confidence interval will contain the true proportion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Confidence interval steps

A

1) Choose confidence level
2) Know associated critical z-score
3) calculate the margin of error (E)
4) Confidence interval is then the interval ((p hat) minus (E), (p hat) plus (E))

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Finding critical z-score steps

A

1) Take the complement of the confidence level aka the alpha (a)
1. 1) complement of 90% is 10%, a=10%
2) Divide alpha in half (a/2 = 10%/2 = 5%)
3) Take complement of a/2 (Complement of 5% is 95%)
4) Use calc invnorm and calculate (invnorm(0.95))=1.645

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is alpha?

A

The complement percentage of the confidence level, such as how the complement of 90% is 10%, so 10% = alpha

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Invnorm on calc

A

“2nd” “vars” “3” aka invnorm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Margin of error formula

A

E = (associated critical z-score) times (square root ((p hat times q hat)/(n))

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

(A1) Example question about students emailing teacher

A

A teacher wants to know the # of his students that send him at least 1 email. He picks 40 of his students at random and checks to see how many of the 40 sent him an email. Suppose 14 have sent him an email.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

(A2) What is the sample proportion?

A

Sample proportion or (p hat) equals 14/40 which equals 0.35 or 35%, since 14 students out of the 40 students sent him an email

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

(A3) What confidence level does the teacher choose?

A

He chooses 90% confidence level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

(A4) What is the associated critical z-score if the confidence level is 90%?

A

The complement of 90% is 10%. Now divide 10% in half = 5%. The complement of 5% is 95%. So then use calc invnorm(0.95) and the answer is 1.645

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

(A5) What is q hat?

A

It is the complement of the sample proportion. So the complement of 35% is 65%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

(A6) Now what is the margin of error aka E?

A

I multiply (1.645) and the square root of ((0.35 times 0.65) divided by (40)) = 0.124 = E

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

(A7) What is the confidence interval?

A

[{(p hat) minus (E)} comma {(p hat) plus (E)}] which is (0.35-0.124, 0.35+0.124) = (0.226, 0.474) or 0.226 < p < 0.474

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

(A8) What is the answer?

A

The teacher can be 90% confident that the percentage of his students (the population) sending him an email is on the interval from 22.6% to 47.4%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

(A9) Can the teacher conclude that 20% of his students are sending him an email?

A

Yes because the interval for the sample proportion is entirely above 20%.

22
Q

(B1) Example question about the proportion of trees that have a disease.

A

A biologist takes a sample of 250 trees and discovers 42 have a disease. What is the 99% confidence interval for the proportions of trees having a disease?

23
Q

(B2) Identify variables, (p hat), (critical z score), (q hat), (n)

A
Sample proportion aka (p hat)=42/250=0.168
Critical z score=2.575
Complement of (p hat)=(q hat)=0.832
n=250
24
Q

(B3) Finding critical z score aka z_(a/2)

A

1) 99% confidence interval
2) Complement of 99% is 1%
3) 1% divided by 2 = 0.5%
4) Complement of 0.5% = 95.5%
5) Use chart or invnorm to find the number for 0.955
6) z_(a/2)=2.575

25
Q

(B4) Find the margin of error E

A

E = 2.575 (square root ((0.168 times 0.832) divided by 250)) = 0.609

26
Q

(B5) Finding the confidence interval (p hat minus E, p hat plus E)

A

(0. 168 - 0.061, 0.168 + 0.061) = (0.107, 0.229) or

0. 107 < p < 0.229

27
Q

Researchers need an appropriate amount of data (n) or sample size in order for the margin of error to be smaller

A

Since a smaller amount of data means that E (standard error) will be bigger and the confidence interval bigger

28
Q

Formula that solves for the required sample size

A

n = ([critical z-score]^2 (p hat) (q hat)) / E^2

29
Q

(C1) Example problem solving for sample size, publisher, and foreign online orders

A

A publisher wants to know what proportion on average his online orders are from overseas. He wants a margin of error of less than 5%. How big of a sample do they have to look at?

30
Q

(C2) Define variables E and z_a/2

A

E=0.05

z_a/2=1.96

31
Q

(C3) Finding z_a/2

A

1) 95% complement
2) 5% / 2
3) 2.5% complement
4) 97.5%
5) 0.975
6) z_a/2= 1.9+0.06 = 1.96

32
Q

(C4) For p hat and q hat

A

Use 0.5 since it “assumes the worst” so (p hat=0.5) and (q hat = 0.5)

33
Q

(C5) Using sample size formula

A

n = [(1.96^2)(0.5)(0.5)]/(0.05^2)=384.16=385

34
Q

(C6) Answer in sentence form

A

The publisher needs to examine at least 384.16 orders or 385 orders to be certain that E will be no more than 0.05 or 5%

35
Q

When the population standard deviation is not known, the sample standard deviation is used to create a confidence interval

A

And the distribution is not a normal distribution, but instead, an alternate distribution called the “Student’s t” or “t distribution”

36
Q

What is the difference between the normal distribution and “t distribution”?

A

Both look similar in terms of shape since it is a hill, BUT are not. The tails of the “t distribution” is wider aka the tails are a little higher from the x line as the sample size gets smaller

37
Q

What is t_a/2?

A

t-distribution critical values

38
Q

What is a t-table?

A

The table has critical values for 90%, 95%, 98%, and 99% confidence intervals for selected values of n

39
Q

How to find critical values on a t-table?

A

Find the degrees of freedom aka n-1

40
Q

Whats the point of degrees of freedom?

A

a good way to think about degrees of freedom is that one of the data values gets “used up” in a mathematical
sense when the standard deviation is calculated so that effectively there are only n – 1 data values left.

41
Q

(D1) Example problem to find t_a/2

A

1) A confidence level of 90% is chosen
2) a=10% and a/2=5%
3) On a graph, the distribution will look like a hill
4) 90% is closest to the mean, so there will be 5% on both tails
5) The t_a/2 is for the 95th percentile since the 5% on the left side + 90% middle = 95% or 95th percentile

42
Q

(D2) Margin of error formula

A

E = (t_a/2) times (s/square root(n))

43
Q

(D3) Confidence intervals for proportions can be expressed in either two ways

A

1) An inequality: (x bar minus E < Myu < x bar plus E)

2) An interval: (x bar-E, x bar+E)

44
Q

(E1) Example problem, worm length and population mean

A

A researcher has a random sample of 41 worms and calculates mean length as 62.4 mm, SD is 14.7. Construct 90% confidence interval for the population mean

45
Q

(E2) What is critical value t_a/2

A

On t-table for 90% confidence interval, the area in each tail of distribution is 5% or in total 10%.
So critical value is 1.684 ofc with degrees of freedom in mind

46
Q

(E3) What is the degrees of freedom?

A

(n - 1) is (41 - 1 = 40)

DOF is 40

47
Q

(E4) What is margin of error E

A

E = t_(a/2) times ([standard deviation] divided by [square root of n])

E=1.684 (14.7/square root of 41) = 3.866 or 3.9

E=3.9

48
Q

(E5) Put into interval expression

A

(x bar minus E, x bar plus E)

62. 4-3.9=58.5, 62.4+3.9=66.3
(58. 5, 66.3)

49
Q

(E5.5) Put into inequality form

A

58.5 < myu < 66.3

aka myu is mean

50
Q

(E6) Answer in sentence form

A

The mean length for population of worms is 58.5 and 66.3 with 90% confidence