Confidence Interval Estimation Flashcards

1
Q

what is the central limit theorem?

A

—As n→∞, the distribution of the sample mean becomes Normal, with centre µ and standard deviation σ/√n.
—This happens regardless of the shape of the original population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

if X bar follows a Normal distribution then,

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is the size of n?

A

—If the distribution of X is normal, then for all n the sample mean will follow a normal distribution.
—If the distribution of X is VERY not normal, then we will need a large n for us to see the normality of the distribution of the sample mean.
—In all cases, as n gets larger, the distribution of the mean gets more normal.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what are two types of estimators used to estimate population parameters?

A

—Point estimate:

Interval estimate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Describe a point estimate

A

—a single value or point, i.e. sample mean = 4 is a point estimate of the population mean, µ.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

describe a confidence interval estimate

A

interval (range) created around the point estimate .

Includes the population parameter known
—E.g. We are 95% confidence that the unknown mean score lies between 56 and 78.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is the purpose of a confidence interval estimate?

A

indicates confidence of correctly estimating the value of the population parameter μ

allows you to say with specified confidence that μ is somewhere in the range of numbers defined by the interval

eg. 95% confident that the mean GPA at university is between 2.75 and 2.85

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what do we want our estimators to be?

A

unbiased and consistent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what does it mean when an estimator is unbiased?

A

expected value of the estimator is the parameter we are trying to estimate

i.e E(X bar) = U

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what does it mean when an estimator is consistent?

A

if the difference between the estimator and the parameter gets smaller as the sample gets larger.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what are ways to improve the estimator?

A
  1. take a larger sample size (increase n)
  2. —Smaller variance as n→∞, means that sample mean should be “closer” to true parameter
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

point estimate is good but we…

A

—would like some way to show how confident we are in our estimator.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

in practice, what is the problem with increasing sample size (Increasing n) for the purposes of improving the estimator?

A

— in practice, can’t keep increasing n indefinitely!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

show what a 95% confidence interval for μ looks like

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what does
—95% confidence interval for μ refer to?

A

in —repeated sampling, 95% of the intervals created this way would contain μ and 5% would not.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

we can change how confident we are by changing?

A

—are by changing the 1.96
—Use 1.645 to get a 90% confidence interval
—Use 2.33 to get a 98% confidence interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

what does this mean?

A

lIf the experiment were carried out multiple times, 95% of the intervals created in this way would contain μ.

lLCL: 0.78, UCL: 2.38

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

—In general, a 100(1-α)% confidence interval estimator for μ is given by

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

<!--StartFragment-->

What does 100(1-α)% mean<!--EndFragment-->

A

—If we want 95% confidence, α=0.05 (or 5%).
—If we want 90% confidence, α=0.10 (or 10%).
—If we want 99% confidence, α=0.01 (or 1%).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What does Zα/2 mean?

A

—We want to find the middle 100(1- α)% area of the standard normal curve:
—So the area left in each tail will be α/2.
—Zα/2 is the point which marks off area of α/2 in the tail

21
Q

<!--StartFragment-->

How can we change the width of the interval?<!--EndFragment-->

A

—1. Vary the sample size: as n gets bigger, the interval gets narrower.
—2. vary the confidence level: If we want to be more confident, then we simply change the 1.96 to another number from the standard normal, 2.33 will give 98% confidence, 2.575 will give 99% confidence; increasing confidence will make the interval wider.

22
Q

what are useful z values?

A
23
Q

what changes from sample to sample?

A

—the INTERVAL that changes from sample to sample.

24
Q

describe µ

A

—µ is a fixed and constant value. It is either within the interval or not.

25
Q

how should you interpret a 95% confidence interval?

A

In repeated sampling, 95% of such intervals created would contain the true population mean

i.e in repeated sampling, we would expect 95% of the intervals created this way to contain μ.

26
Q

describe the other application of CLT

A

before we gather data, we know that we want to get an average within a certain distance of the true population value.
We can use the CLT to find the minimum sample size required to meet this condition, if the standard deviation of the population is known

27
Q

we are estimating the length of my bus trip in the morning. Assume that the standard deviation of the trip length is 5 minutes. I want to estimate the true population mean length to within 3 minutes, with 99% certainty.

A

Step 1: set up the equation needed

P( |x̄ - μ| < 3 ) = 0.99

Step 2:

—Step 3: solve for n.
P( |Z| < 2.575) = 0.99

28
Q

develop the confidence interval estimate for u for a sample of n=25 boxes with mean 362.3g and standard deviation of 15

A

interval developed to estimate u is

362.3 +- (1.96)(15)/ (√25) or 362.3 +- 5.88

estimate of u is 356.42

29
Q

is the interval for u always correct?

A

no, u does not necessarily have to be included in the interval developed from a sample

30
Q

In practice, can you determine whether an interval estimate is correct?

A

no in practice, one sample is selected and the population mean is unknown and rarely known. Therefore, it cannot be determined whether an interval estimate is correct

31
Q

for a higher degree of confidence of including the population mean within the interval,

A

you might need higher confidence levels such as 99%

other situations lower confidence is accepted such as 90%

32
Q

what is the level of confidence symbolized by?

what is α representative of?

A

(1- α) x 100%

α is the proporation in the tails of the distribution that is outside the confidence interval. The proportion in the upper tail of the distrubition is α/2 and proportion in the l_ower tai_l of distribution is α/2

33
Q

the value of Za/2 needed for constructing a confidence interval is called

95% confidence corresponds to an a value of….The critical Z value….

A

the critical value for the distribution

95% confidence corresponds to an a value of 0.05. The critical Z value….correspondding to a cumulative area of 0.975 is 1.96 because there is 0.025 in the upper tail of the distribution

34
Q

level of confidence of 95% leads to z value of

A

1.96

35
Q

99% confidence corresonds to an a value of

The z value is approximately

A

0.01.

the z value is approximately 2.58 because the upper-tail area is 0.005 and the cumulative area less than Z= 2.58 is 0.995

36
Q

if the population of X does not follow a normal distribution, what ensures the X bar is approximately normally distributed when n is large?

A

The central limit theorem almost always ensures the xbar is approximately normally distributed when n is large

37
Q

when there is a small population size and population does not follow a normal distribution, the sampling distribution of x bar is not normally distributed, then

A

the confidence interval is inappropriate

38
Q

in practice, when can the confidence interval be used?

A

confidence interval can be used so long as the sample size is large enough and population is not very skewed to estimate the population mean when standard deviation is known

39
Q

what does central limit theorem suggest?

A

know that by the Central Limit Theorem, as n→∞, the distribution of the sample mean becomes Normal, with centre µ and standard deviation σ/√n

—This happens regardless of the shape of the original population.
—i.e. X̄ follows a Normal distribution with E(X̄) = µ

var(X̄) = σ2/√n

40
Q

relative frequency (also called sample proportion) is

A

—an average: SUM / n

—relative frequency of a category is the number of observations in that category divided by the total sample size.

41
Q

when does CLT apply?

A

when sample is large enough

42
Q

in particular case of binomial, what do we have?

A

—We have X, the number of successes in n trials.
—X/n = sample proportion of successes
—Can be used to estimate p, the true probability of success

43
Q

how large is large enough?

A

—If the distribution of X is normal, then for all n the sample mean will follow a normal distribution.
—If the distribution of X is VERY not normal, then we will need a large n for us to see the normality of the distribution of the sample mean.
—In all cases, as n gets larger, the distribution of the mean gets more normal.

—If we are dealing with a proportion, we would usually want n > 50, preferably larger than 100 before we rely on the Central Limit Theorem to tell us that p hat is Normally distributed.

44
Q

what can proportions be viewed as?

A

sample averages
—Which means the CLT will apply if the sample is large enough.

—In general, this approximation (approximate normality) is good for n>50, and very good for n>100.

45
Q

how do you estimate confidence interval for proportion?

A
46
Q

how do you interpret results from confidence interval estimation of proportion?

A

—That is, a 90% CI for proportion of cyclists is (0.6361, 0.6867). 90% of the intervals created in this way will contain the true proportion.

47
Q

how do you determine sample size for the mean?

A

e being sampling error eg. sampling error of no more or less than +- 5

48
Q

how do you determine sample size for proportion?

A
49
Q

where you have no prior knowledge or estimate for population proportion P̄, what should you do?

A

assume p̄ = 0.5

results to largest possible sample size