Confidence Intervals Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

Point Estimate

A
  • uses a single value to estimate a population parameter
  • doesnt express the uncertainty
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Interval estimate

A
  • uses a range of values to estimate a population parameter.
  • Confidence Interval
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What’s the equation for Confidence Interval?

A

= sample statistic (mean) ± margin of error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Main Components of Confidence Interval

A
  • sample statistic
  • margin of error
  • confidence level
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Sample Statistic

A

sample mean or sample proportion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Confidence level

A
  • The confidence level describes the likelihood that a particular sampling method will produce a confidence interval that includes the population parameter.
  • common confidence levels
    • 90%
    • 95% - most popular
    • 99%
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

margin of error

A

ME = z-score * SE (large smaple n>=30)
ME = t-score * SE (small sample n<30)

represents the maximum expected difference between a population parameter and a sample estimate.
- This range of values expresses the uncertainty in your estimate due to random sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Steps to Construct Confidence Interval

A
  1. Identify a sample statistic.
  2. Choose a confidence level.
  3. Find the margin of error.
  4. Calculate the interval.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why does data professional use confidence interval?

A

to help describe the uncertainty surrounding an estimate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Interpretation of the confidence interval

A

Technically, 95% confidence means that if you take repeated random samples from a population, and construct a confidence interval for each sample using the same method, you can expect that 95% of these intervals will capture the population mean. You can also expect that 5% of the total will not capture the population mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Incorrect interpretation of the confidence interval

A
  1. 95% refers to the probability that the population mean falls within the constructed interval. It’s not correct to say there is a 95% chance that your confidence interval captures the population mean because this implies that the population mean is variable. Intervals change from sample to sample, but the value of the population mean is constant
  2. 95% refers to the percentage of data values that fall within the interval
  3. 95% refers to the percentage of sample means that fall within the interval
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Z-scores

A

For large sample sizes, you use z-scores to calculate the margin of error.
This is because of the central limit theorem: for large sample sizes, the sample mean is approximately normally distributed.

For a standard normal distribution, also called amz-distribution, you usez-scores to make calculations about your data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

T-scores

A

For small sample sizes (n < 30), you need to use the t-distribution.

Statistically speaking, this is because there is more uncertainty involved in estimating the standard error for small sample sizes

But, the t-distribution has bigger tails than the standard normal distribution does. The bigger tails indicate the more outliers that come with a small dataset.

As the sample size increases, the t-distribution approaches the normal distribution.

When the sample size = 30, the distributions are practically the same, and you can use z-score for your calculations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

CI Step 1: Identify a sample statistic

A

IF your sample represents the average emissions rate for 15 engines. You’re working with a sample mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

CI Step 2: Choose a confidence level

A

common confidence levels
- 90%
- 95% ( most popular)
- 99%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

CI Step 3: Find the margin of error

A

large sample size n ≥ 30
ME = z-score * SE

small sample n < 30
ME = t-score * SE

17
Q

Standard Error for sample mean

A

SE = S/√n

S = std

18
Q

Standard Error for sample proportion

A

SE = √ [p (1-p) / n)]

19
Q

degree of freedom

A

The t-distribution is defined by a parameter called the degree of freedom.

dof = sample size - 1

20
Q

CI Step 4: Calculate the interval

A

sample statistic (mean/proportion) ± margin of error

for mean [12,30]
for proportion [14%, 20%]

21
Q

Relationship between confidence level and confidence interval

A

the confidencelevelgets higher, the confidenceintervalgets wider.