Lecture 6 - Confidence interval and hypothesis testing Flashcards

1
Q

What is a confidence interval?

A

• Statistics means never having to say you’re 100% certain
• Statements must be qualified as any data is imperfect
• In any research, all we have is the sample data
• But we want to know more
– We know what happened with the sample, but what about the wider population of interest?
• We use information from the sample statistic to make inference about the population parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Parameters vs Statistics

A

• Parameters are
– Numerical characteristics of a population
– Constant (fixed) at any one moment – Usually unknown
• Statistics
– Numerical summary of a sample
– Calculated from sample data (not constant) – Used to estimate a parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a confidence interval?

A

• Using sample data to make inference to the wider population has limitations
– After all you only have one sample and not all samples will be the same
• So, all you can do is to estimate
• The statistic calculated from the sample is a point estimate of the corresponding population parameter
– The sample average is a point estimate of the true population mean
– The sample proportion is a point estimate of the population proportion
• Interval Estimate
– Range of values which is likely to contain the population
parameter
• Confidence intervals are constructed to provide information about the precision of the estimate
– A confidence interval is an interval within which we can estimate, with some confidence, that the true population parameter lies
• The sample statistic is the sample mean ( x ) which is a point estimate of the true population mean (μ)
– Because the mean can vary from sample to sample, this estimate from one sample does not necessarily equal the true mean
• Confidence Intervals provide a measure of the precision of the estimate of the mean from one sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the confidence interval based on?

A

• Point estimate
• Desired level of confidence
– Common CIs are 90%, 95% and 99%
• Sampling variability or the standard error of the point estimate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Interpretation of confidence intervals

A

• Correct interpretation
– If we were to select 100 samples from the population and use these samples to calculate 100 different confidence intervals for μ, approximately 95 of the intervals (95%) would include the true population mean
• Incorrect interpretation
– The probability that the true mean (μ) is in the interval = 0.95

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Confidence intervals of the mean using Z-scores

A
  • Z-scores are appropriate confidence coefficients for a confidence interval of the mean when the population standard deviation (σ) is KNOWN
  • However, most of the time when the population mean is being estimated from sample data the population variance is unknown and must also be estimated from sample data
  • The sample standard deviation (s) provides an estimate of the population standard deviation (σ)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Confidence intervals when the population variance is unknown

A
  • The sampling distribution of the sample mean is a t-distribution with n-1 degrees of freedom instead of a normal distribution
  • Instead of using the standard normal distribution to find confidence coefficients, the t-distribution with n-1 df is used to find the confidence coefficients which are called t- coefficients or t-critical values
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Hypothesis testing

A

• Method of making decisions using data
• Two Hypotheses:
– Null: Hypothesis of no difference
– Alternative: researcher’s hypothesis
• Analysis of the data using statistical tools
• Use of the significance level (α) for the decision rule
• Decision: REJECT or FAIL to reject (No sufficient evidence to reject) the Null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Steps in hypothesis testing

A

• State the research question
• Specify the null and alternative hypothesis
– In hypothesis testing, we assumed null hypothesis is true and try to find evidence to support the alternative
– Difficult to be absolutely confident something works
• If I cant prove, it doesn’t work, the logical alternative is, it must work
– E.g.: guilty vs not guilty
• Decide on the significance level
– a statistical assessment of whether observations reflect a pattern rather than mere chance
– Think about type I and II error
• Calculate test statistic – Z, t or other statistic
• Decide whether to reject or fail to reject the null hypothesis (use of p value)
• State the conclusion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly