Module 10.2: Confidence Intervals and t-Distribution Flashcards
What are point estimates?
single values used to estimate population parameters. popular point estimate is the mean of a sample.
What is a confidence interval?
range of values in which the population parameter is expected to lie.
What is Student’s t-distribution?
bell-shaped probability distribution that is symmetrical about its mean. it is the appropriate distribution to use when constructing confidence intervals based on small sample from populations with unkown variance and a normal distribution.
What are the properties of a Student’s t-distribution?
1) It is symmetrical
2) defined by a single parameter, the degrees of freedom (df) where the degrees of freedom are equal to the number of sample observations minus 1.
3) Has more probability in the tails than the normal distribution.
4) As the degrees of freedom gets larger, the shape of the t-distribution more closely approaches a standard normal distribution.
How does a t-distribution compare to a normal distribution?
flatter with more area under the tails. as the degrees of freedom in a t-distribution increase, its shape approaches that of a normal distribution.
when is it easier to reject the null, when using t-distribution or normal distribution (z-distribution)?
when using t-distribution because there’s more area under the tails so higher chance of observations away from the center of the distribution.
What is a confidence interval? also explain the level of significance.
estimates result in a range of values within which the actual value of a parameter will lie, given the probability of 1-a. here a is called the level of significance for the confidence interval. 1-a = degree of confidence.
Formula for confidence intervals = point estimate +/- (reliability factor x standard error)
What is the reliability factor?
The probability that the point estiamte falls in the confidence interval, (1-a)
How is a confidence interval calculated in a normal distribution with a known variance?
point estimate (mean) +/- Za/2 (reliability factor) * standard error
What are the most commonly used standard normal distribution reliability factors?
1) za/2 = 1.645 for 90% confidence intervals (level of sig is 10%, 5% each tail).
2) za/2 = 1.960 for 95% confidence intervals (the sig level is 5%, 2.5% in each tail)
3) za/2 = 2.575 for 99% co nfidence intervals (sig level is 1%, .5% in each tail)
How do you calculate confidence intervals with a normal population but unkown variance?
point estimate (mean) +/- ta/2 (standard error).
What are the steps to pull the t-values from the t-table?
1) calculate degrees of freedom which is t-1.
2) find the appropriate level of alpha or significance. a/2.
3) look up the chart.
How do you construct a confidence interval for a distribution that is nonnormal? Can it be achieved?
1) if the distribution is nonnormal, but the population variance is known, the z-statistic can be used as long as the sample size is greater than 30.
2) if the distribution is nonormal and the pop variance is unkown, the t-statistic can be used as long as population size > 30.
What happens if the sampling of the population is not random?
the central limit theorem doesn’t apply, our estimates won’t have the desirable properties, and we can’t form unbiased confidence intervals.
What are the two downsides to “larger is better” when it comes to selecting an appropriate sample size?
1) Risk of selecting observations from a different population.
2) cost - larger sample size is more expensive.