module 6 Flashcards
population parameters
- attributes of the statistical population
- the values are fixed
- labeled using the greek alphabet, such as μ for the mean or σ for the standard deviation
what sit called when descriptive statistics of a sample become the population parameters of the statistical population.
estimation
sampling distribution
- distribution of some descriptive stat that would emerge if one were to repeatedly draw samples from the stat pop
What effect does sample size have on the characteristics of the sampling distribution
Shape independence in the sampling distribution only holds if the sample size is sufficiently large. If your statistical population is already symmetrical and roughly bell-shaped, then sample sizes of n=5 are enough to ensure that the shape of the sampling distribution is independent of the statistical population.
What are the characteristics of a sampling distribution?
- The shape of a sampling distribution is independent of the statistical population so long as the sample size is sufficiently large.
- The variance of a sampling distribution increases as the number of sampling units in the sample decreases. As the size of a sample increases, the variance of the sampling distribution decreases.
Central limit theorem
allows us to calculate the mean and standard deviation of the sampling distribution without the need to construct it.
What are the two key characterisicts of sampling distributions?
- that a sampling distribution has a bell-shape that is independent of the statistical population.
- that the variance of a sampling distribution decreases as the sample size increases.
Standard error
is the standard deviation of the sampling distribution.
Student’s t-distribution
is a probability distribution used to describe the sampling distribution when the parameters of the statistical population are estimated from a sample.
Standard error
is the standard deviation of the sampling distribution. SE=σ/√n
What does this equation mean?
SE=σ/√n
The central limit theorem adds that the standard error (SE) is the standard deviation of the statistical population divided by the square-root of the sample size.
What is a chain of inference?
is that a single sample from a statistical population is enough for us to estimate the sampling distribution.
What is is the problem with central limit theorum?
The problem is that we do not know the value of the standard deviation of the statistical population in practice. Rather, we need to estimate it from our sample. This reality means we have greater uncertainty in the sampling distribution than would be suggested by the Normal distribution.
T-distribution
is a probability distribution used to describe the sampling distribution when the parameters of the statistical population are estimated from a sample.
Confidence intervals
The range over a sampling distribution that brackets the centre-most probability of interest. They are a statement of what future samples will look like. Confidence intervals are derived from sampling distributions, and they describe the range over the x-axis that brackets a specific probability of finding a value from a new sample.