Week 7 - Sampling Distributions Flashcards
Sample descriptive statistics provide an estimate of what?
The population parameter
What is statistical inference?
Being able to infer and generalize from a sample to a larger population
Fixed means…
If you were to sample everyone it would be the same mean and SD
What is the difference between what sampling and population distributions show?
population: Spread
sampling: Variation
Population distributions: the spread of the variable of interest across individuals in the target population
Sampling distributions: show how the estimate of the population mean varies between samples if the experiment was repeated and the mean was recalculated each time
As sample size increases, what happens to the data?
The data becomes more normally distributed
How does the increase in the n value affect the distribution of the mean?
The larger n is, the closer the sample mean matches the population mean
What are the properties of sampling distribution?
- The mean of the sampling distribution is equal to the sampling and population mean
- SD of the sampling distribution is equal to the SD of distribution mean and the standard error of the mean
- The shape of he sampling distribution is a normal bell curve and as sample size increases, approximation to the normal distribution does too (even if the population distribution is not bell-shaped)
What is the central limit theorem? What n value is considered normally distributed for sampling distributions based on the mean?
When n => 30 for sampling distributions based on the mean, the data is considered normally distributed
As sample size increases, what occurs to the sampling distribution?
Variability in the mean measurements decreases; standard error (SE) decreases
What is the 95% confidence interval?
If we were to conduct the experiment 100 times, 95 times out of the 100, the value would fall within the 95% confidence interval range.
What are parametric vs non-parametric tests?
Parametric tests assume the data is normally distributed (t-tests, ANOVA) while non-parametric tests are data that do not follow a normal distribution.
What is the calculation to determine the 95% CI for a single population when the population SD is known?
Lower limit: mean - 1.96 (SE)
Upper limit: mean + 1.96 (SE)
What is the calculation to determine the 95% CI for a single population when the population SD is not known?
Xu = m + tu x SE
XL = m + tL x SE
Tlower and Tupper are standardized test scored found using the t-distribution table
What does the t-distribution look like?
The shape varies depending on sample size yet is still symmetric, bell-shaped, and centered on mean = 0.
The larger the df, the more the sample SD approximates the population SD, and the shape approximates the normal distribution.
How do you calculate the degrees of freedom for t-distribution?
df = n - 1