Mod 5 Flashcards
Chapter 8 & 9
What is sampling variability?
the observed value of a statistic varies from sample to sample depending on the particular sample selected
- statistic is a random variable as the value varies from sample to sample
What is sampling distribution?
the distribution or collection of all possible values for a statistic for repeated samples of the same size
- how the statistics vary under repeated sampling
- the statistic has a distribution because values differ from sample to sample (if you take a different sample, you’ll get a diff value for that statistic)
Does a statistic vary?
Because the sample statistic is a variable, it varies from sample to sample (if you take different sample, you’ll compute a different statistic or p^)
What a sampling distribution describe?
- the sample-to-sample variability of a statistic
- it shows all possible sample statistics that we could obtain from samples of the same size of a population
What is the standard deviation of the sampling distribution of a statistic?
measures how values of the sample statistic might vary across different samples from same population
How does standard deviation of sampling distribution related to sample size (n)?
- As n INCREASES, the standard deviation of the sampling distribution DECREASES
- When n INCREASES, the statistic estimates the parameter more accurately
Notation for sample proportion
sample proportion (p^)
Notation for population proportion
true population proportion (p)
What is the sampling distribution for a single proportion?
- P^ = x/n
- if it was a census, you’d have true population proportion (p)
What happens to the standard deviation of a sampling distribution if sample size (n) increases?
as n INCREASES, standard deviation of a sampling distribution decreases
What is the center of the sampling distribution?
mean/center of sampling distribution is the true population parameter (p)
p^ ~ AN (p, square root of p(1-p)/n)
What is the spread of the sampling distribution?
- standard deviation
- standard error
p^ ~ AN (p, square root of p(1-p)/n)
What is standard error of sampling distribution?
- estimating the standard deviation of a sample distribution using sample data
replaces p with p^ in the standard deviation expression
SE (p^) = square root of p^(1-p^)/n
What happens to standard deviation/error if sample size (n) increases?
Because of the square root,
- Increase sample size (n) four times, cuts the standard deviation in half
- Increase sample size nine times, then the spread goes down to a third
When is the shape of the sampling distribution approximately normal?
np and n(1-p) ≥ 10
What is the shape of the distribution of p^?
p^ cannot be binomial or uniform; only approximately normal
conditions:
- sample taken from population 10x larger than sample
- np and n(1-p) ≥ 10
How do you use the sampling distribution of p^ to compute probabilities involving p^?
- estimate population proportion (p) with sample proportion (p^)
- p^ ~ AN (p, p(1-p)/n)
- Conditions: good data collection, random sample taken from population is at least 10 times larger than the sample, np AND n(1-p) ≥ 10
- normcdf to compute probabilities
What are the parameters of a binomial random variable?
X ~ binom(n,p)
What are intervals?
p^ +/- z* or number of standard deviations away (standard erorr)
z* = 95% or 2 standard deviations away from the mean
What is a point estimate?
A single number that is our best guess for the parameter
- doesn’t tell us how close the estimate is likely to be to the parameter
What is an interval estimate?
An interval of numbers within which the parameter value is believed to fall
- includes a range of plausible values which can capture the true parameter (p) of the point estimate
- includes margin of error
What are the differences between statistic vs population characteristics
statistic
- estimate
- changes based on the sample we drew
- follows a distribution
population characteristic
- fixed value
- doesn’t change
- unknown when conducting inference
What are the properties of a good estimator?
- unbiased: sampling distribution of the estimator is centered at population characteristic (p) – the center of the statistic is the thing you’re trying to estimate
- precise/accuracy: an accurate estimator falls closer to the parameter than others; has relatively small standard deviation
What is margin of error?
the maximum likely estimation error (unusual for an estimate to differ from the actual value of the population characteristic by more than the margin of error)
- How far off we expect p^ to be from true p thus use standard deviation of sampling distribution