week 7 & 8 -- t-Distribution, df, CIs, hypothesis testing Flashcards
What is the problem with small samples?
ERROR (non-systematic difference btw estimator and paramter)
–> small samples are more likely to yield extreme results
t-distribution
Used to make probabilistic inferences from small samples (instead of Normal)
paramters: df (or degrees of freedom), or “tail fattening”
degrees of freedom (n - 1)
when estimating the population SD:
- Bessel’s correction corrects for BIAS
- the t-distribution corrects for ERROR
n = number of independent observations df = number of independent observations...given any statistics we have already calculated for our sample
so if we have already calculated the mean, the last observation is not independent of the others (because we can calculate it…)
another way to see sampling distribution
- we have a sample and have calculated its mean
- the sampling distrib. tells us what other means we might have got instead (for samples of this size, from this pop.)
confidence intervals
?
For each change, how does the CI change?
- we want to be more confident (95% > 99%)
- we take a larger sample
- we study a larger population
- we study a more variable population
- wider
- narrower
- nothing
- wider
Which statement about the CI is correct for
85
5!
Error bars
SD
- shows spread of sample
- usually widest
- doesn’t easily allow us to infer significant difference
SE
- related to the precision of our estimate
- narrowest
- doesn’t easily allow us to infer significant difference
CI
- DOES easily allow us to infer significant difference
Good evidence is:
- probable given the hypothesis
- improbable otherwise
p value = P (D given NOT hypotehsis)
the probability of observing data as amazing as the data we have, given that our hypothesis is false