Ch 4 Statistics Flashcards
distribution of replicate measurements formula
y = (e^(-(x-µ)^2) / (2sigma^2) / (sigma √(2π))
distribution of replicate measurements formula
general + z distribution ones
y = [e^(-(x-µ)^2) / 2sigma^2] / [(sigma√(2π))]
µ: population mean
sigma: population SD
y = [e^-z^2 / 2] / [sigma√(2π)]
small vs large SD precision vs accuracy
small SD: precise, not a measure of accuracy
large SD: not precise, not a measure of accuracy
z vs t distribution
z: when sigma is known
- taller and smaller tails
- as df goes to infinity
t: when sigma is unknown
- shorter and longer tails
- specified df
z value formula
zi = (xi - µ) / sigma
z distribution % of values between 1, 2, 3 SD from mean
1 SD: 68%
2 SD: 95%
3 SD: 99.7%
if random is the only error, 95% of measurements should fall between how many SD
-1.96 sigma & +1.96 sigma
confidence interval vs confidence value
CI: high + low values (like interval notation)
CV: the error (the ± value)
confidence interval formula
CI value = value ± z sigma(value) / √n
t value formula
ti = (xi - µ) / s
when to use 3 t calc formulas
1. observed - expected
2. x bar 1 - x bar 2, s pooled
3. d bar
- compare expected value to observed value (ex. mud at crime scene vs amount to be guilty)
- compare 2 observed means (ex. water contaminants in two parts of river)
- compare paired data (ex. drug test and multiple athletes)
t calc > t table
t calc < t table
calc > table: significantly different; reject H0; random error can’t explain difference
calc < table: not significantly different; accept H0; random error can explain difference
alternative vs null hypothesis
alternative: try to prove (has an effect); in tails of t distribution
null: try to disprove (doesn’t have an effect); in large bump of t distribution
small sample t test formula
t = [x bar - µ0] / [s/√N]
Q test for outliers formula + interpretation of value
Q = |questionable results - nearest neighbor| / range
Q calc > Q critical: reject value; it’s an outlier