L18, L20, L22, L23- biostats lectures Flashcards
what is sampling distribution
where you take all data points (2) and pair them with themselves and each other to find a distribution of means => n^2 sample size with normal distribution curve
(mean stays the same, SD dec)
what is the formula for standard error of the mean
SEM = σ / sqrt(n)
- measures variability in the mean
- decreases as sample size (n) increases
Central Limit Theorem in terms of not normal population distribution
it yields a normal distribution- increases the sample size of the population
calculation of degrees of freedom
df = n - 1
define t-distribution
normal distribution with fatter tails (more extreme values included)
-as df increases (n - 1 inc), tails get smaller and t-values approach Z-values (n = infinity)
define random sample
randomness means everyone has an equal probability of being included
what does the Central Limit Theorem infer about a sample
allows us to infer population parameters (mean, SD) from a single sample of sufficient size
describe when it is better to use standard error OR standard deviation
SEM- how well does the sample estimate the mean
SD- how widely scattered are the measurement in the population
define the purpose of Confidence Interval
makes inferences about the true mean based on the mean and SD of the sample
define null and alternative hyopthesis
null- nothing unusual is happening, no relationship between exposure/disease
alternative- something unusual is happening, exposure and disease are related
what are the two ways to test a hypothesis
One-sided: alternative hypothesis specifies a direction (only better or only worse)
Two-sided: alternative hypothesis can go in either direction (either better or worse)
CI = (1)
P = (2)
1- CI = 1 -α
2- P = 1 - β
what is the Z value formula
Z = (X - µ) / (SD/sqrt(n))
define type I and type II errors in terms of null hypothesis
Type I (α error/FP)- null hypothesis is true, study rejects Ho
Type II (β error/FN)- Ho is false, study supports Ho
what situations are bad to have Type I errors
(false positives)
- Tx is expensive, difficult
- cost of false alarm is high
- no effective Tx
what situations are bad to have Type II errors
(false negative)
- Tx is cheap, easy
- cost of false alarm is low
- Tx is only responsive in early stages