Inferential Biostatistics Flashcards
what are the steps for hypothesis testing?
- state the null and alternative hypotheses
- select the alpha value (usually 0.05)
- gather data and perform appropriate statistical tests generating a p value
- accept or reject the null
what is the null and alternative hypotheses for investigating whether a new drug improves symptoms of tension headache?
n: drug produces no change in headache symptoms compared to placebo
a: drug produces a significant change in headache symptoms compared to placebo
- -> must not say it makes symptoms better because you need to leave open possibility that it makes symptoms worse
- just say it produces a change or it does not
what is the alpha level?
designates the level of uncertainty you’re willing to accept
- alpha=0.05… you’re willing to accept that 5 % of the time the test could be wrong, the results are generated by chance
what is the p value?
the measure of significance obtained from statistical tests
-the probability that the difference between group means found in the research is due to chance alone
describe a parametric test
- focused on population parameters
- requires a continuous variable (ex cholesterol levels)
- assumes a normal distribution
- generally more powerful than nonparametric
what are examples of parametric tests
anova, t-test, paired t-test, confidence intervals
what is the central limit theorem
creates a normal distribution by sampling the population and distributing the means of each sample
- states that for a sufficiently large sample size, the means will aways tend to be normal irrespective of the shape of the population from which the samples were drawn
why are sampling distributions (central limit theorem) useful?
-allows parametric (ones that require normal distribution) tests to be done even if the underlying distribution is not normal so long as the sample size is large enough
describe a non-parametric test
not focused on population parameters
can be applied to discrete variables
used for smaller samples
makes few assumptions about distribution
-used more frequently than parametric because they don’t need continuous data and don’t make assumptions about distribution
what are examples of non-parametric test
chi-square
—also mann-whitney, fischer’s wilcoxon, etc (don’t need to know specifics)
what does a paired t-test compare
compares means of a single group before/after an intervention
ex: pre/post testing to access of value of a workshop
what does a t-test compare
compare means between two groups
ex: experimental drug group and placebo group
what does an ANOVA compare?
compares means between more than one group
ex: birth weights in non, light, and heavy smokers
what does a chi-square test do?
- non-parametric test
- used for testing hypothesis about nominal scale data
- test of proportions
ex: is there a significance difference in seat belt use between high school graduates and college graduates?
ex: is there a significant difference in growth of plants treated with water vs plants treated with grape solution?
what is type I error?
false positives
-null is actually true but it was rejected