Data Analysis 1: Comparing Means And Medians Flashcards
What is the purpose of performing statistical tests when analysing the data produced in an experiment?
To determine whether sufficient evidence has been generated to enable the experimental hypotheses to be rejected or retained, we need a method of the differences observed between groups/timepoints/conditions are: genuine or meaningful
What is a genuine method?
the difference observed due to a genuine difference between the nature of the varying
experimental conditions, or was it the result of uncontrolled/random variation?
What is a meaningful difference?
the difference constitute an effect-size that is biologically or clinically relevant?
What point does the statistical address?
- provide a method of determining the probability that differences
observed in data sets (e.g. different groups/timepoints) are the result of uncontrolled/random variation, in
contrast to a genuine difference resulting from the varying experimental conditions
What is a p-value?
Q: How are p-values used to interpret results?
A p-value is the calculated probability of observing an effect size as large or larger than the one observed
between the groups, if there was no genuine difference between the groups (i.e. if all of the data points for
the groups being compared actually came from the same group)
How are p-values used to interpret results?
P values are compared to a threshold value (‘α’ value), chosen prior to conducting the study, that is used as
an objective method of determining if an effect size is statistically significant or not.
p=0.05 means….
if there was no genuine difference between the groups, there would be a 5% probability of
observing a difference of at least as large as the one observed.
If p < α, p = 0.01 and α = 0.05, the difference…
…is statistically significant (reject H0)
If p ≥ α, p = 0.10 and α = 0.05, the difference…
…is not statistically significant (accept H0)
The most commonly used alpha value in biomedical science is probably 0.05. Why?
Alpha values used by convention (e.g. 0.05, 0.01) are arbitrary. The problem is that it is ultimately impossible
to know for certain if a result is due to a rare event or a genuine difference, but we still need to make a
decision regarding what the result means.
What should the choice of alpha/critical values consider?
the nature/design of the study and the relative consequences of
Type 1 (false positive) and Type 2 (false negative) errors.
What is a type 1 error?
false positive
What is a type 2 error?
false negative
What is a smaller alpha value more likely to result in?
a false negative
What is a larger alpha value more likely to result in?
a false positive