Statistics Flashcards
Inferential statistics
Allows for generalisations to be made about a population from a sample representative
What are common problems in biological data
- small sample size
- unequal sample size
- correlation within data (measurements from subject over time, or measurements from brain regions at the same time will always be correlated)
- unequal variance (heterogeneity)
- non-normal (skewed) distribution
Discrete variables
variables who values are finite, or countably infinite values within a range
- eg: ‘pain relief’ vs ‘no pain relief’ or subjective rating scales
these numbers do not have the same academic integrity as continuous variables (thus means have ‘less’ meaning)
continuous variables
variables whose values exist on an infinite continuum/are uncountable
- e.g. frequency, temperature, amplitude, enzyme concentration, receptor density
Binary variables
Yes or No outcomes
Nominal variables
Represents groups with no ‘rank’ or ‘order’ within them
- eg: species, colour, brands
Ordinal variables
Groups that are ranked in a specific order
- eg: likert scales,
Parametric statistics
that the data follows a normal distribution, and that there is equal variance within each group (homogeneity of variance)
Nonparametric statistics
used when the data does not follow a normal/known distribution
- tend to be less statistically powerful
Parametric test used for 2 unpaired groups
Unpaired t-test
Non-parametric test used for 2 unpaired groups
Man-Whitney U test
Parametric test used for 2 paired groups
Paired t-test
Non-parametric test used for 2 paired groups
Wilcoxon test
Parametric test used for ≥3 unmatched groups
1 way ANOVA
Non-parametric test used for ≥3 unmatched groups
Kruskal-wallis test
Parametric test used for ≥3 matched groups
Repeated measures ANOVA
Non-parametric test used for ≥3 matched groups
Friedman test
Parametric test used to determine association between two variables
Pearson correlation
Non-parametric test used to determine association between two variables
Spearman correlation
Parametric test used to predict a value for one variable from other(s)
Simple linear/non-linear regression
Multiple linear/non-linear regression
Parametric test used to predict a value for one variable from other(s)
Non-parametric regression
Central limit theorem
As the sample size increases, the probability that the data will be normally distributed increases
The null hypothesis
Assumes that there is no difference between groups
Power
(1-b)
the probability of rejecting the null hypothesis
- increasing sample size results in decreased variability, and thus greater power