DatAna Flashcards
What is the null hypothesis?
→no statistically significant difference in the
relevant endpoint exists (or will be observed) between the groups
What is the alternate hypothesis?
→statistically significant difference
does exist between the groups
What should the hypothesis detail?
→key endpoint, dependent variable, or parameter is being measured
and compared between groups
What is the p-value?
→estimates the probability of the
difference observed between the groups occurring due to random variation in
the data if there was not any genuine difference between the groups
What is p<a></a>
→the difference is statistically significant → H0 rejected
What is p>a?
→the difference is not statistically significant → H0 retained
What is a smaller alpha value more likely to result in?
→result in a false positive as
very strong evidence of a statistically significant difference is required to
reject the null hypothesis
What is a greater alpha value more likely to result in?
→a false positive as
only weak evidence of a statistically significant difference is required to
reject the null hypothesis
What is the unpaired t-test?
→analyse the statistical significance of differences
between the means of two groups, where the data is parametric (normally
distributed) and samples are independent (unpaired)
What variables can impact the p-value by t-test?
→the magnitude of the difference between the mean values of the two groups,
→the standard deviation,
→ the number of samples in each group
What is a two-tailed test?
→assesses whether there is a statistically significant difference
between one group and another, regardless of whether it is an increase or a
decrease
→X being
different to Y
What is a one-tailed test?
→only assesses whether there is a statistically significant
increase/decrease between one group and another
→X being
greater than/less than Y
When are non-parametric methods used?
→evidence that the variable/data
in question is not normally distributed
What is the Mann-Whitney U test?
→non-parametric equivalent of the unpaired t-test used to assess whether
differences between two groups of independent samples are statistically
significant
How is Mann-Whitney U test carried out?
→assigning ranks to the data values
observed in the study
→estimating the probability of the ranks being
distributed between the two groups in the manner observed due to random
variation
What is used to measure the central tendency for non-parametric data?
→median rather than the mean which is used in parametric
How can we determine whether data is parametric or non-parametric without a test?
→the data (when expressed as a histogram) can be visually inspected to
subjectively judge whether the shape is approximately normal
→whether there is substantial kurtosis (flattening/peaking) or skewing
(asymmetry)
How can we determine whether data is parametric or non-parametric with a test?
→Shapiro-Wilks test
evaluates the null hypothesis that the data is not normally distributed
How do we interpret the Shapiro-Wilks test?
→If
the p-value calculated is less than the chosen alpha (threshold value,
e.g. 0.05) there is sufficient evidence to indicate that the data is non-
parametric, otherwise the data is assumed to be parametric due to there
being insufficient evidence to reject the null hypothesis
Define paired data
→Samples are dependent if each group contains the same samples; each
experimental condition involves the same set of participants
→eg. the same group of participants have their blood pressure
measured before and after taking a drug
Define unpaired data
→Samples are independent if each group contains different samples; each
experimental condition involves a different set of participants, animals, cells,
→E.g. one group of participants is treated with a drug, a different group of
participants is treated with placebo
What type of test for paired and non-parametric data?
→Wilcoxon Signed-Rank
What type of test for unpaired and non-parametric data?
→Mann-Whitney U test
What can result if multiple t-test are conducted for multiple comparisons?
→increased likelihood of type-1
errors (false positives)
What test is used for multiple comparisons?
→ANOVA followed by post test
What is a one-way ANOVA?
→test the null hypothesis that there are no
statistically significant differences between any of the groups
When is a post test carried out?
→If the result of the ANOVA is statistically significant
→o evaluate the statistical significance of specific combinations of
pairwise comparisons
What are the types of post-tests?
→Bonferroni
→Dunnet
→Turkey
What is Bonferroni?
→used when the researcher wishes to select
specific pairwise comparisons that do not a have particular pattern to
them
What is Dunnet’s?
→used when there are multiple groups to be compared to
a control group
What is Turkey’s?
→when a study requires pairwise comparison of every
possible combination of groups
What does the ANOVA test assume?
→independent samples and parametric
data
What are the alternatives to ANOVA?
→ The Kruskal-Wallis test (or ‘Kruskal-Wallis ANOVA’) is used when data
is non-parametric.
→A repeated measures ANOVA (rANOVA) is used to compare groups with dependent samples (paired data) and where data is parametric.
→The Friedman test is a non-parametric alternative to the rANOVA.
When is a two-way ANOVA used?
→analyse studies investigating whether there is
interaction between multiple factors
What does a two-way ANOVA examine?
→There are no statistically significant differences in the dependent
variable between the groups based on factor 1 (e.g. there is no
significant difference in the mean blood pressure of participants between
drug-treated and control treated groups)
→There are no statistically significant differences in the dependent
variable between the groups based on factor 2 (e.g. there is no
significant difference in the mean blood pressure of male participants
and female participants)
→There is no statistically significant interaction between factor 1 and
factor 2 in relation to the dependent variable (e.g. there is no significant
interaction between the effect of treatment on blood pressure and
participant’s sex).
What is an alternative to two-way ANOVA?
→Aligned Rank Transformation
→non-parametric data, rank based alternatives can be used
What test is used for categorical data?
→Pearson’s chi-squared test
What is the Pearson’s chi-squared test?
→there is no statistically significant difference
between the frequency distribution of different outcomes for different groups
What post tests can be used to compare specific pairs of groups or outcomes?
→Bonferroni correction
What test is used for unpaired categorical data?
→McNemar’stest
What is Fisher’s test appropriate for?
→analysing 2x2 contingency tables (i.e. two groups of samples and two
possible options for the categorical variable) and when sample size is low (n < 10)