DatAna Flashcards

1
Q

What is the null hypothesis?

A

→no statistically significant difference in the

relevant endpoint exists (or will be observed) between the groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the alternate hypothesis?

A

→statistically significant difference

does exist between the groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What should the hypothesis detail?

A

→key endpoint, dependent variable, or parameter is being measured
and compared between groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the p-value?

A

→estimates the probability of the
difference observed between the groups occurring due to random variation in
the data if there was not any genuine difference between the groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is p<a></a>

A

→the difference is statistically significant → H0 rejected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is p>a?

A

→the difference is not statistically significant → H0 retained

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a smaller alpha value more likely to result in?

A

→result in a false positive as
very strong evidence of a statistically significant difference is required to
reject the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a greater alpha value more likely to result in?

A

→a false positive as
only weak evidence of a statistically significant difference is required to
reject the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the unpaired t-test?

A

→analyse the statistical significance of differences
between the means of two groups, where the data is parametric (normally
distributed) and samples are independent (unpaired)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What variables can impact the p-value by t-test?

A

→the magnitude of the difference between the mean values of the two groups,

→the standard deviation,

→ the number of samples in each group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a two-tailed test?

A

→assesses whether there is a statistically significant difference
between one group and another, regardless of whether it is an increase or a
decrease

→X being
different to Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a one-tailed test?

A

→only assesses whether there is a statistically significant
increase/decrease between one group and another

→X being
greater than/less than Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When are non-parametric methods used?

A

→evidence that the variable/data

in question is not normally distributed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the Mann-Whitney U test?

A

→non-parametric equivalent of the unpaired t-test used to assess whether
differences between two groups of independent samples are statistically
significant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How is Mann-Whitney U test carried out?

A

→assigning ranks to the data values
observed in the study

→estimating the probability of the ranks being
distributed between the two groups in the manner observed due to random
variation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is used to measure the central tendency for non-parametric data?

A

→median rather than the mean which is used in parametric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How can we determine whether data is parametric or non-parametric without a test?

A

→the data (when expressed as a histogram) can be visually inspected to
subjectively judge whether the shape is approximately normal

→whether there is substantial kurtosis (flattening/peaking) or skewing
(asymmetry)

18
Q

How can we determine whether data is parametric or non-parametric with a test?

A

→Shapiro-Wilks test

evaluates the null hypothesis that the data is not normally distributed

19
Q

How do we interpret the Shapiro-Wilks test?

A

→If
the p-value calculated is less than the chosen alpha (threshold value,
e.g. 0.05) there is sufficient evidence to indicate that the data is non-
parametric, otherwise the data is assumed to be parametric due to there
being insufficient evidence to reject the null hypothesis

20
Q

Define paired data

A

→Samples are dependent if each group contains the same samples; each
experimental condition involves the same set of participants

→eg. the same group of participants have their blood pressure
measured before and after taking a drug

21
Q

Define unpaired data

A

→Samples are independent if each group contains different samples; each
experimental condition involves a different set of participants, animals, cells,

→E.g. one group of participants is treated with a drug, a different group of
participants is treated with placebo

22
Q

What type of test for paired and non-parametric data?

A

→Wilcoxon Signed-Rank

23
Q

What type of test for unpaired and non-parametric data?

A

→Mann-Whitney U test

24
Q

What can result if multiple t-test are conducted for multiple comparisons?

A

→increased likelihood of type-1

errors (false positives)

25
Q

What test is used for multiple comparisons?

A

→ANOVA followed by post test

26
Q

What is a one-way ANOVA?

A

→test the null hypothesis that there are no

statistically significant differences between any of the groups

27
Q

When is a post test carried out?

A

→If the result of the ANOVA is statistically significant

→o evaluate the statistical significance of specific combinations of
pairwise comparisons

28
Q

What are the types of post-tests?

A

→Bonferroni
→Dunnet
→Turkey

29
Q

What is Bonferroni?

A

→used when the researcher wishes to select
specific pairwise comparisons that do not a have particular pattern to
them

30
Q

What is Dunnet’s?

A

→used when there are multiple groups to be compared to

a control group

31
Q

What is Turkey’s?

A

→when a study requires pairwise comparison of every

possible combination of groups

32
Q

What does the ANOVA test assume?

A

→independent samples and parametric

data

33
Q

What are the alternatives to ANOVA?

A

→ The Kruskal-Wallis test (or ‘Kruskal-Wallis ANOVA’) is used when data
is non-parametric.

→A repeated measures ANOVA (rANOVA) is used to compare groups with 
dependent samples (paired data) and where data is parametric. 

→The Friedman test is a non-parametric alternative to the rANOVA.

34
Q

When is a two-way ANOVA used?

A

→analyse studies investigating whether there is

interaction between multiple factors

35
Q

What does a two-way ANOVA examine?

A

→There are no statistically significant differences in the dependent
variable between the groups based on factor 1 (e.g. there is no
significant difference in the mean blood pressure of participants between
drug-treated and control treated groups)

→There are no statistically significant differences in the dependent
variable between the groups based on factor 2 (e.g. there is no
significant difference in the mean blood pressure of male participants
and female participants)

→There is no statistically significant interaction between factor 1 and
factor 2 in relation to the dependent variable (e.g. there is no significant
interaction between the effect of treatment on blood pressure and
participant’s sex).

36
Q

What is an alternative to two-way ANOVA?

A

→Aligned Rank Transformation

→non-parametric data, rank based alternatives can be used

37
Q

What test is used for categorical data?

A

→Pearson’s chi-squared test

38
Q

What is the Pearson’s chi-squared test?

A

→there is no statistically significant difference

between the frequency distribution of different outcomes for different groups

39
Q

What post tests can be used to compare specific pairs of groups or outcomes?

A

→Bonferroni correction

40
Q

What test is used for unpaired categorical data?

A

→McNemar’stest

41
Q

What is Fisher’s test appropriate for?

A

→analysing 2x2 contingency tables (i.e. two groups of samples and two
possible options for the categorical variable) and when sample size is low (n < 10)