Statistical Tests Flashcards

1
Q

What is a hypothesis?

A
  • allows you to state your idea in a specific testable form
  • used to describe a working theory about the data sets you’re considering
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define ‘null hypothesis’

A
  • a working assumption that there’s no difference between the data sets you wish to compare
  • i.e. there is no difference/relationship/association
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define ‘alternative hypothesis’

A
  • assumption that there is a difference between data sets
  • i.e. there is a difference/relationship/association
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define ‘significance’

A
  • a measure of likelihood that the NULL hypothesis is correct
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define ‘two-sided hypothesis’

A
  • states the difference could be in either direction
  • null = NO difference between methods A and B
  • alternative = A difference between methods and B
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define ‘one-sided hypothesis’

A
  • states a difference in a specific direction
  • alternative = test results using method A are HIGHER/LOWER than those using method B
  • null = test results using method A are NOT HIGHER/LOWER than those using method B
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a ‘type I error’?

A
  • false positive (reject the NH when it is true)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a ‘type II error’?

A
  • false negative (accept the NH when it is false)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the type II error rate (b)?

A
  • the probability of incorrectly retaining a false null hypothesis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is meant by ‘power’?

A
  • the probability of correctly rejecting a false null hypothesis
  • power = 1 - b (type II error rate)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How can we reduce the chance of type I error?

A
  • choose a lower probability/higher significance
  • e.g. P = 0.01
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the issue of choosing a higher significance?

A
  • critical value of test statistic increases
  • thus probability of type II error increases
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the significance levels (P-values) and what they mean?

A
  • P > 0.05 = not significant
  • P < or = 0.05 = significant
  • P < or = 0.01 = highly significant
  • P < or = 0.001 = very highly significant
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a parametric test?

A
  • makes particular assumptions about mathematical nature of population distribution from which the samples were taken
  • better able to distinguish between true and marginal differences between sample (have greater ‘power’)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a non-parametric test?

A
  • doesn’t assume that data fit a particular pattern, but may assume some characteristics of their distributions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is ‘effect size’ (ES)?

A
  • measures the strength of the result is solely magnitude-based
  • it does not depend on sample size
17
Q

What is meant by ‘A priori power analysis’?

A
  • done in planning phase to determine N (study size)
  • necessary to justify project resources in funding processes
  • minimise use of animals or risk to patients in clinical research
  • involves estimating the sample size required for a study based on predetermined maximum tolerable Type I and II error rates and the minimum effect size that would be clinically, practically, or theoretically meaningful
18
Q

What is meant by ‘Post-hoc power analysis’?

A
  • done on completion of study to determine Observed Power
  • Necessary to check that your expected and measured ES align well
  • i.e. Did you have sufficient subjects to detect differences reliably?
19
Q

What is meant by ‘Sensitivity power analysis’?

A
  • done in planning phase when the sample size is predetermined by study constraints e.g. if there are only 20 subjects available in a pilot study
  • instead we determine what level of effect we might be able to find, referred to as the minimal detectable effect (MDE) or minimum clinically important difference (MCID)
20
Q

How does sample size affect the effect size?

A
  • as sample size increases, the effect size decreases
21
Q

What is the relationship sample size and power?

A
  • increase in sample size increases power
  • but NOT a linear relationship
22
Q

When would you use a t-test?

A
  • comparing means from TWO independent samples
23
Q

What other situations would you use a t-test?

A
  • comparing means of paired data
  • comparing a sample mean with a chosen value
24
Q

When would you use ANOVA?

A
  • comparing means from TWO OR MORE samples
25
Q

Similarity between t-test and ANOVA

A
  • both assume data has a Gaussian distribution and variances of the samples are homogeneous
26
Q

What are the 2 chief non-parametric tests for comparing locations of 2 samples?

A
  • Mann-Whitney U-test
  • Kolmogorov-Smirnov test
27
Q

What does the Mann-Whitney U-test assume and what sample’s size should it be?

A
  • assumes that the frequency distributions of samples are similar
  • sample’s size must more or equal to 4
28
Q

What sample’s size must Kolmogorov-Smirnov test have?

A
  • more or equal to 4
  • samples must have equal sizes
29
Q

Significant differences found with the Kolmogorov-Smirnov test could be due to what?

A
  • differences in location
  • or shape of distribution
  • or both
30
Q

What are the 2 suitable non-parametric comparisons of location of paired data?

A
  • Wilcoxon’s signed rank test
  • Dixon and Mood’s sign test
31
Q

What is the Wilcoxon’s signed rank test?

A
  • used for quantitative data
  • assumes that distributions have similar shape
  • sample size equal to or more than 6
32
Q

What is the Dixon and Mood’s sign test used for?

A
  • paired data scores where one variable is recorded as ‘greater than/better than’ the other
  • sample size equal to or more than 6
33
Q

What are the 2 suitable non-parametric comparisons of location for 3 or more samples?

A
  • Kruskal-Wallis H-test
  • Friedman S-test
34
Q

What is the Kruskal-Wallis H-test?

A
  • number of samples is without limit and can be unequal in size
  • underlying distributions are assumed to be similar