Statistical Tests Flashcards
1
Q
What is a hypothesis?
A
- allows you to state your idea in a specific testable form
- used to describe a working theory about the data sets you’re considering
2
Q
Define ‘null hypothesis’
A
- a working assumption that there’s no difference between the data sets you wish to compare
- i.e. there is no difference/relationship/association
3
Q
Define ‘alternative hypothesis’
A
- assumption that there is a difference between data sets
- i.e. there is a difference/relationship/association
4
Q
Define ‘significance’
A
- a measure of likelihood that the NULL hypothesis is correct
5
Q
Define ‘two-sided hypothesis’
A
- states the difference could be in either direction
- null = NO difference between methods A and B
- alternative = A difference between methods and B
6
Q
Define ‘one-sided hypothesis’
A
- states a difference in a specific direction
- alternative = test results using method A are HIGHER/LOWER than those using method B
- null = test results using method A are NOT HIGHER/LOWER than those using method B
7
Q
What is a ‘type I error’?
A
- false positive (reject the NH when it is true)
8
Q
What is a ‘type II error’?
A
- false negative (accept the NH when it is false)
9
Q
What is the type II error rate (b)?
A
- the probability of incorrectly retaining a false null hypothesis
10
Q
What is meant by ‘power’?
A
- the probability of correctly rejecting a false null hypothesis
- power = 1 - b (type II error rate)
11
Q
How can we reduce the chance of type I error?
A
- choose a lower probability/higher significance
- e.g. P = 0.01
12
Q
What is the issue of choosing a higher significance?
A
- critical value of test statistic increases
- thus probability of type II error increases
13
Q
What are the significance levels (P-values) and what they mean?
A
- P > 0.05 = not significant
- P < or = 0.05 = significant
- P < or = 0.01 = highly significant
- P < or = 0.001 = very highly significant
14
Q
What is a parametric test?
A
- makes particular assumptions about mathematical nature of population distribution from which the samples were taken
- better able to distinguish between true and marginal differences between sample (have greater ‘power’)
15
Q
What is a non-parametric test?
A
- doesn’t assume that data fit a particular pattern, but may assume some characteristics of their distributions