Statistical Hypothesis Tests Flashcards
Parametric (t-, F-, ANOVA, chi-squared), non-parametric (Mann-Whitney, Kruskal-Wallis, bootstraping)
What are hypothesis tests?
- tests one possible value (unlike how CI tests a range)
- determine the strength of evidence, provided by data, against the proposition the single value is the true mean
- work by comparing an estimate obtained from data with what we expect to find under H0
What is the research hypothesis?
The hypothesis that the research is designed to investigate.
What is the null hypothesis?
H0: hypothesis we test, typically a skeptical reaction to the research hypothesis.
What is the alternative hypothesis?
H1: specifies a departure from H0, typically corresponds to the research hypothesis.
What is a test-statistic?
Used to quantify how much our data-estimate differs from the value in our H0.
What does the t-test statstic show us?
t-stat will be small if H0 is true.
When comparing two means, what is the null hypothesis?
That the difference between the values is 0.
How is the p-value obtained?
Compare t-stat to a reference t-distribution, p-value is obtained.
What is the p-value?
The probability of observing a test statistic at least as extreme as the one observed, given H0 is true.
What p-values count as strong evidence against H0?
- 1 - no evidence
- 05 - weak evidence
- 01 - some evidence
- 001 - strong evidence
When do we reject H0?
When the p-value is small. We reject H0 in favour of H1.
What is the non-parametric alternative to the t-test?
The Mann-Whitney test.
What does non-parametric mean? What is another name for it?
Methods to deal with non-normal data.
“distribution-free”
How is the Mann-Whitney test done?
- data is converted into ranks:
H0: there is NO difference between the ranks of each group - if H0 true, average ranks of each group about equal
- if ranks are very different we have reason to believe that the samples were not drawn from the same population
What is the procedure for the Mann-Whitney test?
- combine the scores from both groups and rank them in order of increasing size
- take each group and calculate the sum (Wn, n is the group)
- find the test stat for each group (Un)
- choose either Un value as the test stat (Ustat), usually the smaller value is used
- the test stat is then compared to the relevant distribution
- U is approx normally distributed, standard value (z) can be found
- we can now compare this test stat to the distribution of the test stat under H0.