Statistical Hypothesis Tests Flashcards

Parametric (t-, F-, ANOVA, chi-squared), non-parametric (Mann-Whitney, Kruskal-Wallis, bootstraping)

1
Q

What are hypothesis tests?

A
  • tests one possible value (unlike how CI tests a range)
  • determine the strength of evidence, provided by data, against the proposition the single value is the true mean
  • work by comparing an estimate obtained from data with what we expect to find under H0
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the research hypothesis?

A

The hypothesis that the research is designed to investigate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the null hypothesis?

A

H0: hypothesis we test, typically a skeptical reaction to the research hypothesis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the alternative hypothesis?

A

H1: specifies a departure from H0, typically corresponds to the research hypothesis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a test-statistic?

A

Used to quantify how much our data-estimate differs from the value in our H0.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does the t-test statstic show us?

A

t-stat will be small if H0 is true.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When comparing two means, what is the null hypothesis?

A

That the difference between the values is 0.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How is the p-value obtained?

A

Compare t-stat to a reference t-distribution, p-value is obtained.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the p-value?

A

The probability of observing a test statistic at least as extreme as the one observed, given H0 is true.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What p-values count as strong evidence against H0?

A
  1. 1 - no evidence
  2. 05 - weak evidence
  3. 01 - some evidence
  4. 001 - strong evidence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When do we reject H0?

A

When the p-value is small. We reject H0 in favour of H1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the non-parametric alternative to the t-test?

A

The Mann-Whitney test.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does non-parametric mean? What is another name for it?

A

Methods to deal with non-normal data.

“distribution-free”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How is the Mann-Whitney test done?

A
  • data is converted into ranks:
    H0: there is NO difference between the ranks of each group
  • if H0 true, average ranks of each group about equal
  • if ranks are very different we have reason to believe that the samples were not drawn from the same population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the procedure for the Mann-Whitney test?

A
  1. combine the scores from both groups and rank them in order of increasing size
  2. take each group and calculate the sum (Wn, n is the group)
  3. find the test stat for each group (Un)
  4. choose either Un value as the test stat (Ustat), usually the smaller value is used
  5. the test stat is then compared to the relevant distribution
  6. U is approx normally distributed, standard value (z) can be found
  7. we can now compare this test stat to the distribution of the test stat under H0.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How is the standardised value for the Mann-Whitney test found?

A

z = (U - 0.5 - μU)/σU

17
Q

What is statistical significance?

A

H0 is rejected, indicates that differences in group means are not likely due to sampling error.

18
Q

What is practical significance?

A

Asks are the differences between samples big enough to have real meaning?

19
Q

What is a type I error?

A

Conclude there is a genuine difference when there is in fact not (reject H0 when its true).

20
Q

What is a type II error?

A

Conclude there is no difference when in reality there is (fail to reject H0 when it’s false).

21
Q

What does ANOVA mean?

A

ANalysis Of VAriance

22
Q

What is an ANOVA test?

A

One test that compares all means simultaneously.

  • H1: at least one mean is different from at least one of the others
  • H0 is true: expect group means to be similar and any differences to be from sampling variation alone
23
Q

What type of statistic does an ANOVA test produce?

A

F-test statistic.

24
Q

What is s^2B and how is it calculated (for F-test stat)?

A
  • difference between group mean and overall mean
  • mean sq between groups (ANOVA table)

s^2B = (Σ ni (x̄ i - x̄)^2)/k-1 = SSB/ k - 1

25
Q

What is s^2W and how is it calculated (for F-test stat)?

A
  • variability between groups
  • mean sq residuals (ANOVA table)

s^2W = (Σ (ni - 1) si^2)/n(tot) - k = SSW/n(tot) - k

26
Q

How does the F-stat look when H0 is false?

A

Should be large.

27
Q

What is Tukey’s Difference test?

A
  • used for multiple comparrisons

- output: difference, lower and upper bounds of CI, adjusted p-value

28
Q

What is the Kruskal-Wallis test?

A
  • does not assume normality (non-parametric)
  • assumes different groups have the same stand. deviation
  • H0: all medians are the same
  • data is assigned ranks: smallest score = 1, ties = mean of the ranks concerned (eg 1.5)
  • average rank calculated for each group
29
Q

Give a summary of Hypothesis Testing.

A
  1. state H0, H1
  2. statstical test (eg. F-test, Wilcoxon)
  3. significance level (α)
  4. sampling distribution
  5. finding p-value
  6. do we reject H0?
  7. interpret results based on 6.
30
Q

What is the z-test used for?

A
  • comparing probabilities

- test statistic can be calculated

31
Q

What is the null hypothesis in a χ^2 goodness of fit test?

A

H0: expected count = total x specified cell probability

32
Q

When do we reject H0 for a χ^2 test?

A

The larger the test stat (χ0^2), the stronger the evidence against H0.

33
Q

How do we know if a χ^2 test stat is extreme under H0?

A

Compare χ^2 test stat (χ0^2) with a value obtained from a reference chi-squared dist (χdf^2).

34
Q

Explain how the values in an ANOVA table are found from the first two columns (Df and Sum Sq)?

Df Sum Sq Mean Sq F-value Pr(>F)

A

Mean Sq - Sum Sq/Df

F-ratio - Mean Sq Group/Mean Sq Residuals

p value - F ratio (above) associated with Df Group and Df Residuals

35
Q

What do the columns of a Tukey’s HSD test mean?

diff lwr upr p adj

A

diff - estimated difference of means
upr - upper bound of CI
lwr - lower bound of CI
p adj - evaluating H0, Pr that the difference between means is 0