hypothesis testing Flashcards

1
Q

What are the types of hypotheses?

A

Research hypothesis (the question being investigated)

Null hypothesis (𝐻0): The hypothesis that is tested

Alternative hypothesis (𝐻1 or 𝐻𝐴): The opposite of 𝐻0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a confidence interval?

A

An interval (lower & upper limit) within which the true value of a population parameter lies with a specified confidence level.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How do hypothesis tests differ from confidence intervals?

A

Hypothesis tests assess whether a single value is the true parameter.

Confidence intervals estimate a range where the true parameter likely falls.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the steps in a hypothesis test?

A
  1. Specify the hypothesis
  2. Obtain a test statistic from the data
  3. Compare the test statistic to a reference distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the null hypothesis (𝐻0)?

A

The hypothesis being tested, the test determines how much evidence the data provides to support this hypothesis.

Example:
𝐻0: πœ‡ = 3 (Mean density is 3 birds/kmΒ²)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the alternative hypothesis (𝐻1 or 𝐻𝐴)?

A

The hypothesis that contradicts 𝐻0, suggesting a difference or effect.

Example:
𝐻1: πœ‡ β‰  3 (Mean density is not 3 birds/kmΒ²)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are one-tailed and two-tailed tests?

A

One-tailed: Tests for a directional effect (e.g., 𝐻1: πœ‡ < 3 or 𝐻1: πœ‡ > 3)

when an effect can only occur in one direction

an effect can occur in both directions but only one direction is of interest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is two-tailed tests?

A

Two-tailed: Tests for any difference (e.g., 𝐻1: πœ‡ β‰  3)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What do we compare the test statistic to?

A

A reference distribution (e.g., t-distribution) to determine if the observed difference is significant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How does variability affect hypothesis testing?

A

Low variability β†’ Easier to detect a true difference

High variability β†’ Harder to conclude significant differences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What types of statistical tests are covered?

A

One-sample & two-sample t-tests

ANOVA (more than two groups)
z-tests (proportions)
Chi-square tests (categorical data)
Linear regression (t and F tests)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How is a t-statistic calculated in a one-sample t-test?

A

tstat = (data estimate - hypothesised value) / SE(data estimate)
​

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what will the tstat be if H0 is true

A

the test statistic (π‘‘π‘ π‘‘π‘Žπ‘‘
) will be small (dependent on sampling variability) because the difference between the data-estimate (sample mean) and the hypothesised value is small.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what will the tstat be if H0 is false

A

the test statistic will be large (dependent on sampling variability) because the difference between the data-estimate and the hypothesised value is large.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What distribution is typically used as the reference distribution in these examples?

A

The t-distribution is used as the reference distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What do the degrees of freedom (df) for the t-distribution depend on?

A

The degrees of freedom depend on what is being tested.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are the two ways the reference distribution helps determine the strength of evidence for the null hypothesis?

A
  1. By obtaining an exact probability for the test statistic.
  2. By comparing the test statistic to a critical value based on a predetermined significance level.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

In a one-sample two-tailed test, what is the null hypothesis (H0) and alternative hypothesis (H1)?

A

-𝐻0:πœ‡=3.6

𝐻1:πœ‡β‰ 3.6

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the reference distribution for the test with
𝑛=16?

A

𝑑𝑑𝑓=π‘›βˆ’1=𝑑15

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

In a two-tailed test, how is the area in the two tails interpreted?

A

The area in the tails represents the probability of obtaining a test statistic as extreme or more extreme than the observed value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

How do you calculate the area in the two tails for the test statistic βˆ’0.753?

A

Add the area in the left tail (< -0.753) and the right tail (> 0.753): 0.226+0.226=0.452

22
Q

What is a p-value in hypothesis testing?

A

The p-value is the probability of observing a test statistic as extreme, or more extreme, than the one observed, assuming the null hypothesis is true.

23
Q

What does the p-value quantify in hypothesis testing?

A

The p-value quantifies the chance of observing the data (or something more extreme) if the null hypothesis (H0) is true.

24
Q

When is the null hypothesis typically rejected based on the p-value?

A

The null hypothesis (𝐻0) is usually rejected when the p-value is very small.

25
Q

What are some common threshold values (significance levels) for p-values?

A

0.10 β†’ No evidence against 𝐻0
0.05 β†’ Weak evidence against 𝐻0
0.01 β†’ Some evidence against 𝐻0
0.001 β†’ Strong evidence against 𝐻0

26
Q

What does a large p-value indicate?

A

The test statistic is likely under 𝐻0

We fail to reject 𝐻0

27
Q

What does a small p-value indicate?

A

The test statistic is unlikely under 𝐻0

We reject 𝐻0 in favor of 𝐻1

28
Q

What does it mean to β€œfail to reject the null hypothesis”?

A

It means that the p-value is large, indicating that the test statistic is likely to occur if the null hypothesis is true. Therefore, there is no strong evidence against 𝐻0

29
Q

What does it mean to β€œreject the null hypothesis”?

A

It means the p-value is small, indicating that the test statistic is very unlikely to occur if the null hypothesis is true. This provides evidence in favor of the alternative hypothesis (𝐻1)

30
Q

What is the relationship between a large p-value and the test statistic?

A

A large p-value suggests that the test statistic is likely to occur under the null hypothesis, providing no strong evidence to reject 𝐻0

31
Q

What is Fixed Level Significance Testing in hypothesis testing?

A

Fixed Level Significance Testing involves comparing the test statistic to a critical value based on a fixed significance level (e.g., 0.05, 0.1) to decide whether to reject the null hypothesis.

32
Q

What are statistical tables used for in significance testing?

A

Statistical tables provide critical values (quantiles) for different reference distributions (like the t-distribution) at various significance levels.

33
Q

What does the critical value represent in significance testing?

A

The critical value is the threshold beyond which the test statistic would be considered extreme enough to reject the null hypothesis.

34
Q

In a two-tailed test with a 5% significance level, how is the significance level distributed in the tails?

A

2.5% in the lower tail
2.5% in the upper tail

35
Q

What is a two-sample t-test used for?

A

A two-sample t-test is used to compare the means of two groups to test if there is a statistically significant difference between them.

36
Q

What are the null and alternative hypotheses in a two-sample t-test?

A

𝐻0:πœ‡π΄βˆ’πœ‡π΅=0 (no difference between means)

𝐻1:πœ‡π΄βˆ’πœ‡π΅β‰ 0 (a difference exists between means)

37
Q

What formula is used to calculate the test statistic in a two-sample t-test?

A

tstat = (ΞΌ^A-ΞΌ^B)βˆ’0) / SE(ΞΌ^Aβˆ’ΞΌ^B)

38
Q

What happens if the test statistic is less extreme than the critical value in a fixed-level test?

A

The null hypothesis is not rejected, indicating that the observed data does not provide strong evidence against the null hypothesis.

39
Q

Why do we divide the significance level equally between two tails in a two-tailed test?

A

Because we are testing for a difference in either direction (greater or smaller), so the probability of an extreme result must be shared between both tails.

40
Q

What is the conservative assumption made about variance in the two-sample t-test?

A

It is assumed that the variances are unequal, making the test more conservative.

41
Q

What is a paired t-test used for?

A

A paired t-test is used to compare two dependent groups where observations in one sample are paired with observations in the other sample (e.g., before and after treatment).

42
Q

What is the formula for the paired t-test statistic?

A

tstat = (ΞΌ^dβˆ’0)/ SE(ΞΌ^d)

43
Q

How is the standard error for the paired t-test calculated?

A

SE(ΞΌ^d)=SDd / sqrt(n)

ΞΌ^d = Mean of the differences

𝑆𝐷𝑑 = Standard deviation of the differences

𝑛 = Sample size

44
Q

What are the key assumptions for t-tests?

A

Independence of data within and between groups

Normal distribution of data (assess via histograms or Shapiro-Wilk test)

t-tests are robust to non-normal data if sample sizes are similar.

45
Q

What should you do if normality is unreasonable in a dataset?

A

Use non-parametric tests, such as the Mann-Whitney-Wilcoxon test.

46
Q

What does the Mann-Whitney-Wilcoxon test compare?

A

It compares the ranks of two groups to test if their distributions are the same or different.

47
Q

How is the Mann-Whitney U test statistic calculated?

A

U=Wβˆ’ (n(n+1))/2
​
W = Sum of the ranks
𝑛 = Sample size

48
Q

What does a large p-value in the Mann-Whitney test indicate?

A

A large p-value suggests no evidence of a difference between the two groups.

49
Q

How can you assess the practical significance of a result?

A

Present the effect size and confidence interval to provide context beyond just statistical significance.

50
Q

What are the key steps in hypothesis testing?

A

Specify null and alternative hypotheses

Calculate a test statistic

Compare to a reference distribution

Use data to determine the strength of evidence for the null hypothesis