Hypothesis testing Flashcards

Familiarise with significance tests etc

1
Q

What are the three central features of significance testing?

A

1) Assume a normally distributed range of error around an event
2) Assess likelihood of a particular variant of this event accounting for the normal range of error
3) Decide that the likelihood of that event under these circumstances is far too low and therefore must be significant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How would we go about calculating the difference between two means to determine the presence/absence of a significant effect?

A

We would calculate a TEST STATISTIC i.e. a z-score
This is standardised so we are able to know its exact distribution under the null hypothesis, and through the distribution curve we are also able to calculate the probability of that test statistic given the null hypothesis is true - probability is area under the curve and this is found using reference tables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

RECAP: How is the Test Statistic calculated?

A

Need to calculate SE first
Then z = (sample mean - population mean (i.e. mean under null hypothesis))/SE
We then look the resulting value up in the tables and we get the probability of that test statistic assuming null hypothesis to be true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What would we say if a sample mean falls below the lower boundary of a 95% interval?

A

It is in the CRITICAL REGION under the null hypothesis and the sample is SIGNIFICANTLY DIFFERENT from the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a uni-directional hypothesis?

What are the advantages and disadvantages of this type of hypothesis?

A

Predict difference and state direction (One-tailed test)
ADVANTAGE - Value of z-score doesn’t have to be as large to reject null hypothesis (remember we are interested in the most extreme 5% of scores and in a one-tailed test this is all at one end)
DISADVANTAGE - if direction of difference is opposite to what was predicted, null hypothesis cannot be rejected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a bi-directional hypothesis?

What are the advantages and disadvantages of this type of hypothesis?

A

Predict difference but do not state direction (Two-tailed test)
ADVANTAGE - We can reject the null if an effect is significant in either direction
DISADVANTAGE - The z-score has to be a larger value in order to reject the null as the extreme 5% is now halved and is 2.5% either end of the distribution so to be beyond the critical point the value for z is going to need to be higher

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a Type 1 error?

A

Finding an effect which isn’t actually there i.e. rejecting null hypothesis even when it is true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a Type 2 error?

A

Missing an effect that actually is there i.e. accepting null hypothesis when it is actually false

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why is effect size important?

A

The relative size of an effect is more useful than its absolute size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the 3 main measures of effect size that can be used?

A

1) Cohen’s d
2) Pearson’s correlation coefficient, r
3) Odds ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When would we want to calculate Cohen’s d and how can we use it?

A

Comparing 2 means - the d value is the difference in the means of 2 groups divided by the average of their standard deviations (so if we see d=1, we know that the 2 means differed by 1 SD)
Small effect = 0.2
Medium effect = 0.5
Large effect = 0.8

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When would we use Pearson’s correlation coefficient and why is it useful?

A

Covers whole strength of relationship range, from none at all (0) to perfect (1)
Tells us how large a relationship between two variables actually is, INDEPENDENT OF SAMPLE SIZE Small effect = 0.1 (i.e. effect measured explains 1% of total variance)
Medium = 0.3 (i.e. effect measured explains 9% of variance, r squared)
Large = 0.5 (i.e. effect measured explains 25% of total variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Why would the odds ratio be used?

A

Measures how many times bigger the odds of one outcome is for one value of an IV compared to another
Essentially measures association between exposure and outcome
To convert from odds ratio to effect size, simply divide odds ratio by 1.81

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Define what a z-score actually is

A

The number of standard deviations a score is from the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is meant by “Sampling Distribution”? What do such distributions look like?

A

Distribution of means of samples rather than individual values
“The collection of means of all possible random samples, size N, from a population, where the mean of the distribution is estimated as the sample mean”
Approaches a normal distribution as sample size increases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are Confidence Limits for the mean of a sample?

A

Range of values within which we are 95% certain the true population mean lies - within plus and minus 1.96SE of the sample mean

17
Q

What is the correct definition of a null hypothesis?

A

NOT a prediction about what will happen, but a CLAIM about what the populations the samples were drawn from are like

18
Q

What is the general probability of making a Type 1 error during hypothesis testing?

A

5% (Rejection region is 5%)

19
Q

What is one strong way of reducing risk of making Type 1 errors?

A

Replication of study - if able to successfully replicate findings, probability of results having been a fluke decreases

20
Q

When may a researcher use a 1% level of significance?

A

1) Challenging a well-established theory
2) Only have one chance to demonstrate the effect - replication may be impossible in many field studies/natural experiments

21
Q

When might a researcher use less than a 1% level of significance?

A

In research producing applications for human health/life changes e.g. drug testing - we would want to be absolutely certain that no chance effects were recorded

22
Q

What is a major problem with using one-tailed tests?

A

If a result is extreme in the opposite direction to that predicted, a significant result cannot be reported. If you were to switch you would actually be working with a significance level of 0.1 because you would have your original 0.05 and the new 0.05 at the other end