Effect Size and Statistical Power Flashcards

1
Q

Think about the things that might make an experiment have a significant result

A
  • If you are measuring something that is easy to measure
  • Huge difference exists in real life (e.g., IQ of toddlers vs. university students)
  • If you have a really good measurement method
  • Everything controlled for by experimental design (e.g., test the same participants in two conditions)
  • If you have lots of resources
  • Test lots of participants
  • Take many measurements from each participant
  • [Ethically unsound: keep doing the experiment again until it is significant!]
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Effect Size

A
  • Why does it matter?
  • Small effects are often not meaningful in real life (e.g., if an intervention reduced depressive symptoms by 1/100th of an SD but took 8 hours a day for five years to complete)
  • People might want to compare different experiments from the literature
  • Significance tells us which group did better or which relation is present, but effect size tells us how big is the difference or how strong is the relationship
  • If you know how big an effect is, you can estimate how difficult it will be to measure a significant result: Power Analysis
  • This allows you to calculate how many participants to test/ how many measurements to take from each participant
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Effect Size Numerical Examples (Cohen, 1988 p 10)

A

• If the percentage of males in the population of psychiatric patients bearing a diagnosis of paranoid schizophrenia is 52%. and the effect is measured as a departure from the hypothesized 50%. the ES is 2%; if it is 60%, the ES is 10%, a larger ES.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Standardised Effect Sizes

A
  • Correlation between two variables, the r-statistic itself is the effect size:
  • r = .1, weak; r = .5, moderate; r = .7, strong; r = .9 very strong
  • Differences between means in t-tests Cohen’s d:
  • d = .2 small; d = .5, medium; d = .8, large
  • ANOVA/F-tests (variability of DV explained by IV) effect size is η² (Eta squared)
  • η² = .01, small; η² = .059, medium; η² = .138, large
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Cohen’s d

A

Just as z-scores represent points of data in a standardised way. Cohen’s d expresses the size of a difference in a standardised way.

Cohen’s 𝑑= (M_2-M_1)/〖SD〗_Pooled

And where M and SD are the mean and
standard deviation of each measure/group

Individuals in the experimental group remembered an average of 6.7 words (SD = 2.1), whilst individuals in the control group remembered and average of 6.2 words (SD = 2.3). Calculate the effect size:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do we measure small effects?

A
  • If effects are small, they are harder to measure

* This means we need more statistical power

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Think about the things that might make an experiment have a significant result

A

• If you are measuring something that is easy to measure
- Huge difference exists in real life (e.g., IQ of toddlers vs. university students)
• If you have a really good measurement method
- Everything controlled for by experimental design (e.g., test the same participants in two conditions)
• If you have lots of resources
- Test lots of participants
- Take many measurements from each participant
- [Ethically unsound: keep doing the experiment again until it is significant!]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Power is influenced by

A
  • Effect Size
  • Sample Size
  • Alpha Level
  • Directional Hypotheses (i.e., 1-tailed)
  • Variance of the population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is power?

A

The power of a statistical test of a null hypothesis is the probability that it will lead to the rejection of the null hypothesis, i.e., the probability that it will result in the conclusion that the phenomenon exists.
Cohen (1988, p4)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How much power should we have

A

• Typically researchers aim for a power of 0.8
- This means that they have an 80% chance of finding a significant effect if that effect really exists: E.g., if they ran an experiment 100 times, they would expect 80 of the experiments to be significant.
• Why not 100%
- With large enough power, spurious tiny effects can lead to a significant result
• Such effects may not be attributable to a real-life phenomenon

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

summary

A
  • Power is the probability that an experiment will lead to a significant result if there really is an effect in the population
  • This depends on:
  • Effect Size
  • The size of the effect being measured
  • The variance in the data
  • The number of participants being tested
  • The power of a study can influence the resources needed to measure a significant result
How well did you know this?
1
Not at all
2
3
4
5
Perfectly