Quiz 3 Flashcards
Quiz 3 for Stats I
Flip side of power ( __%) is ___ (__%)
80%, type II error / beta (20%)
directional hypothesis is to _____ as non-directional is to ____
one tailed, two tailed
What is the mean of the distribution of a z statistic when H0 (null) is true
mean of zero
When null is false, H1 is ____
true
Power
- The greater the number of subjects in a sample, the more power (generally)
- Though the incremental increase in power becomes smaller as the sample size increases
- Power analysis is a statistical method to calculate the sample size needed to achieve desired power
Influences on Power
- Significance level (alpha), more lenient alpha = more power
- One-tailed vs two tailed test (one tailed has more power)
- As standard error decreases, there is more power
more lenient alpha = _____ power
more
which has more power: a one-tailed t-test or a two tailed
one tailed
As standard error decreases, power ________
increases
why is there greater likelihood for us to reject the null hypothesis when there is less standard error?
there is less overlap between the graphs of the null and alternative distributions
One-tailed test
Tests a directional hypothesis (specific direction of an effect)
E.g. change > 0
Two-tailed test
Test a non-directional hypothesis (doesn’t specify the direction of an effect
E.g. Change ≠ 0
Why does a one-tailed test provide us with greater power?
One tailed test provides us with greater power because the critical value is smaller than the cutoff for a two-tailed test
Researchers tend to use a two tailed test in order to ______.
discriminate between a zero effect and a negative effect
How are one-tailed tests often misused?
One-tailed tests are often misused in a way which increases type 1 error rate.
If people incorrectly reject the null after there is the illusion of an effect at the wrong side of the curve for a directional hypothesis.
Effect Sizes
- A standardized measure of the size of an effect:
- Standardized =
comparable across
studies- Not (as) reliant on the sample size - Allows people to objectively evaluate the size of the observed effect
- Amount that two
populations do not
overlap
- Amount that two
E.g. cohen’s d, correlation
Types of effect sizes
The family of effect size measures has been categorized into TWO broad groups:
Measures of mean differences (e.g. Glass’s Delta, Cohen’s d)
Measures of strength of relations (e.g. r, R^2, eta squared)
Two groups of measurement of effect sizes
- Measures of Mean differences
- Measures of strength relations
Measures of mean differences examples
Glass’s Delta, Cohen’s d, Hedges g
Measures of strength of relation examples
r, R^2, eta squared
Measures of Mean Differences
Largely calculated in the same manner
E.g. d = mean1/mean2 / population SD
We don’t know the population SD,
Differ in how they estimate the population SD
Glass’s delta
- uses the SD using the control group because it is “untainted” by the treatment
- Strength of this metric depends on the size of the control group, the larger the control group, the more appropriate the SD as it estimates the population SD
Cohen’s d
- combines the SD of both groups
- Because cohen’s d pulls information from sd of both groups, when the sd’s across groups are very different, it makes more sense to use glass’s delta
Hedges g
- Multiply cohen’s d with some scaler
- Cohen’s d tends to overestimate the amplitude of effect in smaller samples
- Corrects for cohen’s d bias, so better to use in smaller samples (20 or less)
As we have less variability, effect size becomes _____, as SD is in denominator, even with the same degree of mean difference
bigger
Measures of Strength of Relations
- Effect sizes based on variance explained
- These effect sizes estimate the amount of the variance in an outcome variable that is explained or “accounted for” by the model/predictor variables
- E.g. include r, which is the correlation coefficient and r^2 which is the coefficient of determination
- On a range of ±1 for pos/neg correlation
- Tells us the size/strength of the association and the direction of the association of the two variables
Some useful guidelines for the magnitude of effect sizes
r = .1, d = .2 (small effect):
the effect explains 1% of the total variance
r = .3, d = .5 (medium effect):
the effect accounts for 9% of the total variance
r = .5, d = .8 (large effect):
Importance of Effect Size
- Estimates of anticipated ES can be used to project the sample size that would be adequate for detecting statistically significant results → power analysis
- They enable researchers to inform judgment about the practical significance of the study
- Because effect sizes are standardized measures of the size of mean differences or strength of relations, they are used to compare the results of different studies with one another and to be used in meta-analysis
- A qualitative approach would be a systematic review
Power Analysis
Estimates of anticipated ES can be used to project the sample size that would be adequate for detecting statistically significant results
______ enable researchers to inform judgment about the practical significance of the study
effect size
Because _______ are standardized measures of the size of mean differences or strength of relations, they are used to compare the results of different studies with one another and to be used in _______.
effect sizes, meta-analyses
Clinical Significance
Jacobson, Follette, and Revenstorf (1984)
- Clinically significant change conceptualized as return to normal functioning
- Clinical significance → the extent to which therapy moves someone outside the range of dysfunctional population or within the range of functional population
Significance testing does not tell us about _____.
effect size