Lecture 3 Flashcards
statistical inference:
what represents type A error?
what represents type B error?
type A/type 1 error: false positive
> reject h0 while its true
type B/type 2 error: false negative
> retain h0 while its wrong
type A related to?
type B related to?
type A: related to h0 being true
type B: related to h1 being true
how to describe the power of a test?
power: the probability of rejecting h0 when its not true
> probability of discovering a true effect
what are 7 factors that influence power?
power depends on:
- size of the effect
> bigger effect, more power
- variation in population
> less variation, more power
- sample size
> bigger sample size, more power
- design type
> within subjects more power than between
- significance level
> greater alpha, more power (not recommended)
- one vs two sided testing
> one sided more power
- type of test: parametric vs nonparametric
what factors contribute to effect size?
what factors do not?
effect size: the intrinsic size of an effect depends on
- the measured effect (m1-m2)
- the standard deviation
it does not depend on sample size!
N never contributes to measures of effect size
cohen’s d: what is small/medium/large effect?
cohen’s d
small: 0.2
medium: 0.5
large: 0.8
why would you use non parametric tests
parametric tests make assumptions
> observations interval/ratio level
> normally distributed populations
> equal population variance for different groups
>>> if those are violated, use non parametric tests
what are 2 general ideas of non parametric testing?
non parametric testing
- use of ordinal/nominal scales of measurement
- no assumptions over underlying distributions
how to apply non parametric test to interval/ratio level data?
transform data to ordinal scale by rank ordening
how to calculate mann-whitney U
mann-whitney u
- rank order scores of both groups in one dimension
- count for each observation how many higher observations there are in the other group
> sum A / sum B
- check whether n1 x n2 = A + B
- U = smallest of A / B
- smaller u, more evidence for difference
how to calculate a wilcoxon matched pairs signed rank test
wilcoxon matched pairs signed rank
- rank order differences, ignoring sign
- W is the sum of the rank corresponding to the least occurig sign in the difference column
- smaller w > greater evidence for difference
chi-squared test: what general procedure?
chi square:
> nominal data: frequencies
> transform percentages to “raw frequencies” (numbers)
> compare observed frequencies to expected frequencies
chi square:
how to interpret chi X²
how to determine degrees of freedom
X²
> if 0, then there is no difference between observed and expected >>> H0 true
> greater X², greater evidence for H1
degrees of freedom:
> df = (An-1)(Bn-1)
> An and Bn are the number of categories for A and B
3 advantages of distribution free tests
distribution free tests
- are simple
- do not assume normality / equal variances
- can be applied regardless of the scale of measurement
3 disadvantages of non parametric tests
distribution free tests
- involve loss of information due to transformation to ordinal scale
- have less power than corresponding parametric test.
- advanced statistical procedures require parametricitiy