Lecture 5 Flashcards

1
Q

What are the null and research hypotheses for ANOVA in notation?

A

H0: μ1 = μ2 = μ3
H1: μ1 =/= μ2 =/= μ3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the null and research hypotheses for ANOVA in words?

A

H0: No difference between the means.
H1: There is at least one mean difference in all of these means/between the groups.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Which hypothesis do we test?

A

We always test the null, always. No exceptions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How do we make the decision to reject or fail to reject H0?

A

Distribution of F statistic under H0. For continuous variables, area under the curve = 1. Because we turned it into a percent, then into probability.
Fcritical separates rejection region from the rest of the curve. If F falls in the rejection region, then we reject the null.
Rejection region = alpha = .05 (unless told otherwise).
If F is less than .05, we reject H0. Obtained value, Fobtained, or Fobserved, comes from our data.
Fobtained in summary table (from data), Fcrit comes from F table.
Compare Fobtained to Fcrit, if Fobtained is greater than Fcrit we reject H0.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What do we say if Fobtained is less than .05?

A

We say that the difference between means is statistically significant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How is significance graded?

A

There is no gradation of significance, it is only either above or below Fcrit (i.e. either significant or not significant).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When writing a conclusion about rejecting H0, what language must be included?

A

Words such as “probably”, “it is likely”, or “it seems that”.
This is because the area under the rejection region is a probability, so will be wrong 5% of the time. So we will be rejecting H0 when it’s actually true 5% of the time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

The conclusions from any stat test may be in error. True or false

A

True, we’ll never know for sure if the results from any stat test are in error or not, but it is always possible that they are in error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the two possible errors that could be made when making a conclusion from a statistical test?

A

Type I or Type II error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a type I error?

A
  • Alpha
  • By convention .05
    Means when rejecting H0 when H0 is true.
    If alpha = .05, F5 exceeding Fcrit will occur 5% of the time due to chance alone.
    So 5% of the time when we accept H1, we’ll be wrong.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Type II error?

A
  • Beta
  • “accepting” H0, when H0 is false.
  • (We actually don’t ever “accept” it, we fail to reject it)
  • We can never know the value of Beta for sure, because we need to know parameters to determine it.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Describe a 2X2 contiingency table

A

Truth: either H0 is true, or H0 is false
Top:
H0 true | H0 false

Sides:
“Accept” H0 decision
|
Reject H0

For box H0 true + “accept” H0 decision: 1 - alpha
For box H0 false + “accept” H0 decision: Type II error - Beta
H0 true + reject H0: Type I error - alpha
H0 false + reject H0: 1 - Beta

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is power?

A

Correct rejection off H0

Acceptable power values range from 0.7 to 0.8

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Give a close up of power, why is having a high power important?

A

If spending lots of time and money on a study, want a 70-80 percent chance of finding a significant difference if one exists.
- need a measure of effect size.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Recall Cohen’s d

A

d = population effect size
d = (μ1 - μ2)/σ
dcarrot = (Xbar1 - Xbar 2)/σcarrot
Therefore d = distance between means in standard deviation units.
so dcarrot = .4 means that group means differed by .4 of a standard deviation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does power equal?

A
Power = degree of non overlap, how many standard deviations groups differ.
d = .2 = small
d = .5 = medium
d = .8 = large
d = 1.5 = very large
d = 2.2 = huge (international superstar level psychologist)
17
Q

What is squiggly fcarrot?

A

effect size for ANOVA

18
Q

squiggly fcarrot =

A

√(K-1)F/N or √(a-1)F/N where K and a stand for number of groups.

19
Q

squiggly fcarrot ranges

A

~.1 = small
~ .25 = medium
~ .4 = large

20
Q

What are the two ways that we use power?

A

Post hoc (after the fact) and apriori.

21
Q

Describe post hoc determinations of power

A
  • use effect size, n, and power table to calculate power.
  • eg:
    n1 = 15
    n2 = 13
    n 3 = 20
    n4 = 17
    F = 1.4
    N = 65
    alpha .05
    average n = 16.25 –> 16
    U = K-1 = 3

squiggly fcarrot = √(K-1)F/N = √(4-1)1.4/65 = √(3)0.2153 = √0.064615384 = 0.254 (medium) - this is the effect size.
Power: in table C.3 = .34!
The power was inadequate to detect a medium effect size - need to replicate study with a more adequate N!!

22
Q

Describe apriori power tests:

A
  • Tells us how many participants per group are needed to have adequate power.
  • Slight problem: need an expected effect size to determine n for a specified power at some alpha level.
23
Q

How do we overcome the problem of apriori power tests?

A
  • Past research –> i.e. average of squiggly fcarrot’s from similar studies –> similar treatment, treatment duration, type of subjects, instruments used to measure DV, etc. Better to have some estimate than no estimate.
24
Q

How do you find an apriori estimation of power?

A
estimated squiggly fcarrot = .25
power = .07
alpha = .05
u = 3
table C.3 - we need 36 participants per group are needed.
25
Q

What are four ways to improve power?

A
  1. Adopt a more lenient alpha (e.g. alpha = .10)
  2. Use a one tailed test when literature supports a directional hypothesis
  3. Decrease within group variability
    - sample selection (homogeneous group)
    - repeated measures designs
  4. Have equal n
26
Q

What is the harmonic mean? When is it used?

A

With unequal n, rather than use the arithmatic mean of the sample sizes, we use the harmonic mean
- General form of the equation:
Harmonic mean = # of treatment groups/Σ(1/sample size)

So for two samples:
nh = 2/1/n1+1/n2