Module 6 Flashcards

1
Q

Dummy coding

A

Most straightforward approach to including an independent variable with J categories in a linear model is to use dummy coding

Dummy coding - independent variable with J levels is broken down into J-1 separate binary dummy variables, each of which is coded to equal either 0 or 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Reference category

A

one of the J levels of the IV is chosen as the reference category - reference category is assigned a value of 0 on each of the dummy variables

control is usually the reference category

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what do we want our statistical model to do?

A

represent how variation in the dependent variable is a function of the 5 category independent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Slope parameter of dummy variable

A

Each dummy variable has its own slope parameter - difference between mean of reference category and mean of other category

B1d1+B2d2+B3d3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

B0 dummy variable

A

Population mean of the reference categorry

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Error term dummy variable represents what

A

Because not every participant in the reference category has the same value as the population mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Confidence intervals around each dummy variable represent what?

A

Because the slope coefficient estimate of the d1 variable is a point estimate of the difference between the population mean of the rhyming condition and the population mean of the counting condition, then the interval [-2.90, 2.70] captures the population mean difference with 95% confidence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How to calculate estimated error term

A

subtract predicted mean from actual value

= score - mean of that group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does i index

A

participant ID

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How to calculate SS residual

A

get residual for each participant (score - predicted), square it and add them all up

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How to calculate SS model

A

Predicted mean of the group - sample mean of the dependent variable (grand mean), squared, summed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is SS model

A

variability in the model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

R^2

A

= eta squared = 1-(SSredid/SStotal) = SSmodel/SStotal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

If null is true what should the linear slope parameters be equal to?

A

Eachother and 0
B1=B2=B3=0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

MS model

A

SS Model/df model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

MS Residual

A

SS residual/df residual

17
Q

F statistic

A

MS model/MS residual

18
Q

APA style one way anova

A

“The overall proportion of variance explained by the linear model, R2 = .45, was significant, F (4, 45) = 9.09, p < .001, indicating that the number of words recalled significantly varied across the five conditions representing different levels of depth of processing

19
Q

Significant result on ANOVA says what?

A

only indicates that at least one population mean is unlikely to be unequal to the
other population means.

20
Q

Planned comparisons

A

t-tests are valid if a researcher has explicitly and transparently planned at the beginning of the study to compare the mean of the reference category with the means of the other categories.

21
Q

Planned comparisons reporting

A

“Because the dummy variables in the linear model were defined a priori, the corresponding ttests represent planned comparisons. The rhyming mean (M = 6.90) did not significantly differ from the counting mean (M = 6.90), t (45) = 0.07, p = .94. But the adjective mean (M = 11.00) was significantly different from the counting mean, t (45) = 2.88, p = .006.”
Etc. for the t-tests for the remaining dummy variables

22
Q

ANOVA model assumptions

A
  1. Independent observations
  2. Normally distributed errors
  3. Homogeneity of variance
23
Q

How to assess assumption of normally distributed errors

A

examine the residuals from the estimated model

  • strong kurtosis would violate this error
24
Q

How to evaluate the homogeneity of variance assumption

A

Look at sample standard deviations

Dont want any to be like twice another one

but equal sample sizes may protect against unequal SDs