Lecture 1: General Linear Model Flashcards

1
Q

What are the assumptions of a t-test?

A
  1. continuous DV
  2. SE only valid if variance is not different between groups (homoscedasticity)
  3. subjects must be independent, otherwise SE formula not valid (cannot be tested, thinking exercise)
  4. look up t-statistic under $H_0$ but only if DV is normally distributed in both groups
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why is there a translational problem between basic and applied research?

A
  1. communication and shared language → communication issues between scientists in basic & applied
  2. replicability of findings → replication problems with many psychological findings - stability is crucial in the translational process
  3. establishing Causality → establishing causality is difficult since much of research is focusing on correlations
  4. unclear external validity → often mentioned that non-clinical samples is not beneficial
  5. fat-handedness problem → fat-handedness of psychological treatments - treatments often affect multiple processes, they do not operate in isolation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How does the t-distribution change shape based on the df values?

A

The more df the narrower it becomes, the less df the wider it becomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is b0?

A

The intercept and the expected/ average score on the DV when the IV is 0 o

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Error

A

The difference between the observed and the expected score

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the assumptions for the GLM?

A
  1. DV continuous
  2. variance of residuals equal at all levels of x (homoscedasticity)
  3. data is independent
  4. residuals normally distributed
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the assumptions for the factorial ANOVA?

A
  1. DV continuous
  2. variances of DV equal in all groups
  3. data of all subjects is independent of each other
  4. DV is normally distributed in each group
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is b1?

A

The score on the dependent variable when the independent variable increases by 1 or the difference in averages of the different groups in the IV

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How does the GLM work?

A

It creates a line that best fits with the observed data, results found to be the same as for the linear regression as the the t-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does the t-test test?

A

Whether the differences in averages across the groups is significant. Formula is= (y group 1- y group2)/ se

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Standard error

A

√(sd²1/N1)+ (sd²2/N2)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What needs to be interpreted for a t-test?

A
  1. Look at the means to see the direction
  2. Check equal variances
  3. Look at the F values and p values to see significance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How would this be interpreted?

A

Self-esteem at posttest differed significantly between running and sitting
Self-esteem at posttest on average higher for running than for sitting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is y(i)?

A

The observed score for the dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What needs to be interpreted for a linear regression?

A

The unstandardized b coefficients and the value and p value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How could we report regression analysis (does not need to be APA on the exam)?

A
  • Intervention predicts self-esteem at posttest, F(1, 38) =
    4.22, p = .047.
  • On average, self-esteem at posttest is higher for the running group than for the sitting group (b1), t(38) = 2.06, p =
    .047.
17
Q

What is the GLM with two independent variables?

A

y(i) = b0 + b1x(i) + b2z(i) + b3x(i)z(i) + error(i)

18
Q

What is the interpretation of an interaction effect?

A

The effect of {intervention} differs between people across different groups

19
Q

What needs to be interpreted for a factorial ANOVA?

A

Any significant main effects and the interaction effect

20
Q

How can you interpret a plot for factorial ANOVA?

A

Average between groups= in difference between the mid point of the average lines
Interaction effect= the difference between the average lines at the starting point and end point

21
Q

What is the sum of squares?

A

Deciphers how well the model fits the data, uses N-k-1

22
Q

What is the total sum of squares?

A

The sum of squared differences between the observed and predicted values using the baseline model

23
Q

What is the model sum of squares?

A

SST- SSR

24
Q

What is R^2?

A

SSM/SST-> proportion of improvement with the predictive model compared to the baseline model

25
Q

How can influential cases be detected?

A

→ calculating the adjusted predicted value for a case (excluding a case and seeing if the predicted value is equally well predicted by both models)

→ calculating Cook’s distance (values larger than 1 are potentially problematic)

26
Q

What do B values explain?

A

how much the outcome changes if the predictor changes by one standard deviation

27
Q

What is the rationale for the t-test?

A
  • Standard error used to gauge variability, difference between means in the observed sample compared to the difference expected
  • The larger the difference compared to the standard error, the more likely that the means are different
28
Q

How to check for normality assumption?

A

P-P plots (probability-probability plots), which plot the cumulative probability of a variable against the cumulative probability of a distribution, can be used. Ideally, this is a straight diagonal line.

29
Q

How to check for homoscedasticity?

A

Through a plot which has standardized residuals on the x axis and standardized predicted values on the y axis-> if there is a funnel shape the assumption has not been met. Or through Levene’s test, if significant the assumption has not been met.

30
Q

How can violation of independence affect the conclusions drawn?

A

If the data are dependent (within groups), the test is far too liberal (results that are significant too quickly). In that case, you may conclude that the treatment group
scores are significantly better than the control group, whereas in reality this is not the case.

31
Q

What are clinical examples of t-test violations?

A
  • Continuity: when the outcome variable is nominal (e.g., recovered or not).
  • Equal variances (homoscedasticity): when in one group all people have about the same degree of complaints, while in the other group the spread is much
    greater.
  • Independence of data: when groups of clients have been treated by the same therapist (= nested data).
  • Normality of data: within the treatment group, half of the subjects improved while the other half did not. The distribution is then bimodal.
32
Q

Explain all aspects of the general linear model

A

y(i) = b0 + b1*x(i) + error(i), where:
y(i) = DV at posttest for subject i,
x(i) = IV where subject i belongs to (0=waiting list, 1=intervention)
b0 = intercept: average DV scores at posttest in the x=0 condition
b1 = difference in average DV scores at posttest between different IV groups
error(i) = difference between observed and expected score for DV scores at posttest for subject i.

33
Q

What is b3?

A

difference in average DV score between IV1 groups and between IV2 groups