F13 Multilevel modeling I Flashcards

1
Q

What is a hierarchical model?

A

A model with a nested datastructure.

Multilevel models are also called hierarchical linear models (HLM), mixed-effects models, or random-effects models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What two kinds of data structures have multiple relevant levels? Draw them

A

(1) Nested data structures. One level is contained within another (e.g. students in universities).

(2) Non-nested data structures. Different levels make sense in themselves and are independent from each other (e.g. airports and treatments)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a key advantage of multilevel models compared to fixed effects

A

FE is power intensive and problematic with many groups and few observations pr. group - not many degrees of freedom + model is prone to outliers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does multilevel models account for regarding the effect estimate that standard linear regression doesn’t

A

Possible heterogeneity in effect size. The
assumption of homogeneous effect magnitudes is often problematic. Very plausible in the real world.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is random effects?

A

A multilevel model. Opposite of fixed effects. Not the best formulation as the randomness isn’t clear.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a random effects model?

A

If group-specific intercepts are modeled via a multilevel approach

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a random slope model?

A

If group-specific slopes are modeled via a multilevel approach

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a mixed or mixed effects model?

A

A model that combines both fixed effects and random effects.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the two primary components of a multilevel model?

A

Units (i) and groups (j).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What three combinations are possible with multilevel models and what is the regression line?

A

Varying:
Intercepts: y_i = α_(j[i]) + βx_i + ε_i

Slope: y_i = α + βx_(j[i]) + ε_i

Both: y_i = α_(j[i]) + βx_(j[i]) + ε_i

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the key difference between fixed and random effects?

A

We introduce the assumption that intercepts or slopes are normally distributed in random effects: α_j ~ Normal (μ_α , σ_α^2)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are hyperparameters?

A

Parameters that are not actually in the regression model but estimated ‘behind’ the model. Instead of estimating 100 unique intercepts we estimate two hyperparameters with random effects, which is way more efficient.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is an important assumption for random effects?

A

The coefficients from the multilevel estimate are normally distributed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Draw the intercepts from random effects, fixed effects and pooled regression.

A

Normal distribution, line with intercepts and one global intercept.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How is observations pooled in random effects, fixed effects and pooled regression?

A

RE: Partial pooling
FE: No pooling
Regression: Complete pooling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the simpel multilevel estimate? Explain parameters.

A

A weighted average between the specific average for group j and the overall average for all units.

(n_j/σ_y^2)y-bar_j + (1/σ_α^2)y-bar_all

divided by

(n_j/σ_y^2)+(1/σ_α^2)

(y_j): Average of group j
(y_all): Average of all units

σ_y^2: Unit specific error. The error term / residual. Unexplained variance Var(ε)
σ_α^2: Variance in intercept (hyperparameter)

17
Q

What happens with the multilevel estimate when there are many/few observations per group?

A

Many: The average of the group j will weigh more (outliers are less influential) relative to the overall average.

Few: We rely heavily on the overall average - the distinction between FE and RE is very clear here.

18
Q

What happens with the multilevel estimate when the residual of the model is big/small?

A

Big: There is a lot of variation that the model cannot explain (often because of outliers) - we would rather rely on the overall average and thus a larger set of observations.

Small: The model is very precise - we would rather rely on the group average

19
Q

What happens with the multilevel estimate when the variance of the random intercepts are big/small?

A

Big: Groups are not very comparable - we are less confident in weighting in the overall average.

Small: Groups are very similar and a lot of the intercepts are clustered around the mean. We can rely more on the overall average.

20
Q

When does the group j average weigh in more in the multilevel estimate?

A

With a lot of observations in the group
When model is precise (low residual)
When the groups are different (high variance in group intercept)

21
Q

When does the overall average weigh in more in the multilevel estimate?

A

With few observations in the groups
When a lot of variation in the model is unexplained (high residual)
When the groups are similar (low variance in group intercept)

22
Q

What is the variance ratio?

A

The relation between variation between and within groups

(σ_α^2) - between group / (σ_y^2 ) - within group

The higher the ratio the more relevant the model. If the ratio is only 0,04 then it would take more than 25 observations (1/0,04) to be pooled closer to the no-pooling estimate.

23
Q

How can the variance ratio be useful?

A

This ratio helps to assess how much of the variation in the outcome is explained by the grouping structure versus individual-level variation

24
Q

What is the intraclass correlation?

A

Bound between 0-1. 0=no between group variation. All groups have the same intercept. 1=no residual. Within a group units doesn’t differ. Everything is explained by the group

σ_α^2 / (σ_α^2 + σ_y^2)

25
Q

What are three advantages of multilevel modeling?

A

If I have very few observations within the group, I’m prone to mistakes with fixed effects. I multilevel I can rely on other groups to make the estimate (not just the specific group).

We need less statistical power because less power to groups specific intercepts and more pow-er to estimate the coefficient of interest.

More realistic in the real-world. There are probably differences between groups, but they probably vary around a specific mean. A lot of things in this world is normally distributed. Some units will gain more from a treatment and some less.

26
Q

When does multilevel modeling make sense?

A

A lot of groups with few observations.

27
Q

When does multilevel modeling NOT make sense?

A

Few groups with many observations

28
Q

Does multilevel modeling make sense with a lot of observations and groups?

A

Jan: Edge case. Apply both and see