Intro to Multi-level Data Flashcards

Question 1

Q

What is the difference between level-1 & level-2 variables?

Answer

A

Level-1 variables vary at level 1 (e.g. students in the same school have different SES levels).

Level-2 variables cannot vary at level 1 (e.g. every student in a school has the same student-to-teacher ratio, school type, school size, etc.).

Question 2

Q

How are multi-level and panel datasets similarly structured?

Answer

A

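Both are nested (hierarchical): lower-level units are grouped within higher-level units. Examples: PISA, SOEP, etc.
Multi-level data: e.g. students nested in schools
Longitudinal data in general: timepoints nested in individuals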

Question 3

Q

Why do we need special methods for multi-level data?

1) Correct statistical inference

Answer

A

"Dependence as nuisance" (Snijders & Bosker): basic assumptions of regression/inferential statistics are violated. We no longer have:

Independent observations
Independent error terms
Homoscedastic errors
Normal distribution of errors

Examples

Exam scores are more similar within classes
Political attitudes cluster in regions
Measurements of body weight are correlated over time

Observations in multi-level/longitudinal data are highly correlated within clusters.
This means we cannot evaluate statistical uncertainty appropriately, because our standard errors become too small. The larger the sample, the smaller the standard error, but our sample is artificially inflated by non-independent observations. The denominator of the standard error is therefore larger than it should be, which makes the SE too small.
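
The shrinking of the standard error can be illustrated with a small simulation. This is only a sketch under assumed values: the number of clusters, the cluster size and both standard deviations are made up for illustration and do not come from the cards. It compares the naive SE of the mean (treating all observations as independent) with the actual spread of the sample mean across repeated clustered samples.

```python
import random
import statistics


def simulate_clustered_sample(rng, n_clusters=20, cluster_size=25,
                              sd_between=1.0, sd_within=1.0):
    """Draw one sample where each value = cluster effect + individual error."""
    values = []
    for _ in range(n_clusters):
        # Shared by every member of the cluster (e.g. a school effect).
        cluster_effect = rng.gauss(0.0, sd_between)
        for _ in range(cluster_size):
            values.append(cluster_effect + rng.gauss(0.0, sd_within))
    return values


def naive_se(values):
    """SE of the mean that (wrongly) treats all observations as independent."""
    return statistics.stdev(values) / len(values) ** 0.5


rng = random.Random(42)  # fixed seed so the run is reproducible
samples = [simulate_clustered_sample(rng) for _ in range(500)]

# Average of the naive SEs computed within each sample.
mean_naive_se = statistics.mean(naive_se(s) for s in samples)
# Actual variability of the sample mean across repeated samples.
empirical_se = statistics.stdev(statistics.mean(s) for s in samples)

print(f"average naive SE:          {mean_naive_se:.3f}")
print(f"empirical SD of the means: {empirical_se:.3f}")
```

With clustering of this strength, the naive SE comes out clearly smaller than the real sampling variability of the mean, which is the problem described above.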

Question 4

Q

Why do we need special methods for multi-level data?

2) Substantive questions

Answer

A

Dependencies/correlations within clusters as a subject matter (e.g., how much of variance in grades can we attribute to differences between schools?)
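
One common way to express "how much of the variance lies between schools" is the intraclass correlation. The notation below is an assumption, not taken from the cards: y_ij is the grade of student i in school j, u_j is the school-level deviation with variance σ²_u, and e_ij is the student-level residual with variance σ²_e, in a model without predictors.

```latex
y_{ij} = \gamma_{00} + u_j + e_{ij}, \qquad
\operatorname{Var}(y_{ij}) = \sigma_u^2 + \sigma_e^2, \qquad
\rho = \frac{\sigma_u^2}{\sigma_u^2 + \sigma_e^2}
```

Here ρ is the share of the total variance attributable to differences between schools.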

Question 5

Q

Why do we need special methods for multi-level data?

3) Dealing with unobserved heterogeneity

Answer

A

Due to the hierarchical data structure, we have unobserved heterogeneity at different levels, which can affect the relationships between variables at those levels.
By capturing and addressing this heterogeneity through a random effect (RE), researchers gain a deeper understanding of how group-level factors influence individual outcomes and obtain more accurate estimates.
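
As a sketch of what "through a RE" can look like, a random-intercept model adds a cluster-specific term to an ordinary regression. The symbols are assumptions chosen for illustration: x_ij is a level-1 predictor, u_j is the random effect of cluster j, and e_ij is the level-1 error.

```latex
y_{ij} = \beta_0 + \beta_1 x_{ij} + u_j + e_{ij}, \qquad
u_j \sim \mathcal{N}(0, \sigma_u^2), \quad
e_{ij} \sim \mathcal{N}(0, \sigma_e^2)
```

The term u_j absorbs unobserved factors shared by all units in cluster j, so they no longer sit in a single undifferentiated error term.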

Question 6

Q

Possible level-2 variables influencing math performance of students?

Question 7

Q

How can the meaning of a level-1 variable change when it is aggregated?

Male -> Math performance

Answer

A

Male at level 1 (individual student): pressure from parents and teachers, socialization → positive influence on math performance
Male at level 2 (% of boys in class): more disturbance in class → negative influence on math performance
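
The aggregation step itself can be sketched in a few lines. The records, class labels and the function name `share_male_by_class` are hypothetical and only illustrate how a level-1 indicator (male = 1/0) turns into a level-2 variable (share of boys per class) that is constant within each class.

```python
from collections import defaultdict

# Hypothetical student-level records: (class_id, male indicator).
students = [
    ("A", 1), ("A", 1), ("A", 0), ("A", 1),
    ("B", 0), ("B", 0), ("B", 1), ("B", 0),
]


def share_male_by_class(records):
    """Aggregate the level-1 indicator to a level-2 share per class."""
    totals = defaultdict(int)
    counts = defaultdict(int)
    for class_id, male in records:
        totals[class_id] += male
        counts[class_id] += 1
    return {class_id: totals[class_id] / counts[class_id] for class_id in counts}


share_male = share_male_by_class(students)

# Attach the level-2 value to every student: it is identical within a class.
merged = [(class_id, male, share_male[class_id]) for class_id, male in students]

for row in merged:
    print(row)
```

Each student now carries two variables: their own gender (varies within the class) and the class composition (does not vary within the class), which can have different effects on math performance.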

Question 8

Q

What could be level-1 and level-2 confounders of motivation -> math performance?

Question 9

Q

How can we split variance?

Question 10

Q

How do we model the mean?

Question 11

Q

What is in the residuals/error term?

Model for the mean

Answer

A

Unobserved heterogeneity/omitted variables: the error term contains all the factors that influence the outcome but are not in the model.
These factors are also assumed to be random.