Endogeneity and Multilevel Analysis Flashcards

1
Q

Endogeneity

A

Endogeneity refers to a situation where an explanatory variable is correlated with the error term in a regression model. This correlation can lead to biased and inconsistent estimates of the coefficients, making it difficult to draw valid inferences about causal relationships.

Endogeneity can arise from several sources, including omitted variable , measurement error, simultaneity, reverse causality, common-method bias and single-source bias.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Omitted variables in endogeneity

A

Uncontrolled confounding variable that is correlated with both the IV and with the error term in the regression formula / the DV

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Simultaneity in endogeneity

A

This occurs when the dependent variable also influences the independent variable. In such cases, it is unclear which variable is the cause, and which is the effect.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Reverse causality in endogeneity

A

The dependent variable effects the independent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Measurement error in endogeneity

A

When the independent variable is measured with error, it can lead to a correlation between the IV and the error term. This is particularly problematic when the measurement error is systematic.

Example: If income is self-reported and individuals underreport their income, this measurement error can bias the estimated effect of income on consumption.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Common-method bias in endogeneity

A

Variance is attributable to the measurement method (e.g., survey with 5-point Likert scales) rather than to the constructs the
measures are assumed to represent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Single-source bias in endogeneity

A

Variance is attributable to the IV and DV being collected from a single source (e.g., self-rated motivation and performance).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Confounding variables

A

A confounding variable is an extraneous variable that is related to both the independent variable and the dependent variable, potentially leading to a spurious association between them. It can create a false impression of a relationship or mask a true relationship.

Example: Age confounding exercise and weight loss.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Multilevel analysis

A

Multilevel analysis, also known as hierarchical linear modelling or mixed-effect modelling, is used when the variable of interest has explanatory factors on more than one level.

This type of analysis is particularly useful when dealing with nested data structures—such as students within classrooms, patients within hospitals, or repeated measures within individuals—where observations are not independent

In multilevel models, data is structured in levels:

Level 1: Individual-level variables (e.g., student characteristics).
Level 2: Group-level variables (e.g., classroom characteristics).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Requirements for multilevel analysis

A
  • Sufficient observations (>15) on the higher level (“level 2”) of independent
  • variables.
  • The dependent variable resides on the lower level (“level 1”). [exceptions
  • possible!
  • Nested data structure
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Intraclass correlation coefficient (ICC)

A

This statistic indicates the proportion of total variance that is attributable to the group level. An ICC close to 1 suggests that most variance is between groups, while an ICC close to 0 suggests that most variance is within groups.

Interpretation: If ICC = 0.25, it means that 25% of the total variance in the outcome is due to differences between groups. Typical cutoff value at ICC>0.1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Random intercept / fixed slope models

A

This is what we normally assume.

These models allow the intercept to vary across groups but assume that the effect of predictors is consistent across groups.

Interpretation: The average outcome can differ across groups, but the relationship between predictors and outcome is the same.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Random slope / fixed intercept models

A

These models allow both the intercept and the slope of a predictor to vary across groups.

Interpretation: The relationship between a predictor and the outcome can differ across groups, reflecting more complex interactions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly