Tutorial 5 - Panel data, Pooled OLS, Random-Effects-Estimator, Fixed-Effects-Estimator Flashcards

1
Q

What is panel data?

A

panel data or longitudinal data are multi-dimensional data involving measurements over time. Panel data contain observations of multiple phenomena obtained over multiple time periods for the same firms or individuals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How does a Linear Panel Data Model look like?

A
  • i = 1, .., N cross-sectional dimension (e.g. persons, firms, countries)
  • t = 1, …, T time-series dimension (e.g. years)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How does a stacked model for all NT observations look like?

A
  • i = 1, .., N cross-sectional dimension (e.g. persons, firms, countries)
  • t = 1, …, T time-series dimension (e.g. years)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does Pooled OLS estimator mean and how is it defined?

A

pooling all NT observations and applying OLS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Which condition needs to be met for the POLS to be consistent?

A

POLS is only consistent under the assumption of weak exogeneity of the regressors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the variance of the POLS error term?

A

NT × NT matrix:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What may the error variance matrix Ω of POLS contain and what not?

A

it may contain:

  • heteroscedasticity, i.e. different variances of error terms across individuals
  • autocorrelation of error terms within individuals over time

BUT:

  • Assumption: no correlation of error terms across individuals
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does the Breusch-Godfrey-Test test?

A

Test for serial correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How does a model with serial correlation look like?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the steps for applying the Breusch-Godfrey test?

A
  1. Estimate an (P)OLS regression to estimate the model
  2. Using the estimated residuals ûᵢ as an estimate for û, estimate following model (below)
  3. Test (n − p)R² against critical value from X²ₚ (p needs to be justified, e.g. how many observations per person (cluster))
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is μᵢ in a model with random effects?

A

individual-specific, time-constant unobserved effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the assumptions for Random Effects?

A

see (1), (2), (3) below

with (3): the regressors are strictly exogenous, i.e. mean independence of composite error term and regressors of all (!) time periods. Assumption is violated if the individual-specific effect μᵢ is correlated with xᵢₜ!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What effect does μᵢ have on the composite error (in random effects models)?

A

Cov (ϵᵢₛ, ϵᵢₜ) ≠ 0

Note: RE explicitly models the serial correlation in a GLS framework

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How does the variance between two individual observations look like in a model with random effects?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Show that there is serial correlation of error terms within individuals over time in a model with random effects.

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Is there autocorrelation across individuals in a model with random effects?

A

No autocorrelation across individuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How does the Var(ϵ) look like in a model with random effects?

A
  • Var(ϵ) = Ω, a NT × NT matrix.
  • However, under the RE assumptions, it only includes two distinct parameters: σμ² and σₑ².
  • In practice, Ω has to be consistently estimated -> ^Ω
18
Q

What is the random effects estimator?

A
19
Q

What are problems both with POLS and RE that Fixed Effects and First Differences can fix?

A

POLS and RE are inconsistent if the individual-specific effect μᵢ is correlated with xᵢₜ (example: individual-specific ability could be correlated with education, or with joining a union)

20
Q

How do Fixed Effects and First Differences handle the case that the individual-specific effect μi is correlated with xit ?

A

Fixed Effects and First Differences allow for this correlation by using only variation in xᵢₜ within individuals over time -> μᵢ cancels out -> controls for time-constant selection bias

21
Q

How can you find an expression for fixed effects?

A

Fixed Effects = POLS on time-demeaned data:

22
Q

How can you find an expression for first differences?

A

First Differences = POLS on first-differenced data:

23
Q

How would a fixed effects model be different if estimated on the data of the model shown below?

A
  • variables educ and black drop out because they don’t change over time for a given individual
  • FE is the same as a “Least Squares Dummy Variable Estimator” - here: POLS with a dummy for each individual
  • Time-constant variables like educ drop out, but we can estimate the interaction of educ with time dummies -> i.e., we can’t estimate the level of the return to education, but we can estimate whether the return changed over time
24
Q

What do the different terms in the model mean?

A

index ᵢₜ: not only identify the person, but also the time of the observation
λₜ stands for control variables (one for each year, except the base year) = like a dummy for each year
μᵢ = 545 values, one for each observation

25
Q

Which four characteristics/assumptions describe the multiple linear regression mode?

A
  • Linearity:
  • Independence:
  • Strict Exogeneity:
  • Error Variance:
26
Q

What is linearity in this model?

A

Linearity: The model is linear in parameters α,β,γ, effect ci and error uit

27
Q

What is independence in this model?

A

Independence: Xi, zi, yi are i.i.d. = independent and identically distributed. The observations are independent across individuals but not necessarily across time.

28
Q

What is exogeneity in this model?

A

Strict Exogeneity: The idiosyncratic error term uit is assumed uncorrelated with the explanatory variables of all past, current and future time periods of thesame individual. This is a strong assumption which e.g. rules out lagged dependent variables.

29
Q

What is assumed about error variance in this model?

A
30
Q

What is ci in the random effects model?

A

In the random effects model, the individual-specific effect is a random variable that is uncorrelated with the explanatory variables

31
Q

Which three additional assumptions make this a random effects model?

A
  • Unrelated effects
  • Effect variance
  • Identifiability
32
Q

What is the assumption of related effects in the fixed effects model?

A

That the assumption from the random effects does not hold:

E [ci | Xi, zi] ≠ 0

assumes that the individual-specific effect is a random variable that is correlated with the explanatory variables of all past, current and future time periods of the same individual

33
Q

What is the assumption of unrelated effects in the random effects model?

A

E [ci | Xi, zi] = 0

assumes that the individual-specific effect is a random variable that is uncorrelated with the explanatory variables of all past, current and future time periods of the same individual

34
Q

What is the assumption of effect variance in the fixed effects model?

A

absence of assumption of constant variance of the individual specific effect

35
Q

What is the assumption of effect variance in the random effects model?

A

assumes constant variance of the individual specific effect

36
Q

What is ci in the fixed effects model?

A

In the fixed effects model, the individual-specific effect is a random variable that is allowed to be correlated with the explanatory variables

37
Q

Which three additional assumptions make this a fixed effects model?

A
  • Related effects
  • Effect variance
  • Identifiability
38
Q

What happens if this random effects model is estimated by Pooled OLS?

A
  • The pooled OLS estimator of α, β and γ is unbiased
  • It is consistent and approximately normally distributed
  • However, the pooled OLS estimator is not efficient
  • The usual standard errors of the pooled OLS estimator are incorrect -> Correct standard errors can be estimated with the cluster-robust covariance estimator treating each individual as a cluster
39
Q

What happens if this fixed effects model is estimated by Pooled OLS?

A

The pooled OLS estimators of α, β and γ are biased and inconsistent, because the variable ci is omitted and potentially correlated with the other regressors

40
Q

What is the first difference etimator?

A
  • The first-difference (FD) estimator is an approach used to address the problem of omitted variables with panel data.
  • The estimator is obtained by running a pooled OLS estimation for a regression of Δ yit on Δxit