Panel Data Flashcards
Clustered standard errors are presented for the Pooled OLS results in Table #. Is there a need for clustering? Please motivate your answer.
The presence of random effects causes the error term to be autocorrelated. By clustering the standard errors, this has been taken into account.
What may cause the differences between the Pooled OLS and Random Effects results in table #.
The differences might be due to the difference in the exogenetiy assumption of each model.
The key assumption of the Random Effects estimator is strict exogeneity [𝐸(𝑢t|𝑋)= 0], while the key assumption of the OLS estimator is contemporaneous exogeneity [𝐸(𝑢t|𝑋t)= 0].
By taking first differences, the individual effect ai…
…is removed from the model.
…exogeneity is needed (no lagged dependent variable, no feedback effects) for consistency (unbiasedness) of the first-difference estimator.
strict exogeneity
Why would one use a First-Difference (FD) or a Fixed Effects (FE) estimator instead of a Random Effects estimator when estimating the panel equation? Based on the estimation results, was there a need for using a FD or FE estimator?
The Random Effects RE model assumes that the individual effect (ai) is uncorrelated with the explanatory variables.
The Fixed Effects FE model/estimator and the FD estimator allow for correlations between the individual effect (ai) and the explanatory variables.
Yes, the effects are (in absolute) more significant when using FE & FD estimators than a RE estimator. However, we do not know for sure and need to test FE vs RE using a Hausman test formally.
Claim: random effects yield significant t- statistics more easily than fixed effects.
TRUE
In the Hausman test, If H0 is true then…
…both estimators are unbiased (consistent) However, random effects yield smaller standard errors, so that it is preferred to fixed effects.
Why might a simple regression estimate be positive while the first differences and fixed effects estimators negative?
They differ because the first difference and fixed effects remove the effect of a fixed part of the estimator, ai, and they are negative because the ai was positively correlated with x.
Stata: how do you use first differences versus fixed effects regressions?
A
For first differences just set up for panel data, use “xtreg” and then “d.” in front of every variable. For fixed effects set up for panel data, use “xtreg”, and end with “, fe”
Why might you want to use a lagged version of a variable of interest in a fixed effects model?
You would want to include a lagged version in case the effect takes some time to show up.
The fixed effects model is…
…a method of estimating panel data equations that works by allowing each cross-sectional unit to have a different intercept.
When comparing estimates from an OLS regression and a fixed effects regression, why might the OLS estimate be positively biased?
There may be a correlation between unobserved effects and the explanatory variables in each time period, causing positive omitted variable bias. The fixed effects estimator removes such fixed effects from the error term, removing the bias.
What do the estimators of Fix Effects and First Differences have in common? And what are the consequences of the similarities
They allow for correlation between 𝑎, and the explanatory variables E(𝑎,| x1it … xkiT) ≠0
The assumption of strict exogeneity means that the regression equation contains no feedback mechanism and no lag of dependent variables.
The consequence of the similarities is that the parameter estimates of both estimators should be about the same (if the assumption of strict exogeneity is true)
In the Hausman test, If H0 is not rejected then…
RE is preferred over FE because RE yields smaller standard errors and corrects for the presence of autocorrelation in the error term.
If the RE estimator’s variance is less than the variance of the within estimator (FE), then RE yields significant t-statistics easier than FE. (therefore smaller s.e)
What can explain the difference between the RE and FX in table..
In the RE model, it is assumed that the ai and RHS are uncorrelated whereas in the FE they are allowed to be corrllated.
> reg D.HoursWork DL.HoursWork D.Wage DL.Wage Linear regression, cluster(nr);
Why might the First Difference estimator of the regression result table be inconsistent?
The model includes a lagged dependent variable so the strict exogeneity assumption is violated, leading to inconsistent (biased) estimates.
What does it mean that strict exogeneity is needed for consistency (unbiasedness) of the first- difference estimator?
no lagged dependent variable
no feedback effects
Claim: In the 1st Diff and Fixes Effects estimator we allow for correlation between the explanatory variable ai and the other explanatory variables x1it ,…xkit .
TRUE
Claim: contemporaneous exogeneity is sufficient enoight to prove consistency of the first-difference estimator bhat (fdif)
FALSE
contemporaneous exogeneity is too weak to prove consistency of the first-difference estimator.
Claim: Pooled OLS allows for correlation between the individual-specific effects ai and the RHS variables XkiT.
FALSE!
Pooled OLS is an estimator for a zero correlation between ai and the explanatory variables.
Which estimator models assume that the individual specific effect ai and the explanatory variables are uncorreleted?
Random Effects
Pooled OLS
Claim: In the Pooled OLS uit is uncorrelated with all of the explanatory variables and
ai.
TRUE