10 Panel data Flashcards
1
Q
advantages of panel data
A
- better control over omitted variables
- more observations
- many research questions typically involve a time component
2
Q
unobserved heterogeneity embodied in Zi
A
…
3
Q
panel data with more than 2 time periods
A
Yit =β0 + β1Xit + β2Zi + uit
Equation can be interpreted as model with n specific intercepts (one for each state)
Yit =β1Xit + αi + uit, with αi =β0 + β2Zi
αi , i = 1, …, n are called entity fixed effects
αi models impact of omitted time-invariant variables on Yit
Use dummy variables to quantify αi
4
Q
entity demeaned variable
A
̃(Yit) = Yit − avg(Yi)
5
Q
LSDV
A
Least Squares Dummy Variables
6
Q
fixed effects regression model
A
Yit = β1Xit + αi + λt + uit
statistical assumptions are:
- E(uit|Xi1,…,XiT,αi,λt)=0
- (Xi 1 , …, XiT , Yi 1 , …, YiT ) are i.i.d. over the cross-section - large outliers are unlikely
- no perfect multicollinearity
- cov(uit, uis | Xi1, .., Xit, αi, λt) = 0 for t != s
7
Q
(panel data regression)
What if error terms are correlated over time?
A
HAC-standard errors (clustered standard errors)
- robust to arbitrary correlation within clusters (entities)
- robust to heteroskedasticity
- assume no correlation across entities