term 2- regression with panel data- revise Flashcards

1
Q

what is a panel dataset?

A

a panel dataset contains observations on multiple entities where each entity is observed at two or more points

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is a balanced panel?

A

no missing observations; that is, all observations are observed for all entities and all time periods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is an unbalanced panel?

A

some observations are missing; that is some observations are not observed for some entities and time periods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what can we control for with panel data?

A

we can control for factors that:
vary across entities but do not vary over time
could cause committed variable bias if they are committed
are unobserved or unmeasured - and therefore cannot be included in the regression using multiple regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

how does panel data control for ommitted variable bias?

A

if an ommitted varibale does not change over time, then any changes in Y over time cannot be caused by the ommitted variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what if there is more than two time periods?

A

if there are more than two time periods you can rewrite the regression in two ways:
1) “n-1 regressor” regression model
2) “ fixed effects” regression model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

express the fixed effects model in n-1 binary regressor form?

A

Y_it = B0 +B1X_it + s_2 D_2i +… + s_n D_ni + u_it
where D_2i = 1 for i=2 or 0 otherwise

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

express the fixed effects model in the fixed effects form?

A

Y_it =a_i +B1 X_it +u_it where a_i is called a state fixed effect and is the constant effect of being in state i

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

when is the n -1 binary regressors OLS regression pratical?

A

it is only pratical when n isnt too big

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

how do you do n-1 binary regressors OLS regresiion?

A

first create the binary vairable D2i,…, Dni
then estimate 1 by OLS
inference (hypothesis tests, confidence intervals) is as usual (using heteroskedasticity robust standard errors)
this is impractical when n is large ie n=1000

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

how do you complete entity demeaned OLS regression?

A

first construct the entity demeaned variables Y~_it and X~_it
then estimate 2 by regressing Y~_it on X~_it using OLS
(it is like the changes approach but instead Y_it deviated from the state average instead of Y_i1. )

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

give examples of an ommitted variable which might vary over time but not across states?

A

safer cars such as airbags; changes in national law
these produce intercepts that change over time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what are the two formulations of regression with time fixed effects?

A

T-1 binary regressor formulation
time effects formulation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is the T-1 binary regressor formulation?

A

Y_it =B0 +B1X_it+ 𝛿2B_2t+….+𝛿_TB_2t+u_it
where B2t= 1 when t=2 ,0 otherwise, etc for B3 up to BT

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is the time effects formulation

A

Y_it=B0 +B1X_it + 𝜆_t+u_it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

under the anel data version of the least squares assumtion, is the ordinary least squares fixed effects estimator B1 normally distributed?

17
Q

what are the fixed effects regression assumptions?

A

1) E(u_it|X_i1,….,X_iT,a_i) =0
2) (X_i1,…,X_iT,u_i1,…,u_iT), i =1,..,n are iid draws from their joint distribution
3) large outliers are unlikely (X_iT,u_iT) have finite fourth moments
4) there is no perfect multicollinearity (multiple X’s)

18
Q

when is the assumption 2 of the fixed effects regression assumptions satisfied?

A

(Xi1,…,XiT,ui1,…,uiT), i =1,…,n, are i.i.d.draws from their joint distribution.
it is satisfied when entities are randomly sampled from their populations from simple random sampling

19
Q

what does the assumption 2 of the fixed effect regression assumption not need to satisfy?

A

the observations do not require to be iid over time for the same entity

20
Q

what is autocorrelation?

A

autocorrelation means correlation with itself. suppose a variable Z is observed at different dates so observations are on Z_t t=1,..,T, then Z is autocorrelated if Corr(Z_t,Z_t+j) not equal to zero for some dates j when j is not equal to zero

21
Q

if ommitted factors are serially correlated, what is the error term?

A

the error term is also serially correlated

22
Q

why are OLS standard errors in general are wrong for panel data?

A

they assume that the error term is serially uncorrelated. in reality the OLS standard errors often underestimate the true sampling uncertainty

23
Q

what are clustered standard errors?

A

clustered standard errors estimate the variance of B1 when the variables are iid across entities but are potentially autocorrelated within an entity

24
Q

what is the equation of the clustered SE of Y^_?

A

square root[{s^2(mean of Y)}/n] where s^2(mean of Y)} = 1/(n-1) * Σ( sample mean of Y for entity i - mean Y)

25
what is the one key features of the clustered SE's?
in the cluster SE derivation we never assumed that observations are iid within an entity thus we have implicitly allowed for serial correlation within an entity
26
why might panel data help?
potential OV bias from variables that vary across states but are constant over time potential OV bias from variables that vary over time but are constant across states
27
what are the advantages of fixed effects regression?
you can control for unobserved variables that vary across states but not over time and/or vary over time but not across states more observations give you more information estimations involves relatively straightforward extensions of multiple regressions
28
what are the limitations of regression with fixed effects
need variation over time with entities time lag effects can be important