term 2- regression with panel data- revise Flashcards

1
Q

what is a panel dataset?

A

a panel dataset contains observations on multiple entities where each entity is observed at two or more points

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is a balanced panel?

A

no missing observations; that is, all observations are observed for all entities and all time periods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is an unbalanced panel?

A

some observations are missing; that is some observations are not observed for some entities and time periods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what can we control for with panel data?

A

we can control for factors that:
vary across entities but do not vary over time
could cause committed variable bias if they are committed
are unobserved or unmeasured - and therefore cannot be included in the regression using multiple regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

how does panel data control for ommitted variable bias?

A

if an ommitted varibale does not change over time, then any changes in Y over time cannot be caused by the ommitted variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what if there is more than two time periods?

A

if there are more than two time periods you can rewrite the regression in two ways:
1) “n-1 regressor” regression model
2) “ fixed effects” regression model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

express the fixed effects model in n-1 binary regressor form?

A

Y_it = B0 +B1X_it + s_2 D_2i +… + s_n D_ni + u_it
where D_2i = 1 for i=2 or 0 otherwise

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

express the fixed effects model in the fixed effects form?

A

Y_it =a_i +B1 X_it +u_it where a_i is called a state fixed effect and is the constant effect of being in state i

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

when is the n -1 binary regressors OLS regression pratical?

A

it is only pratical when n isnt too big

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

how do you do n-1 binary regressors OLS regresiion?

A

first create the binary vairable D2i,…, Dni
then estimate 1 by OLS
inference (hypothesis tests, confidence intervals) is as usual (using heteroskedasticity robust standard errors)
this is impractical when n is large ie n=1000

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

how do you complete entity demeaned OLS regression?

A

first construct the entity demeaned variables Y~_it and X~_it
then estimate 2 by regressing Y~_it on X~_it using OLS
(it is like the changes approach but instead Y_it deviated from the state average instead of Y_i1. )

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

give examples of an ommitted variable which might vary over time but not across states?

A

safer cars such as airbags; changes in national law
these produce intercepts that change over time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what are the two formulations of regression with time fixed effects?

A

T-1 binary regressor formulation
time effects formulation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is the T-1 binary regressor formulation?

A

Y_it =B0 +B1X_it+ 𝛿2B_2t+….+𝛿_TB_2t+u_it
where B2t= 1 when t=2 ,0 otherwise, etc for B3 up to BT

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is the time effects formulation

A

Y_it=B0 +B1X_it + 𝜆_t+u_it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

under the anel data version of the least squares assumtion, is the ordinary least squares fixed effects estimator B1 normally distributed?

A

yes

17
Q

what are the fixed effects regression assumptions?

A

1) E(u_it|X_i1,….,X_iT,a_i) =0
2) (X_i1,…,X_iT,u_i1,…,u_iT), i =1,..,n are iid draws from their joint distribution
3) large outliers are unlikely (X_iT,u_iT) have finite fourth moments
4) there is no perfect multicollinearity (multiple X’s)

18
Q

when is the assumption 2 of the fixed effects regression assumptions satisfied?

A

(Xi1,…,XiT,ui1,…,uiT), i =1,…,n, are i.i.d.draws from their joint distribution.
it is satisfied when entities are randomly sampled from their populations from simple random sampling

19
Q

what does the assumption 2 of the fixed effect regression assumption not need to satisfy?

A

the observations do not require to be iid over time for the same entity

20
Q

what is autocorrelation?

A

autocorrelation means correlation with itself. suppose a variable Z is observed at different dates so observations are on Z_t t=1,..,T, then Z is autocorrelated if Corr(Z_t,Z_t+j) not equal to zero for some dates j when j is not equal to zero

21
Q

if ommitted factors are serially correlated, what is the error term?

A

the error term is also serially correlated

22
Q

why are OLS standard errors in general are wrong for panel data?

A

they assume that the error term is serially uncorrelated. in reality the OLS standard errors often underestimate the true sampling uncertainty

23
Q

what are clustered standard errors?

A

clustered standard errors estimate the variance of B1 when the variables are iid across entities but are potentially autocorrelated within an entity

24
Q

what is the equation of the clustered SE of Y^_?

A

square root[{s^2(mean of Y)}/n] where s^2(mean of Y)} = 1/(n-1) * Σ( sample mean of Y for entity i - mean Y)

25
Q

what is the one key features of the clustered SE’s?

A

in the cluster SE derivation we never assumed that observations are iid within an entity thus we have implicitly allowed for serial correlation within an entity

26
Q

why might panel data help?

A

potential OV bias from variables that vary across states but are constant over time
potential OV bias from variables that vary over time but are constant across states

27
Q

what are the advantages of fixed effects regression?

A

you can control for unobserved variables that vary across states but not over time and/or vary over time but not across states
more observations give you more information
estimations involves relatively straightforward extensions of multiple regressions

28
Q

what are the limitations of regression with fixed effects

A

need variation over time with entities
time lag effects can be important