Applied Micro Economics Flashcards

1
Q

Assumptions OLS

A
  1. Linear in parameters
  2. Random sampling
  3. No perfect collinearity
  4. Zero conditional mean
    (5) homoskedasticy
    (6) normality
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Linearitity in parameters

A

In the population model the dependent variable, y, is related to the independent variable, x, and the error, u as

Y=beta0 + beta1x +u

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

No perfect collinearity

A

In the sample
• none of the independent variables is constant, and
• there are no exact linear relationships among the independent variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Panel data

A

Several observations in the same unit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Exogene variable

A

No IV needed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Endogene variable

A

IV needed
•related to the parameter
•whether we are able to obtain an estimate of the variable of interest that is in expectation the population parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Idiosyncratic error term

A

Time varying component of the error term. Unique for each unit-time observation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Error term alpha: unit heterogeneity

A
  • The same for all observations of one unit
  • Unique to each unit
  • Time invariant component of the error term
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Estimater panel data

A

•un biased if both components of the error term are uncorrelated with the variable of interest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Inefficiënt estimators

A
  • produce incorrect standard errors
  • are error terms correlated across time?
  • unlikely that error terms are not correlated: you have to account for that
  • efficiency is related to how precisely one can estimate the parameter…thus the standard erros and confidence interval
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Pooled OLS

A

•estimation of parameters using OLS on data combining multiple observations of units in the sample
•use if error term is uncorrelated with the variable of interest
• exploits al data variation
• is the Best Linear Unbiased Estimator (BLUE)
- zero conditional mean holds
- no serial correlation/ no autocorrelation
Assumptions:
1. Cov=0 (exogeneity assumption)
2. No serial correlation
3. Other MRL assumptions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Fixed effects

A
  • prefer FE unless estimating alpha is relevant
  • use if idiosyncratic shock is uncorrelated with the variable of interest and the individual heterogeneity is correlated with the variable of interest.

Assumptions:

  1. Cov=0 (strict exogeneity assumption)
  2. Other MLR assumptions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Least square dummy

A
  • provides estimates for intercept alpha

* computationally burdensome since you need to estimate one additional parameters for each unit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

First difference FD

A

•Differentiates alpha out

Assumptions:

  1. Cov=0 (strict exogeneity assumption)
  2. Other MLR assumptions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

FE,LSDV & FD challenges

A
  1. Only produce estimates from within variation therefore:
    - not possible to estimate the effect of time-invariant characteristics
    - Lesse efficiënt methods since they use a limited source of variation
  2. Measurement error problems are more important
  3. Strict exogeneity assumption
  • to be unbiased the idiosyncratic shock must be uncorrelated with the variable of interest.
  • if Pooled OLS is unbiased than FE/LSDV/FD is also unbiased
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

RE model

A
  • will be more similar to FE model when the individual heterogeneity has a larger share of the variation
  • if FE is unbiased it is uncertain if RE will be unbiased
  • if theta is 0 it implies that most of the variation is coming from an idiosyncratic shock. So it will look more like a pooled ols model
  • use if error term is uncorrelated with the variable of interest
17
Q

CRE correlated random effects

A

Hypothesis gamma=0

Reject hypothesis:

  • at least one component of the time variant averages influences the outcome, therefore it is likely that other unobserved components matter.
  • alpha is likely correlated with the variable of interest: so use FE/LSDV/FD

Failure to reject hypothesis:

  • likely that no other unobserved time variant components matter
  • alpha is likely uncorrelated with the variable of interest: RE is unbiased and more efficient
18
Q

Panel data

A

Several observations in the same unit

19
Q

Exogene variable

A

No IV needed

20
Q

Endogene variable

A

IV needed

21
Q

Idiosyncratic error term

A

Time varying component of the error term. Unique for each unit-time observation

22
Q

Error term alpha: unit heterogeneity

A
  • The same for all observations of one unit

* Unique to each unit

23
Q

Hausman test

A

•Compare FE and RE
•significant differences in time varying coefficients imply that time invariant characteristics matter:
-RE is likely to be biased
-FE is more appropriate to account for all time invariant characteristics.
• Non-significant differences in time varying coefficients imply that time invariant characteristics do NOT matter:
- RE liken to be unbiased
- RE is more appropriate since it is also more efficient

24
Q

RE random effects

A

Assumptions:

  1. Cov=0
  2. Other MRL assumptions

•more efficient than pooled OLS

25
Q

Attrition

A

Units drop from the sample:

  • loss of follow up
  • death
  • refusal to remain in the survey

Also possible:
Item non-response; people do not respond to certain questions.

26
Q

Attrition test

A

Test whether being present is associated with outcomes.

Method:

  • unit is available in all waves
  • unit is available in the next wave
  • number of waves individuals is present

Caution: not all methods can be used with fixed effects.

27
Q

All waves methods

A

Step1. Create binary indicator if unit is present in all waves
Step2. Run model including indicator as covariant
Step3. -interpret coefficient
-significant correlation implies missing non at random
-discuss how such misgivings could lead to bias

28
Q

Unit is available next wave method

Only method with FE

A

Step1. Create binary indicator if unit is present in next wave.
Step2. Run model including indicator as covariant
Step3: -interpret coefficient
-significant correlation implies missing non at random
-discuss how such missing could lead to bias

29
Q

Number of waves method

A

Step1: create continuous indicator for number of waves present
Step2: Run model including indicator as covariant
Step3: -interpret coefficient
-significant correlation implies missing non at random
-discuss how such missings could lead to bias

30
Q

DID main idea

A
  1. -quasi-experimental approach
    • compares two groups
    • Assumes that all unobserved time-variant changes are common to both groups and therefore can be differentiated out one group to the other.
  2. Complements panel data by accounting for unobserved time-variant changes… but not exclusive to panel data
  3. Main idea: we can differentiate out the sources of bias by using two comparable groups where change in the treatment status is the only time-variant difference.
31
Q

DiD assumption

A

Correlation between e and x (treatment variable of interest) is constant

  • trends of control and treatment group are similar in absence of treatment
  • stable unit treatment value assumption (SUTVA)