Assessing Studies based on multiple regression Flashcards
Internal validity
the inferences on causal effects are valid for the population
- the estimator of the causal effect should be unbiased and consistent
hypothesis test shoud have the desired significance level
external validity
the inferences can be generealized from the population studied to other population of interest or settings
- compare studies on differnte but related pops
threats to internal validity
- Endogeneity problem:
- omitted variables
- misspecified functional form
- erros in variables (measurement error)
- sample selection
- simultaneous causality - Inconsistent standard errors
possible solutions to omitted variables
when OV are observed:
- what is the coefficient of interest?
- (before running ols), what is the most important sources of omitted variable bias?
- test nonzero coefficient of questionable variables
- provide full disclosure so that readers can see the effect of including questionable variables
possible solutions to omitted variables
when OV are NOT observed:
- use a randomized controlled experiment
- If IV is time constant., use panel data
- if an instrument is available, use IV estimation
Solutions to measurement error problem:
- get an accurate measure of X if possible
2.
- use IV ( a variable that is correlated with X but uncorrelated with w)
- mathematically model the measurement error and make a correction
solution to simultaneous causality bias:
- IV if instrument is available
2. randomized controlled experiment if possible
Inconsistent standard errors
correlation of the error term across observations ( cross- sectional dependence or serial correlation)
- sampling is based on geographic units (state level data)
- time series or panel data ( repeated observations on the same entity over time)
- cross sectional dependence is possible ( unobserved common shock)
- Test and CI do not work properly ( significant level incorrect)
solutions to inconsistent S.E
- heteroskedasticity-robust S.E
- Heteroskedasticity-autocorrelation-consistent S.E
- serial correlation- robust S.E for panel data models