Cross-section Metrics Flashcards
Sources of endogeneity
OVB, measurement error, reverse causality
Derive OVB
Derive on paper
Why adjust R^2?
And what is adjusted R^2?
Guaranteed to rise as we add variables: = 1-((n-1)/(n-k-1))(SSR/SST)
Unbiased estimate of S^2?
When uihat=Yi-b0hat-b1hatx1 etc
((1)/(n-betas))*sum of ui squared
GM Assumptions
Linearity / fixed or stochastic non identical regressors / exog / homeskedastic / no serial correlation / no perfect multicolinearity
If GM and e is normal dist?
(bhat - b)/(se(bhat)) ~t n-betas
Partialling out
OV to purify effect.
Reg ind on OV then reg Y on residual to get true effect!
Var of b1hat OLS one regressor
Var of error / Var(X)
Var of b1hat OLS with many regressors?
Var of error / (1-R^2 of X on other regressors)*SSTx
If large sample test b1=x
b1hat -b1 / (se b1hat) ~N(0,1) by CLT
F stat
F=((SSRr-SSRur)/(#restrictions))/(SSRur)/n-betas ~F#,n-betas
Linear combination test
t test
Non linear regressors / interactions
Test jointly (F).
Interactions: consider if compliments / substitutes!
Assumption of F test
All GM plus normal errors!
Law of Iterated Expectations
E(X)=E(E(X given Y))
Ramsey Reset what tests
Model specification. Should be add higher powers / cross terms
How to set up ramsey reset?
Reg yihat on xis and higher powers of yihat. Test (jointly coefficients on higher power yihats) F
Predict w if lnw=b0+b1x+b2y
Must remember Jensen’s inequality: Time e^(var of error/2)
Dummies thing to remember
Be precise and careful on comparison groups!
Chow test
Is model same for A and B groups? Just F test between the 2!
Asymptotics crucial when?
Not ~ N
Chebyshev’s inequality
As n to infinity, we can say confidence interval of xbar to correct value becomes very small!
Preservation of unbiasedness/ consistency by continuous transform?
Unbiasedness: no (see Jensen).
Consistency: Yes by continuous mapping theorem!
Slutsky’s theorem
If Xn converges to dist X and Yn converges in prob to C, then Xn/Yn converges in dist to X/C!
Use Slutsky’s on beta1hat - beta1
CLT on numerator and LLN on denom
Lagrange Multiplier test
nR^2 ~a~ Chi squared dof betas
Effect of hetero
inefficiency, increasing Var(b1hat ols)
Causes of hetero
Model misspecification (eg OVB/subpopulation differences/wrong functional forms)
IVs
Measurement error
Also genuine hetero!
Tests for Hetero
Goldfield Quandt (archaic)/Breusch Pagan/ White
GQ hetero test
Split sample in 2. Thus must be monotonic hetero
BP assumption
Assumes normality of errors.
Logit distribution?
e^x/(1+e^x) = 1/(e^-x+1)