Week 3 Flashcards
What is a population bivariate linear regression?
A tool that allows to decompose random variable Y into a linear function of another variable X and a regression residual.
Y= a + bX + e
[a + bX] is population regression line
What is the best linear predictor of Y given X?
Population regression line
Why is the population regression line the best linear predictor of Y?
It minimizes mean squared error. Minimizes e = Y - a - bX
Minimise the mean squared error by calculus and show how to get (b) and (a).
What is the relation with regression residuals and regressors?
Regression residuals are uncorrelated with regressors by construction. Since b is defined by the property that Cov(X, e) = 0
What are a and b in the population bivariate regression?
a is the intercept and b is the slope of the the regression line.
An economist has an economic model of earnings where education (X) has a positive causal effect on earnings (Y). What is (u) in Y = a + bX + u
(u) is any other determinant of earnings other than education.
What are the 3 assumptions we make that (X) is not affected by (u).
1) E(u|X) = 0
Average error term for any given amount of X is zero
2) E(u|X) = E(u)
Average error term is same for any given amount of X
3) Cov(u, X)
Error term is uncorrelated with X
What is zero conditional mean assumption?
Implies E(u) = 0 hence, E(u|X) = E(u), which also implies uncorrelatedness
Show that b and a are unbiased estimators in population regression function.
How to show omitted variable bias in both cases? compare when X and u and correlated and uncorrelated.
How to describe the assumption Cov(X,u) = 0 makes sense using years of schooling and average ability?
If years of schooling were randomly assigned to individuals across a population. Average ability would be same across different years of schooling. In that case schooling and ability would be uncorrelated.
In reality we do not have all data in population we use a sample. what are the formulas for the estimand (b) and estimator (b hat).
Under which 2 assumptions is b hat a desirable estimator of b.
1) Large random sample
2) Large outliers in X and Y are unlikely
What is an assumption of b hat that allows us to contract confidence intervals and conduct hypotheses tests.
b hat is asymptotically normally distributed