lecture 3 Flashcards
points on a regression line
all data points have a residual, we want them to average to zero, does OLS do this?
what do OLS estimators minimize
the sum of squared residuals
where: uhat i = Y i - Bhat i - Bhat1 Xi
minimizing the sum of squared residuals gave us
two first order conditions (FOC) (SUM n, i=1)
1. uhat i = 0
2. uhat i X i = 0
OLS estimators
Bhat 1 = sXY/S^2 X
Bhat o = Ybar - bhat 1 Xbar
what does necessary conditions mean for FOC
they must be satisfied when we’ve estimated our estimate line via OLS
SUM n, i=1 uhat i = 0
1st first order condition
- states that the sum of the residuals must be equal to zero
SUM n, i=1 uhati X i = 0
2nd first order condition
OLS sample covariance =
0
based on OLS first order conditions we have decided
u hat bar = 0
S u caret X = 0
first order conditions are not
assumptions; they follow from the choice made to use OLS to estimate the regression line
graphical representations
y = give us the average residuals to sum to zero so the line is through the middle, not higher or lower
u caret = when put onto a residual line we can see that it is not the optimal height for a regression line
OLS regression lines are
average residual = 0
what is the height of the regression line
such that the average residuals sum to zero
what can be said about the OLS estimator of the intercept
Bcaret o + Bcaret 1 X bar = Ybar
OLS sets the height of the regression line at Xbar to Ybar: ensuring average residual will be equal to 0
- plug in mean for X, it should equal Ybar if the average residual is 0
OLS regressor must go through what
Xbar, Ybar
- slope doesn’t matter, it just needs to hit this point
even when the average residual is 0 what else is needed
covariance needs to be 0
what is the 2 step process for OLS
satisfy 2 FOC
how do the 2 FOC create the best fit line
- the first FOC of OLS sets the height of the regression at Xbar to be Ybar, ensuring the average residual is zero
- adding the second FOC ensures the slope of the regression results in zero covariance between the X and the residuals
goal of OLS
estimate a population regression line using a sample
OLS
estimation method for population regression line
implications of choosing OLS
1 . OLS must go through (Xbar, Ybar) ensuring residual is zero
2. the OLS slope ensures there is zero covariance between X and u caret
what variance do we prefer for an estimator
the one with more variance
/4 better than /2
least squares assumptions: line written as
there is a population regression line:
Yi = Bo + B1Xi + ui
what can be observed in a population regression line
Y and X
Bo, B1, ui are not observed