Least Squares Intro Flashcards
What are the names of the two variables involved in bivariate regression?
On the LHS y is the dependent variable or the regressand or the outcome or the explained variable
On the RHS x is the independent variable or the regressor or the explanatory variable or the covariate or the control
What is regression to the mean?
If the first observation of a variable was extreme, the next observation is likely to be less extreme, and vice versa
What is the most basic relationship Y could have?
Yi = α + εi for i = 1, …, N
Here these is a constant unknown population parameter and random deviation from this parameter
E(Yi) = α and the estimator for α and Y are a and Ŷ respectively but Ŷi = a in this case
What is the sum of squared residuals for Yi = α + εi?
SSR = Σi(Yi - a)2 = Σiei2
ei = Yi - Ŷi = Yi - a
What is the least squares estimator for Yi = α + εi?
From the criterion function minaSSR we can find the solution using ∂SSR/∂a = Σi∂ei2/∂a which by the chain rule is Σi∂ei2/∂ei ∂ei/∂a = -2Σei
The FOC sets this to zero so 0 = ΣYi - Σa so a = 1/n ΣYi = Ȳ
Note that Ȳ is a function of random variable Y and so is also a random variable with a sampling variance
What is the iterated expectation?
Converting a conditional expectation to an expectation
EX(E(Y|X)) = E(Y)
For continuous X, this comes from ∫XE(Y|X=x)fx(x)
What is a least squares line?
An estimate of a linear relationship between two variables
Ŷ = a + bx where Ŷ is a predicted value paired with a particular x, a is the estimated intercept, and b is the estimated slope
What is one assumption imposed by an OLS estimate?
Equal variances
How do you find the estimators for slope and intercept of a least squares line?
mina, b SSR
FOCs are ∂SSR/∂a = 0 and ∂SSR/∂b = 0 so for a this comes out to a = Ȳ - bX̄ (work through in your head), for b we have 0 = ΣXiei = ΣXY - ΣaX - ΣbX2 so ΣXY = Σ(Ȳ - bX̄)X + ΣbX2 which can be expanded and rearranged to get ΣXY - ȲΣX = b(ΣX - X̄ΣX) and ΣX = nX̄ so b = SXY/SX2 (sample covariance over sample variance)
a and b are random variables with expectation equal to the population parameters
How can the slope estimator be found from Cov(Y, X)?
Cov((α + βX + ε), X) = Cov(α, X) + βCov(X, X) + Cov(X, ε) but last term assumed to be zero so β = Cov(X, Y)/Var(X) and the estimator b = 1/(n-1)Σ(Y - Ȳ)(X - X̄) / 1/(n-1)Σ(X - X̄)2
What are the population and sample conditional expectation functions from OLS?
E(Y|X) = α + βX and E(Ŷ|X) = a + bX
What relationship between Y and X does a linear model impose?
Constant ∂Ŷ/∂Xi
Not always optimal
What is the relationship between the slope estimator and the sample correlation coefficient?
b = rSY/SX