HANDOUT 1 Flashcards
COV(X, Y) =
SUM (XI - XBAR)(YI - YBAR) / N-1
WHY IS DOF FOR COV N-1?
we only need either X bar or Y bar - NOT both
Why is COV not that useful?
It is NOT scale invariant - can make it as big or small as possible by changing units
CORR(x,y)=
= COV(X,Y) / SQRT VAR(X) VAR(Y)
Is correlation scale invariant?
YES
What is epsilon?
Ei = Yi - E(Yi I Xi) = Yi - alpha - beta(Xi)
True but unknown relationship between X and Y. What 2 parts?
Yi = alpha + beta(Xi) + Ei
Systematic & random components
4 CLRM ASSUMPTIONS
- E(Ei I Xi) = 0
- V(Ei I Xi) = sigma^2
- COV(Ei, Ej I Xi) = 0
- Ei I Xi - N(0, sigma^2)
ei =
the RESIDUALS
ei = yi - y hat = yi - a - bxi
OLS method
minimise the residual sum of squares to find the line of best fit.
MIN SUM ei^2
2 OLS conditions
- SUM ei = 0 - zero luck on average
2. SUM ei.xi = 0 - luck independent of x
What does ORTHOGONAL mean?
Independent e.g. SUM ei.xi = 0 means ei and xi are orthogonal.
If sum ei = 0 is NOT satisfied, how do we change the best fit line?
SHIFT it up/down
If sum ei.xi= 0 is NOT satisfied, how do we change the best fit line?
ROTATE it
estimate of alpha = a
a = y bar - b xbar
estimate of beta = b
b = sum (xi - xbar)(yi - ybar) / sum (xi - x bar)^2
COV(x,y) / Var(X)