Linear regression (week 3-5) Flashcards
Linear regression formula
Y = alpha + BetaiXi +E
What is Y in Linear Regression?
What is alpha in Linear Regression?
What is Beta in Linear Regression?
What is X in Linear Regression?
What is E in Linear Regression?
Y is dependent var
Alpha is intercept parameter
Beta is regression coefficient
X is explanatory variable
E is i.i.d error term (use N(o, var))
Estimated Linear Regression
same, but put hat on all the coeff and E is Ɛ
E vs Ɛ?
E is i.i.d error (capture uncertainty) and Ɛ is residual term (diff data from model, is better it act more like E, contains uncertainty and what not captured)
What if its not linear?
Use log
How to minimize Ɛ
use minimize SSR (Sum Squared of Error)
1. Sum symbol (Y - Yhat)
2. derive! alpha and beta
3. Set the derivation to 0
what is δ^2?
it represents (Σ(y-yhat)^2)/n-2
why have n-2 in the δ^2?
it represent unbiased estimator
what if is too large?
use the formula alpha with squingy line on top and beta with squingy line and test the hypothesis for both alpha with squingy line on top and beta with squingy line
Goodness-of-fit measured using
R^2 = regression SS / Total SS
between 0% to 100%
calc test stats
use F1,n-2
R^2 means?
proportion of total data variability explained by model
Total Sum Of Square means?
Deviations between data and sample mean (total variability)
Regression SS mean?
Deviations between model estimate and sample mean (data variability explained by model)
Residual SS mean?
Deviations between data and model estimate (data variability unexplained by model)
what does Y* means
its using the new Y, or Y in the future trs dibagi 100