17 Linear Regression Flashcards
Limitations of correlation coefficient
Doesn’t help us make predictions, it is only calculated for two variables
What does Ei represent?
The error term
Assumption of simple linear regression model
Xi are fixed (non random)
Ei and Yi are random variables
Residuals of regression
Êi= yi- ŷi
Measure the vertical distance between the fitted line ŷi and the actual values of yi
What is the intercept
B0
What is the slope
B1
OLS
Ordinary Least Squares. It works by fitting a line through the data minimising the sum of squared residuals
What is the estimate of B1 equal to?
Cov(x,y)/var(x)
What is the estimate of B0 equal to?
The mean of y - cov(x,y)/var(x) x mean of x
Is the OLS estimator biased?
No because the expected value of the estimates of B0 and B1 is B0 and B1
Which estimators are blue and have the smallest variance?
OLS estimators
Blue = best linear unbiased estimator
What is the variance of the estimators of B0 and B1?
Zero, this shows they are consistent estimators
What is coefficient of determination denoted as?
R^2
What is the coefficient of determination
It calculates the proportion of the variation in the dependent variable that is explained by the fitted regression
Total sum of squares (TSS)
The total squared variation of the yi values about their mean
TSS= sum of (yi-mean)^2