linear regression with one regressor Flashcards
what is the population regression line?
it is the expected value of Y given X ie E(Y/X)
what is the slope of the population regression line?
the slope is the difference in the expected values of Y, for two values of X that differ by one unit.
what can the estimated regression be used for?
casual inference (learning about the casual effect on Y of a change in X)
prediction ( predicting the value of Y given X for an observation not in the data set)
what is the problem of statistical inference for linear regression?
the problem of statistical inference for linear regression is, at a general level, the same as for estimation of the mean or of the differences between two means
what does the statistical or econometric inference about the slope entails?
1) estimation - how should we draw a line through the data to estimate the population slope
2) hypothesis testing - how to test whether the slope is zero
3) confidence interval - how to construct a confidence interval for the slope?
what is the equation for the population regression line?
test score = β0 + β1STR where β1 = slope of population regression line
Why are β0 and β1 “population” parameters?
- We would like to know the population value of β1.
- We don’t know β1, so must estimate it using data
what is the equation of the population linear regression model?
Yi = β0 + β1Xi + ui, i = 1,…, n
* We have n observations, (Xi, Yi), i = 1,.., n.
* X is the independent variable or regressor
* Y is the dependent variable
* β0 = intercept
* β1 = slope* ui = the regression error
what does the regression error consist off?
The regression error consists of omitted factors. In general, these omitted factors are other factors that influence Y, other than the variable X. The regression error also includes error in the measurement of Y.
how can you estimate the β0 and β1 from data?
we can use the ordinary least squares or OLS estimator of the unknown parameters β0and β1. The OLS estimator solves: min(b0,b1) ∑ [Yi - (b0 + b1Xi)]^2
what is the equation for 𝛽1 ?
r(xy)*[Sy/Sx] where r(xy) is the sample correlation and Sy and Sx is the sample standard deviations
what is the equation for 𝛽0?
𝛽0 = mean of Y - 𝛽1*mean of X
what does the regression R^2?
the regression R^2 measures the fraction of the variance of Y that is explained by X; it is unitless and ranges between zero (no fit) and one (perfect fit)
what does the standard error of regression(SER) measure?
the standard error of the regression measures magnitude of a typical regression residual in the units of Y
what is the equation for R^2?
R^2 = ESS/TSS = [∑(Ŷ-mean of Ŷ)^2]/[∑(Y-mean of Y)^2]