statistics - topic 7 - bivariate regression Flashcards
what is regression analysis used to do ?
Explain the impact of changes in an independent variable on a dependent variable
Predict the value of a dependent variable based on the value of at least one independent variable
what is the dependent variable?
The dependent variable is the variable we wish to explain (also called the endogenous variable)
what is the independent variable?
The independent variable is the variable used to explain the dependent variable (also called the exogenous or explanatory variable)
what does the population regression model show?
it shows the relationship between two variables
what are the components of a population regression model?
it has a dependent variable, population intercept, population slope coefficient, independent variable and an error term
what is the sample data used for ?
Sample data is used to provide an estimate of the population regression model
what are the assumptions required for the least squares estimation to be an accurate estimate?
The true relationship is linear (π is a linear function of π, plus a random error)
The error term, π_π, is uncorrelated with the random variable, π
The error term, π_π, has a mean of 0 and constant variance, π^2 (the latter property is called homoscedasticity):
πΈ[π_π ]=0 and πΈ[π_π^2 ]=π^2 for π=1,β¦,π
The error terms, π_π, are not correlated with one another, so that:
πΈ[π_π π_π ]=0 for all πβ π
what is the least squares method?
Least squares provides estimates of π½_0 and π½_1 by finding the values of π_0 and π_1 that minimize the sum of the squared errors (SSE):
minβ‘πππΈ=minβ‘β(π=1)^πβπ_π^2 =minβ‘β(π=1)^πβ(π¦_πβπ¦Μπ )^2 =minβ‘β(π=1)^πβ[π¦_πβ(π_0+π_1 π₯_π )]^2
what is the equation for b1 in the least squares coefficient estimator?
π_1
=(β(π₯_πβπ₯Μ
)(π¦_πβπ¦Μ
) ) /(β(π₯_πβπ₯Μ
)^2 )
=πΆππ£(π₯,π¦)/(π _π₯^2 )
=π x (π _π¦/π _π₯ )
where π is πΆπππ(π₯,π¦)
what is the regression line after you have estimated b1?
π_0=π¦Μ
βπ_1 π₯Μ
because the regression line goes through the sample means π₯Μ
, π¦Μ
what are the two parts of the variation in a dependent ratio?
the total sum of the squares is eqal to the regression sum of the squares + the error sum of the squares
what is the formula for the regression sum of the squares?
β(π¦Μ_πβπ¦Μ )^2 where π¦Μ_π = predicted value of the dependent variable given π=π₯_π and π¦Μ = sample mean of the dependent variable
what is the formula for the error sum of the squares?
β(π¦_πβπ¦Μ_π )^2 where π¦_π = observed value of the dependent variable and π¦Μ_π = predicted value of the dependent variable given π=π₯_π
what is the coefficient of determination?
it is the proportion of the total variation in the depenedent variable that is explained by variation in the independent variable
what is the formula for the coefficient of determination?
π ^2=πππ /πππ=(ππππππ π πππ π π’π ππ π ππ’ππππ )/(π‘ππ‘ππ π π’π ππ π ππ’ππππ )