Chapter 3 lecture Flashcards
difference regression and correlation
a regression line predicts the dependent variable y on the basis of the independent variable x. It describes the relationship between x and the estimated values of y at the various levels of x.
a correlation describes the strength of the association between y and x. It indicates to what extend the data points deviate from the regression line.
what happens if r increases
then the data points will be closer to the regression line
correlation cannot be used to..
describe the lineair relationship between a and b
intercept =
a, value of y when x=0
slope =
b, how much y changes if x increases with 1.
positive b = … association, negative b = … associaton
positive, negative
u should draw the regression line so that…
there are as many dots above as below the line
residual
difference between the observation and the prediction (which is the regression line)
choose the line with the…
smallest sum of squares
sum of squares
sum van (y-y^)^2
hoe heet de lijn met de least sum of squares
least squares line
if we do not know x, what is the best guess for y
average y!
dus wat is het idee achter een regressieanalyse
kijken naar het verschil tussen het average (zonder x) en de regressielijn. how much is the error decreased by adding the predictor?
wat is r2
the proportional decrease in the prediction error
wat betekenen large/small r2?
large r2 -> groot verschil tussen average and prediction. dit betekent dat je een goede predictielijn hebt!
small r2 -> klein verschil tussen average and prediction. dit betekent dat je een slechte predictielijn hebt, je had net zo goed het gemiddelde kunnen gebruiken! dat geeft dan dezelfde informatie.
r2 formule
(total SS - RSS)/total SS
wat is SS
SS= total sum of squares = total variance:
vanaf punt tot average/mean (rechte lijn) = 𝒚-
wat is RSS
RSS = residual sum of squares
vanaf punt tot regression line = y^
RSS -> R, dus vanaf REGRESSION prediction line