Task 6: What time is it? Flashcards
The regression line is a
straight line that describes how a response variable y changes as an explanatory variable x changes.
The regression line is used to
predict the value of y for a given value of x.
The main difference between regression and correlation is that
correlation lets us know the association or the absence of a relationship between x and y.
The formula of regression is:
y = b0 + b1x b0 = intercept = y (bar) - b1*x(bar) b1 = slope = r * sy/sx
Extrapolation is
the use of a regression line for prediction far outside the range of values of the explanatory variable x used to obtain the line. Such predictions are not often accurate.
The prediction errors we make are always error in the variable ..
y.
error =
observed v. - predicted v.
Because the slope and intercept of the least-squares line depend on the units of measurement =>
you can’t conclude anything from their size.
The square of correlation r2 is the
variance of predicted y hat divided by the variance of predicted y.
A residual is the
difference between an observed value of the response variable and the value predicted by the regression line.
The mean of the least-squares residuals is always equal to
0.
A residual plot is a
scatterplot of the regression residuals against the explanatory variable, that helps us asses the fit of a regression line.
Association does not mean
causation.