Regression (Exam 1) Flashcards
1
Q
regression line
A
- a straight line that describes how a response variable y changes as an explanatory variable x changes in a linear relationship
- interprets (can describe how y varies with x)
- predicts (can predict the value of y for a given value of x) the future
- equation: ŷ = b0 + b1x
2
Q
prediction error (“residual”)
A
- how far away the predicted response is from the observed (actual) response
- error = observed – predicted = y – ŷ
- vertical distance between the points and the regression line
3
Q
least squares regression line
A
- want to choose the line that makes the errors (vertical distances) as small as possible
- the line that minimizes the sum of the squared errors
- focus on the magnitude of the error
4
Q
what are some cautions regarding regression?
A
- only for linear relationships
- association ≠ causation
- lurking variables
- influential points/outliers
- extrapolation
5
Q
influential
A
- if removing it from the analysis would significantly change the result of a statistical calculation
- outliers in the x-direction, y-direction, or relationship are often influential for the regression equation
6
Q
extrapolation
A
- a prediction for an x that is far away from the x’s in the dataset
- can be dangerous, because they assume the same linear relationship holds for values far outside of the range of data set used to build the model