Chapter 17 - Prediction: Linear Regression Flashcards
What is linear regression?
- Plots a straight line through the scatter diagram and uses that line to predict the value of one variable from the value of the other.
- The science of linear regression provides rules for determining which is the best line for predicting one variable from the other.
- Determining the best straight line through a data set and using it to predict Y from X.
What is slope?
The change in Y divided by the range of X.
What is the intercept?
The general equation for a straight line is: the height of the line at any point X is equal to the slope of the line times X plus the intercept.
p463 The equation for a straight line
Slope is rise/run.
Where does the best regression line always pass through?
The best regression line passes through the point (X,Y), or in other words the mean of the X values and the mean of the Y values. NB: X and Y have a line above each.
What is the error of prediction?
- The difference between the actual value of Y and the value predicted from X.
- It is a vertical distance between the subject’s data point and the regression line.
- Denoted by the symbol ‘e’.
What is the least squares criterion?
The rule that states that the best regression line is the one that produces the smallest sum of the squared errors of prediction.
What is the standard error of the estimate?
The standard deviation of the errors of prediction.
What is homoscedasticity?
The assumption in regression that the size of the errors of prediction does not depend on the value of X.
What is the coefficient of the determination?
The square of the Pearson correlation coefficient (r) because it measures the extent to which one variable determines the magnitude of another and therefore determines how accurate a prediction is likely to be.
When is a prediction “good”?
When r is close to +1.
When is a prediction “bad”?
When r is close to 0.
What is the regression to the mean?
The predicted variable that generally lies closer to its mean than does the predictor variable.
Finish this sentence… if X and Y are perfectly correlated…
always predict Y to be closer to the mean than is X.
What is the personality coefficient?
The observations that most correlations between personality measures and behaviour are approximately .2 to .3.