week 8 Flashcards
Correlations
whether two variables change together or covary
The value of correlation coefficient can vary from -1 to +1
Regression
uses correlation to predict values of one variable from another
The prediction is done by finding a regression line that best represents the data
X axis
predictor
Y axis
outcome
regression equation
Y=bo+b1x+e
Y
outcome
bo
intercept
b1
slope of the line
X
predictor
e
error
Intercept
The point at which the regression line crosses the Y-axis
The value of Yi when X=0
Slope
a measure of how much Y changes as X changes
Regardless of it’s sign the larger the value of b1, the steeper the slope
For 1 unit of change on the X axis, how much change is there on the Y axis
Residual or prediction error
the difference between the observed value of the outcome variable and what the model predicts
Lines of best fit
a line that best represents the data
a line that minimises residuals
residual sum of squares
residuals can be positive or negative. If we add the residuals, the positive ones will cancel out the negative ones, so we square them before we add them up. We refer to this total as the sum of squared residuals or residual sum of squares
SSr is a gauge of how well the model fits the data