Linear Regression Flashcards
What is correlation used for?
measuring the strength and direction of a linear relationship
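As a rough illustration of the card above, the strength and direction of a linear relationship are commonly summarized with the Pearson correlation coefficient r. The sketch below is only an example: the data values and the function name `pearson_r` are made up and are not part of the flashcards.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient for paired samples."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products and sums of squares around the means
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Made-up illustrative data (an assumption, not from the cards)
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

r = pearson_r(xs, ys)
print(round(r, 3))  # sign gives direction, magnitude gives strength
```

A value near +1 or -1 suggests a strong linear relationship, and a value near 0 suggests a weak one.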
What is regression used for?
describing the linear relationship with an equation so that predictions can be made for other groups or situations
When do you use regression?
when only the values of the independent variable are known and the dependent variable must be predicted
what variable is being predicted in single linear regression?
the dependent variable, y
what variable are predictions made from in single linear regression?
the independent variable (the predictor), x
How do you determine the equation of the best fitting line?
a technique called least squares regression
what are residuals?
the difference between the observed value of y and the predicted value of y (point on the line)
what do the residuals look like when the line fits the data well?
they are small
what regression equation is the best?
the one with the smallest sum of squared residuals
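The least squares idea in the cards above can be sketched in a few lines, writing the line as y = b0 + b1x with intercept b0 and slope b1. The data, the function names `fit_line` and `sum_squared_residuals`, and the comparison line are all hypothetical, chosen only to show that the fitted line has a smaller sum of squared residuals than another candidate line.

```python
def fit_line(xs, ys):
    """Least squares intercept (b0) and slope (b1) for y = b0 + b1*x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    b1 = sxy / sxx
    b0 = mean_y - b1 * mean_x
    return b0, b1

def sum_squared_residuals(xs, ys, b0, b1):
    """Sum of squared differences between observed and predicted y."""
    return sum((y - (b0 + b1 * x)) ** 2 for x, y in zip(xs, ys))

# Made-up illustrative data (an assumption, not from the cards)
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

b0, b1 = fit_line(xs, ys)
sse_best = sum_squared_residuals(xs, ys, b0, b1)

# An arbitrary alternative line, for comparison only
sse_other = sum_squared_residuals(xs, ys, 2.0, 0.7)

print(b0, b1, sse_best, sse_other)
```

Any other choice of intercept and slope should give a sum of squared residuals at least as large as `sse_best`.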
how do you measure the accuracy of the predictions?
standard error of the estimate AKA root mean squared error (RMSE)
what is the RMSE?
root mean squared error, roughly the typical size of the error we make when using the regression equation to make predictions
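A small sketch of the calculation follows, using made-up observed and predicted values. One caveat worth hedging: RMSE is often computed by dividing by n, while the standard error of the estimate in regression with one predictor usually divides by n - 2, so the two numbers can differ slightly. Which convention a given course uses is an assumption to check.

```python
import math

# Hypothetical observed values and predictions from a fitted line
observed = [2, 4, 5, 4, 5]
predicted = [2.8, 3.4, 4.0, 4.6, 5.2]

# Residual = observed y minus predicted y
residuals = [y - y_hat for y, y_hat in zip(observed, predicted)]
sse = sum(e ** 2 for e in residuals)
n = len(observed)

rmse = math.sqrt(sse / n)               # divides by n
se_estimate = math.sqrt(sse / (n - 2))  # divides by n - 2 (two estimated coefficients)

print(round(rmse, 3), round(se_estimate, 3))
```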
what is the regression equation?
y = b0 + b1x where b0 is the y-intercept and b1 is the slope
what is the coefficient of determination?
R^2 = the proportion of the variance in y that is explained by x; a measure of how well the line represents the data
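One common way to compute R^2 is 1 minus the ratio of the sum of squared residuals to the total sum of squares around the mean of y. The sketch below uses made-up observed and predicted values, so the numbers are illustrative only.

```python
# Hypothetical observed values and predictions from a fitted line
observed = [2, 4, 5, 4, 5]
predicted = [2.8, 3.4, 4.0, 4.6, 5.2]

mean_y = sum(observed) / len(observed)

# Unexplained variation: squared residuals around the line
sse = sum((y - y_hat) ** 2 for y, y_hat in zip(observed, predicted))
# Total variation: squared deviations around the mean of y
sst = sum((y - mean_y) ** 2 for y in observed)

r_squared = 1 - sse / sst
print(round(r_squared, 3))
```

With these values, about 60% of the variance in y is accounted for by the line.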
what can you do to test if the linear relationship is a significant relationship?
test for the slope (t-test) and test for explained variance (F-test)
what does the t-test test?
test for the slope: whether the slope differs significantly from 0
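As a hedged sketch of the slope test, the t statistic is the slope divided by its standard error, compared against a t distribution with n - 2 degrees of freedom. The data below are made up, and the critical value 3.182 is the standard two-tailed table value for alpha = 0.05 with 3 degrees of freedom; it would need to be looked up again for a different sample size. With one predictor, the F statistic for explained variance equals the square of this t statistic.

```python
import math

# Made-up illustrative data (an assumption, not from the cards)
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
n = len(xs)

mean_x = sum(xs) / n
mean_y = sum(ys) / n
sxx = sum((x - mean_x) ** 2 for x in xs)
sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))

b1 = sxy / sxx              # slope
b0 = mean_y - b1 * mean_x   # intercept

sse = sum((y - (b0 + b1 * x)) ** 2 for x, y in zip(xs, ys))
sst = sum((y - mean_y) ** 2 for y in ys)

s = math.sqrt(sse / (n - 2))   # standard error of the estimate
se_b1 = s / math.sqrt(sxx)     # standard error of the slope
t_stat = b1 / se_b1            # tests H0: slope = 0

# F-test for explained variance: (explained / 1) / (unexplained / (n - 2))
f_stat = (sst - sse) / (sse / (n - 2))

T_CRITICAL = 3.182  # two-tailed, alpha = 0.05, df = 3 (from a t table)
significant = abs(t_stat) > T_CRITICAL

print(round(t_stat, 3), round(f_stat, 3), significant)
```

For this tiny sample the slope is not significantly different from 0 at the 0.05 level, even though the fitted slope is positive.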