Lecture 3: linear regression flashcards
what is r (correlation)?
r is the change in SD units of Y that occurs for every 1 SD change in X
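A minimal sketch of this reading of r, assuming a small made-up data set (the x and y values and the helper names `zscores` and `slope` are illustrative, not from the lecture). When both variables are converted to z-scores, the least-squares slope is expressed in SD units, and it should equal r.

```python
import math

def zscores(values):
    """Convert raw scores to z-scores using the population SD (divide by n)."""
    n = len(values)
    mean = sum(values) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    return [(v - mean) / sd for v in values]

def slope(x, y):
    """Least-squares slope of y on x: sum of cross-products over sum of squares of x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    return sxy / sxx

# Made-up illustrative data (an assumption, not from the lecture)
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

raw_slope = slope(x, y)                    # change in raw Y units per 1 raw unit of X
std_slope = slope(zscores(x), zscores(y))  # change in SD units of Y per 1 SD of X

print("raw slope:", round(raw_slope, 4))
print("standardized slope (equals r):", round(std_slope, 4))
```

The raw slope depends on the units of X and Y, while the standardized slope does not, which is why r can be compared across data sets.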
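how do you calculate Pearson's r?
Two equivalent ways:
- Calculate the covariance between the X and Y variables, then standardize it by dividing by the product of the SDs of X and Y
- Convert the X and Y scores to z-scores (standard scores), multiply each pair of z-scores, sum the products, then divide by n (or by n - 1 if the SDs were computed with n - 1)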
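Both routes can be sketched in a few lines. This is an illustrative example under stated assumptions: the data are made up, the function names are hypothetical, and the population SD (dividing by n) is used throughout so that the two methods agree exactly.

```python
import math

def mean(values):
    return sum(values) / len(values)

def population_sd(values):
    """Standard deviation computed with n in the denominator."""
    m = mean(values)
    return math.sqrt(sum((v - m) ** 2 for v in values) / len(values))

def r_from_covariance(x, y):
    """Method 1: covariance divided by the product of the two SDs."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / len(x)
    return cov / (population_sd(x) * population_sd(y))

def r_from_z_scores(x, y):
    """Method 2: mean of the products of paired z-scores."""
    mx, my = mean(x), mean(y)
    sx, sy = population_sd(x), population_sd(y)
    zx = [(a - mx) / sx for a in x]
    zy = [(b - my) / sy for b in y]
    return sum(p * q for p, q in zip(zx, zy)) / len(x)

# Made-up illustrative data (an assumption, not from the lecture)
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

r_cov = r_from_covariance(x, y)
r_z = r_from_z_scores(x, y)
print(round(r_cov, 4), round(r_z, 4))
```

The two functions do the same arithmetic in a different order: standardizing the covariance afterwards gives the same number as standardizing the scores first.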
correlation vs regression
Correlation: is there a relationship between 2 variables?
Regression: how well does one variable predict the other variable?
what does prediction require?
Prediction requires calculating a line of best fit (an equation)
- You can then use this equation to obtain a best-fit estimate for any new data point (X)
within the range of the original data set.
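As a rough sketch of that idea, the example below fits a line to made-up data and then estimates Y for a new X. The helper names `fit_line` and `predict`, and the choice to raise an error for an X outside the observed range, are assumptions made for illustration.

```python
def fit_line(x, y):
    """Return (a, b) for the least-squares line Y = a + bX."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((p - mean_x) * (q - mean_y) for p, q in zip(x, y))
    sxx = sum((p - mean_x) ** 2 for p in x)
    b = sxy / sxx            # slope
    a = mean_y - b * mean_x  # intercept
    return a, b

def predict(a, b, new_x, x_min, x_max):
    """Best-fit estimate for new_x, refusing to extrapolate beyond the data."""
    if not x_min <= new_x <= x_max:
        raise ValueError("new_x lies outside the range of the original data")
    return a + b * new_x

# Made-up illustrative data (an assumption, not from the lecture)
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

a, b = fit_line(x, y)
estimate = predict(a, b, 3.5, min(x), max(x))
print("intercept:", round(a, 4), "slope:", round(b, 4), "estimate at X=3.5:", round(estimate, 4))
```

The range check reflects the caution in the card: the fitted line only summarizes the relationship where data were actually observed.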
dependent variable
DV or criterion variable or response variable
- the variable that you are trying to predict
independent variable
IV or predictor variable or regressor
- the variable that you are trying to predict from
simple linear regression = 1 predictor variable
multiple regression = 2 or more predictor variables
what is the equation for a straight line?
Y = a + bX (a = intercept, b = slope)
what is the best regression line?
The best regression line is the line that minimizes the sum of squared deviations
(i.e., a line that satisfies the “least squares” criterion)
deviations (residuals) = observed value minus predicted value
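The least squares criterion can be checked numerically. The sketch below, using made-up data and a hypothetical helper `sum_squared_deviations`, compares the fitted line with two nearby lines. Because each deviation is squared, its sign does not affect the total.

```python
def sum_squared_deviations(a, b, x, y):
    """Sum of squared differences between observed Y and the line a + bX."""
    return sum((obs - (a + b * xi)) ** 2 for xi, obs in zip(x, y))

# Made-up illustrative data (an assumption, not from the lecture)
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

# Least-squares coefficients for this data set
n = len(x)
mean_x = sum(x) / n
mean_y = sum(y) / n
b = sum((p - mean_x) * (q - mean_y) for p, q in zip(x, y)) / sum((p - mean_x) ** 2 for p in x)
a = mean_y - b * mean_x

best = sum_squared_deviations(a, b, x, y)
steeper = sum_squared_deviations(a, b + 0.1, x, y)  # same intercept, larger slope
higher = sum_squared_deviations(a + 0.3, b, x, y)   # same slope, larger intercept

print("least-squares line:", round(best, 4))
print("steeper line:", round(steeper, 4))
print("higher line:", round(higher, 4))
```

Any change to the intercept or slope increases the total, which is what it means for the fitted line to satisfy the least squares criterion.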