L4 - Intro Regression Flashcards
What is a residual value?
It is the value of the deviation of a data point from a slop line, relative to the y axis.
What should the sum of squared residuals be if you have a good slope line?
zero!
What is the method of least squares?
It is a method that finds the best location for the straight line function because it estimates values for the slope, and the intercept that minimises the sum of squared residuals.
What are the two fundamental equations in the linear regression model?
- Full regression equation
- Regression model equation
How do the full regression equation and the regression model equation differ?
‘Model’ equations will always have a predicted Y score on the LHS, and no residual score on RHS (e)
of x and y, which is the dv and iv?
X - INDEPENDENT
Y - DEPENDENT
What is a and b in the regression equations?
a - intercept
b - slope
What is a partial regression coefficient?
The regression slope parameter, noted as b, in multiple linear regression.
(Regression w/ 2 or more IVs)
What are the two features we focus on in the linear regression model?
- The strength of the overall prediction model
- The strength of the prediction of each individual IV considered separately within the overall model.
Does strong prediction in the overall model imply that each individual IV is a good predictor?
No!
What does R Squared measure?
This indicates the strength of prediction in the overall model.
What are the 3 sources of variability in scores on the DV?
- Observed scores on the DV (SStotal)
- Predicted scores on the DV (SSReg, variation accounted for by the model)
- Difference between observed and predicted scores (SSRes, variation not accounted for by the model)
What is the formula for R squared?
R squared = SS Reg / SS Total
What is the formula for SS total?
SSTotal = SS Reg + SS Res
What is the formula for SS Reg?
SS Reg = SS total - SS Res