Week 2 Flashcards
What are residuals?
Residuals are the portion of the score (case) which is not predicted by the regression. Since real world relationship is going to be perfect, there will always be some error and this is expected, however, large errors indicate that the regression model is not adequate, patterns in the residuals can indicate heteroscedasticity or non linearity or both and isolated cases with large residuals are likely to be outliers
What does Mahalanobis distance show?
Mahalanobis distance provides a way to measure how similar some set os conditions is to a known set of conditions. It accounts for the covariance among variables
What does linear regression plan to show?
linear regression looks to predict an outcome (DV) using predictor variables (IVs)
Linear regression looks to predict a single quantitative DV from multiple quantitative and/or qualitative variables. Are relationships always linear? How can non linear IV’s be made to appear linear?
By transforming them using log functions and so forth
what does the Pearson correlation measure?
Pearson correlation measures the strength of the linear relationship
Does regression model imperfect relationships?
Yes
In regression, do we predict y from X?
Yes
What does the equation to predict Y from X look like?
Y’ = A + BX
Where ‘ indicates “predicted”
The prediction of Y will not always be perfect. How is the error calculated?
Actual Y minus predicted Y
Y - Y’
What does the sum of squares have to do with regression?
The sum of least squares is used to find the value of A and B that create the equation of the line of best fit
Natural variation in Y indicate that the points of Y are dispersed in a scatter, how is this denoted?
SSY
What is the total variation of Y separated into?
Total variation can be separated into “regression” and and “error”
How is the total variance explained
R2 is equal to the Sum of Squares due to Regression divided by the Sum of Squares Total
What is the R value
The R value is the correlation between the DV and the IV’s
What is the R2 value?
The R2 value is the total amount of variation accounted for by the predictors