correlation & multiple regression Flashcards
What is correlation
- An association or dependency between two independently observed variables
Analysis of correlation and what scores mean
0.0 when X and Y are completely independent of each other
1.0 when they are identical to one another
−1.0 when they are exactly inverse to one another
What is partial correlation?
Want to see if more than 2 variables relate to one another
i.e X, Y and Z
What is multiple linear regression?
Multiple linear regression is a similar concept to correlation
Major difference: it describes the relationship between one or more predictor variables (X1, X2, etc.) and a single criterion variable (Y)
Higher the beta… (MR)
Stronger the relationship
Beta tells us… (MR)
how e.g neurotism/stress predicts depression
prediction error is…
difference between the actual Y values and the predicted values
we aim to get this minimised
can be expressed as residual sum of squares
y = ax + b is the same as..
Y = BETA0 + BETA1X1
Multiple correlation coefficient (R)
Correlation between the predicted values Y^ and the observed values Y
Coefficient of determination (R^2)
Proportion of variance of explained by the regression model
This is simply the square of the multiple correlation coefficient
F-Ratio
As for ANOVA, we can derive an F-ratio contrasting the proportion of explained variance with the residual variance, allowing a statistical test
Assessing goodness-of-fit: sums of squares
Total sums of squares - how far all the data points vary from the mean
Residual sums of squares - difference between actual value and predicted value
Ssm = how much does our model vary from the mean - model sums of squares - mean best guess
Equation for coefficient of determination (R2)
R2 = SSM / SST
OR
R2 = 1 - SSR / SST
Higher F-rations indicate ?
Better models
Effect size for MR
Cohen’s f2
small = 0.02
medium = 0.15
large = 0.35