Correlation and Bivariate Regression Flashcards
Correlation
Used to quantify the degree of relationship, or association, between two variables; the extent to which the direction and size of deviations from the mean in one variable are related to the direction and size of deviations from the mean in another variable
Coefficient
(or number) that represents the correlation will always be between +1.00 and −1.00.
Perfect Positive Correlation
+1.00; would exist if every subject varied an equal distance from the mean in the same direction (measured by a Z score) on two variables; if every subject who was 1 Z score above the mean on variable X was also 1 Z score above the mean on variable Y, and every other subject showed a similar relationship between deviation score on X and deviation score on Y
Perfect Negative Correlation
If all subjects who were above or below the mean on variable X were an equal distance in the opposite direction from the mean on variable Y, the result-ing correlation would be −1.00.
A correlation coefficient of 0.00 means that…
No relationship exists between the variables
Positive correlations result when…
Subjects who receive high numerical scores on
one variable also receive high numerical scores on another variable.
Negative correlations result when…
scores on one variable tend to be high num-bers and scores on a second variable tend to be low numbers.
Why is correlation useful?
Measure reliability by comparing test–retest measures on a group of subjects to determine consistency of performance.
When the correlation coefficient between two variables is known…
Scores on the second variable can be predicted based on
Cause and effect may be present…
but correlation does not prove causation (scores from the first variable. does not indicate the cause of that relationship)
Scatter Plot
A visual description of a correlation coefficient; the scores for each subject on two variables are plotted with one variable on the X-axis and the other variable on the Y-axis.
Best Fit Line
Represents the best linear estimate of the relationship between the two variables; If the correlation was a perfect +1.00 or −1.00, all of the plotted points would fall exactly on the line. As the correlation approaches 0.00, the data points drift further away from the line until the data points form a random cloud of points.
Correlation Coefficient
represents the relationship between the Z scores of the subjects on two variables (usually designated X and Y).
Pearson’s product moment correlation coefficient (r) Formula
r = Σ(ZxZy)/N
Residuals
The vertical distance from any point to the line;will be positive and negative and the sum of residuals will be equal to zero (in the same way that the sum of the deviation scores from the mean sum to zero).