Chapter 15 Flashcards
correlation
A statistical value that measures and describes the direction and degree of relationship between two variables. The sign (+/−) indicates the direction of the relationship. The numerical value (0.0 to 1.0) indicates the strength or consistency of the relationship. The type (Pearson or Spearman) indicates the form of the relationship. Also known as correlation coefficient.
A correlation is a numerical value that describes and measures three characteristics of the relationship between X and Y.
- The Direction of the Relationship The sign of the correlation, positive or negative, describes the direction of the relationship.
- The Form of the Relationship (eg. linear form)
- The Strength or Consistency of the Relationship
positive correlation
In a positive correlation, the two variables tend to change in the same direction: as the value of the X variable increases from one individual to another, the Y variable also tends to increase; when the X variable decreases, the Y variable also decreases.
negative correlation
In a negative correlation, the two variables tend to go in opposite directions. As the X variable increases, the Y variable decreases. That is, it is an inverse relationship.
pearson correlation
The Pearson correlation measures the degree and the direction of the linear relationship between two variables.
The Pearson correlation for a sample is identified by the letter r
r = degree to which X and Y vary together/degree to which X and Y vary separately = covariability of X and Y / variability of X and Y seperately
sum of products
A measure of the degree of covariability between two variables; the degree to which they vary together.
definitional formula for the sum of products
SP = sigma (X-Mx)(Y-My)
- Find the X deviation and the Y deviation for each individual.
- Find the product of the deviations for each individual.
- Add the products.
computational formula for the sum of products
SP = sigma XY - sigmaX sigma Y / n
the formula for the Pearson correlation becomes
r = SP/square root of SSx * SSy
sample r
r = sigmazxzy / (n-1)
population r
p = sigma zxzy / N
Where and why Correlation Are Used
- Prediction If two variables are known to be related in some systematic way, it is possible to use one of the variables to make accurate predictions about the other.
- validity
- reliability
- theory verification
When you encounter correlations, there are four additional considerations that you should bear in mind.
- Correlation simply describes a relationship between two variables. It does not explain why the two variables are related.
- The value of a correlation can be affected greatly by the range of scores represented in the data.
- One or two extreme data points, often called outliers, can have a dramatic effect on the value of a correlation.
- Thus, a correlation of r = 0.5 means that one variable partially predicts the other, but the predictable portion is only r^2 = 0.5^2 = 0.25 (or 25%) of the total variability.
coefficient of determination
The value r^2 is called the coefficient of determination because it measures the proportion of variability in one variable that can be determined from the relationship with the other variable. A correlation of r = 0.80 (or −0.80), for example, means that R^2 = 0.64 (or 64%) of the variability in the Y scores can be predicted from the relationship with X.
regression towards the mean
When there is a less-than-perfect correlation between two variables, extreme scores (high or low) for one variable tend to be paired with the less extreme scores (more toward the mean) on the second variable. This fact is called regression toward the mean.