Covariance & Correlation Flashcards
What is covariance?
Reflects the degree to which two variables vary together
How does covariance relate to correlation?
They both measure the linear relationship between two variables; covariance is taken from original scale scores so is affected by measurement scale; correlation is not, it’s a standardised measure
How do you interpret a covariance coefficient?
It’s the average product of the deviation scores of two variables
How do you interpret a correlation coefficient?
Pearson’s r; it tells us whether a relationship is likely to have occurred by chance (0-1 indicates magnitude of relationship; sign (+/-) indicates direction); divide covariance by SxSy (standard deviations of x & y)
Why do we need to test r for significance?
To determine whether the linear relationship between two variables in a sample is large enough to infer a linear relationship in the population, or if the correlation is due to sampling error
What’s the best predictor of the criterion (variable on Y axis)?
The mean of the criterion
Describe the line of best fit
It captures the relationship between the variables on a scatterplot
What does the numerator of covariance involve?
Finding extent to which scores differ from the mean of a variable (both X & Y); multiplying the two deviation scores (XY) for each participant; adding up deviation scores across all participants (SPxy)
What does the denominator of covariance involve?
Dividing SPxy by N-1 so that the covariance is independent of the number of scores
How do we calculate Pearson’s r?
Divide covariability of X & Y (COVxy) by the separate variabilities of X & Y (SxSy)
If we use the standard score formula (ZxZy/N-1) what don’t we need to do?
Calculate the covariance (but still must calculate standard scores for X & Y)
State the statistical & conceptual hypotheses for testing the significance of r
Null: rho (correlation in population) = 0; there is no linear relationship between X & Y in the population; Alternative: rho /= 0; there is a linear relationship between X & Y in the population
What formula do we use to test the significance of r?
t = r x square root of N - 2 divided by the square root of 1 - r squared
What is the degrees of freedom?;
Which table do we look up?
N - 2;
t table
The larger the N, the smaller the what?
Absolute value of r needed for significance