Correlation Flashcards
What could be an indicator of high variance?
Datapoints being further away from their mean (on average).
How could we hypothetically calculate variance (the overall measure of how far each datapoint is from their mean)?
We could hypothetically just add up each deviation score.
What is an example of the reasons that we don’t just add up each deviation score to calculate variance?
The positive numbers would cancel out the negative ones.
Why do we square each deviation score before adding them up to calculate variance?
So that the positive numbers do not cancel out the negative ones.
How is variance calculated?
The difference between each datapoint and the mean is calculated, squared, added together, and divided by the number of datapoints included in the dataset.
To what does covariance refer?
To whether when a participant deviates from the mean on one variable, they also deviate from the mean on another variable in a similar or opposite way.
What are the different possible types of relationships between variables?
A positive relationship (as one variable goes up, the other goes up), negative relationship (as one variable goes up, the other goes down), or no relationship (as one variable goes up, the other does nothing systematic).
Of what is covariance a measure, as opposed to revealing anything about more complicated patterns?
It is a measure of a linear (straight line) relationship.
How can covariance be calculated (similarly to variance)?
By taking a given subject’s distance from the mean in x, and multiplying this by their distance from the mean in y, repeating this for all involved subjects, adding all of these values together, and dividing this value by n-1.
What is the covariance equation?
Cov(x, y) = (Σ(x-x̄)(y-ȳ))/ N-1
How does the covariance value established by the covariance equation explain the relationship between an independent and dependent variable?
Multiplying two positive values or two negative values will give rise to a positive value, and multiplying one positive value and one negative value will give rise to a negative value, meaning that once we have added together the distances of each variable’s datapoints from each mean, a positive relationship will result in a positive covariance measure, and a negative relationship will result in a negative covariance measure.
Of what does the standardisation of the covariance measure provide a measure?
Of the strength of the relationship between the two values being investigated (a correlation coefficient).
What is the ‘Pearson product moment correlation coefficient’?
The covariance measure divided by (sd X x sd Y).
To what does the ‘Pearson product moment correlation coefficient’ give rise?
To coefficients of between -1 (perfect negative relationship) and +1 (perfect positive relationship).
What is ‘Pearson product moment correlation coefficient’ (r)’s formula?
r = COVXY/ SXSY