Correlations Flashcards
What two things can a correlation coefficient be regarded as?
A descriptive statistic (describing the strength and direction of a relationship {between -1 and +1}) and a
measure of effect size
What does the magnitude of the correlation coefficient depend on?
The amount of noise or scatter in the relationship; the less noise, the stronger the relationship
Why is variation in scores critical to correlations?
Both measures must have some decent variability in their scores because the variation is all that’s being measured (it’s all about how people score relative to each other)
Why is it important to ensure you have decent variability in your scores?
If everyone scores the same then any correlation will be meaningless (if variance/standard deviation is zero, you can’t even calculate a correlation)
What if the spread of scores is truncated (i.e. restriction of range)?
Then the correlation magnitude may be reduced (i.e.
will seem that the underlying relationship is smaller than it actually is)
If testing the ability of a sample of electricians, what would you need to ensure?
That you include tasks of appropriate difficulty so you can tell apart electricians of different levels; recruit participants with an appropriate range of skill levels to evaluate the validity and reliability of your test
What would happen if we only simulated very easy tasks in a test?
It will restrict the range, as the worst novices could get the maximum score (ceiling effect) and the scores would be squashed together and suppress the correlation
What would happen if we only recruit novices and not experts?
The range of scores will be restricted to the low end of the scale, suppressing the correlation
Why are absolute values of scores irrelevant in correlations, and why are the raw scores discarded?
Because we’re evaluating scores relative to other scores in the sample; when calculating a correlation coefficient, both measures are standardised to z scores
What if the underlying relationship is not a linear/straight line?
Then a correlation coefficient will be an inaccurate estimate of that relationship; correlations only
represent linear relationships where two measures vary in a directly proportional way to one another
What is the significance test you get when you run a correlation and why do you need this?
A t-test that checks whether the correlation coefficient you obtained is significantly different from
zero; because we’re generally only ESTIMATING the correlation of interest from a sample
Why is the correlation shown by SPSS only an estimate?
In psychology we’re usually only interested in how much two measures correlate in general across the
population, then we can use this to predict outcomes
If we only have a small sample size, can we be confident that it’ll be a reasonable approximation of
the true population correlation?
Yes, but there will be a margin of error (confidence interval), and this will get smaller the more people
we include in our testing sample
What does sample size determine about the accuracy of the correlation estimate?
If it’s a big sample, the estimated correlation is more likely to be accurate and stable, and therefore
closer to the population
What does the p value you get with a correlation determine?
How likely it is that you’ll get this correlation coefficient if the “real” population correlation was actually zero (can it be reasonably discriminated from zero?)