Week 6 (Correlation) Flashcards
What is a correlational design
-We look at pairs of scores to see whether one measure is consistently associated with scores on another measure.
Ordinal data
-Categories that can be ordered/ranked - property of magnitude
-BUT no precise differences between ranks
-So, although the categories have an order, they might not evenly spaced
E.g, above average, average, below average poor
Linear relationships
A relationship between two variables that can be described by a straight line
Outlier
Extreme data point, clear odd one out
What other things can scatterplots tell us
Spot restricted ranges
-When range is restricted , correlation can go down
Subgroups
-Scatterplots can show us whether we have a false positive correlation due to different subgroups of participants.
Pearson’s Product Moment Correlation: r
ranges from -1 (perfect negative correlation) to +1 (perfect positive correlation)
-Parametric test so it’s out “powerful” correlation - check this one first
-Must meet certain assumptions, including normal distribution of data
Directional problem
The direction of the causal relationship may not be what we assume
Confound problem
There may be another variable that isn’t measured in the study, but causes both shirt size and academic achievement
Spearman’s Rank Correlation: rho p
-Non-parametric test
-Can be used with ordinal data , or continuous data that are not normally distributed
Kendall’s Tau Rank Correlation: tau t
-Often used instead of spearman’s when there are tied scores (same scores across multiple people)
Assumptions of Pearson’s r
Type
-Data should be continuous, rather than ordinal (though often we treat Likert data as continuous - we’ll come back to judging when this is appropriate)
Normal
-Both variables should approximate a normal distribution (no strong skew, etc.)
Extreme?
-There should be no extreme values (outliers)
-Outliers can overly influence the calculation f Pearson’s statistic, more so than the other data points - leads to an inaccurate result.
Assumptions of Spearman’s ranked correlation
Type
-Data must be ordinal , interval or ratio level of measurement
Ties
-Participants variable levels should not be the same across multiple people (there should be few tied scores)
Covariance
The extent to which variables co-vary (change together)
-High covariance: Means there is a large overlap between the patterns of change (variance) observed in each variance
-Low overlap: means there is little overlap in the variance of each variable