Correlation Flashcards
What does correlation allow us to investigate?
The extent to which two numerical variables are related to each other, in terms of both the strength and the direction of the relationship.
What diagram is used to describe a linear relationship?
Scatter diagram
What does a scatter diagram provide?
A visual representation of the relationship between two quantitative variables
What is the direction in a scatter diagram indicated by?
A positive or negative slope
What does the strength of the relationship between two variables on a scatter diagram depend on?
How closely the data points are scattered about a straight line
What does a positive correlation mean?
As the x value increases, the y value tends to increase as well
What does a negative correlation mean?
As the x value increases, the y value tends to decrease
What is the correlation coefficient (r)?
A numerical measure of the strength and direction of the linear relationship between two quantitative variables.
When calculating correlation coefficient what values do you need to find out?
You need to find Sxy, Sxx and Syy
- Sxy, from the products xy
- Sxx, from the squares x²
- Syy, from the squares y²
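A minimal sketch of the calculation, assuming the usual definitions Sxy = Σxy - (Σx)(Σy)/n, Sxx = Σx² - (Σx)²/n, Syy = Σy² - (Σy)²/n and r = Sxy / √(Sxx × Syy). The function name `pearson_r` and the data are made up for illustration.

```python
import math


def pearson_r(xs, ys):
    """Return the correlation coefficient r for paired data xs, ys."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length lists with at least 2 pairs")

    sum_x, sum_y = sum(xs), sum(ys)
    # Sxy = sum(xy) - sum(x) * sum(y) / n, and similarly for Sxx and Syy
    s_xy = sum(x * y for x, y in zip(xs, ys)) - sum_x * sum_y / n
    s_xx = sum(x * x for x in xs) - sum_x ** 2 / n
    s_yy = sum(y * y for y in ys) - sum_y ** 2 / n

    if s_xx == 0 or s_yy == 0:
        raise ValueError("r is undefined when one variable is constant")
    return s_xy / math.sqrt(s_xx * s_yy)


# Hypothetical data: hours revised and test marks
hours = [1, 2, 3, 4, 5]
marks = [52, 55, 61, 64, 70]
print(round(pearson_r(hours, marks), 3))  # close to 1: strong positive
```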
What indicates the direction of the linear relationship when the correlation coefficient has been calculated?
The sign of r
Positive or negative
r always lies between -1 and 1
What indicates the strength of the relationship when the correlation coefficient has been calculated?
The magnitude of r
If r is close to zero the linear relationship is very weak
The strength of the association increases as r moves away from zero, approaching -1 or 1
Why does the correlation coefficient not have any units of measurement?
Its value is independent of the scale of each variable
What are the limitations of using correlation coefficient?
- The value of r can be adversely affected by extreme values
- Can only be used for quantitative variables
- Only describes the strength and direction of linear relationships
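Two of these limitations can be seen in a short sketch, assuming r = Sxy / √(Sxx × Syy). The helper `pearson_r` and all data values are made up for illustration: one extreme point flips the sign of r, and a perfect curved relationship gives r = 0.

```python
import math


def pearson_r(xs, ys):
    """Correlation coefficient r = Sxy / sqrt(Sxx * Syy)."""
    n = len(xs)
    sum_x, sum_y = sum(xs), sum(ys)
    s_xy = sum(x * y for x, y in zip(xs, ys)) - sum_x * sum_y / n
    s_xx = sum(x * x for x in xs) - sum_x ** 2 / n
    s_yy = sum(y * y for y in ys) - sum_y ** 2 / n
    return s_xy / math.sqrt(s_xx * s_yy)


# Limitation 1: extreme values.
# Five points on a perfectly decreasing line give r = -1 ...
xs = [1, 2, 3, 4, 5]
ys = [5, 4, 3, 2, 1]
r_without = pearson_r(xs, ys)

# ... but adding one extreme point (20, 20) makes r strongly positive.
r_with = pearson_r(xs + [20], ys + [20])

# Limitation 2: only linear relationships.
# y = x^2 is a perfect relationship, but it is not linear, so r is 0 here.
curve_x = [-2, -1, 0, 1, 2]
curve_y = [x ** 2 for x in curve_x]
r_curve = pearson_r(curve_x, curve_y)

print(round(r_without, 2), round(r_with, 2), round(r_curve, 2))
```

Plotting a scatter diagram first is the usual way to spot both problems before relying on r.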
What does the rank correlation coefficient show?
The strength and direction of the linear relationship between the ranks of two sets of data
What does rs mean?
Spearman's rank correlation coefficient
What is the equation of the rank correlation coefficient?
$r_s = 1 - \frac{6 \sum d^2}{n(n^2 - 1)}$ (when there are no tied ranks)
What do n and d mean in the rank correlation coefficient formula?
n = the number of pairs of data values
d = the difference in ranks for each pair
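A minimal sketch of the calculation, assuming the standard no-ties formula rs = 1 - 6Σd² / (n(n² - 1)). The function names `ranks` and `spearman_rs` and the scores are made up for illustration; tied values are rejected rather than given averaged ranks.

```python
def ranks(values):
    """Rank values from 1 (smallest) to n (largest); ties are not handled."""
    if len(set(values)) != len(values):
        raise ValueError("this sketch assumes no tied values")
    position = {v: i + 1 for i, v in enumerate(sorted(values))}
    return [position[v] for v in values]


def spearman_rs(xs, ys):
    """Spearman's rank correlation coefficient for paired data (no ties)."""
    n = len(xs)  # n = number of pairs of data values
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length lists with at least 2 pairs")
    # d = difference in ranks for each pair; the formula uses the sum of d^2
    sum_d_squared = sum((rx - ry) ** 2 for rx, ry in zip(ranks(xs), ranks(ys)))
    return 1 - 6 * sum_d_squared / (n * (n ** 2 - 1))


# Hypothetical scores from two judges for five contestants
judge_a = [1, 2, 3, 4, 5]
judge_b = [2, 1, 4, 3, 5]
print(round(spearman_rs(judge_a, judge_b), 3))  # 0.8
```

Here the rank differences are -1, 1, -1, 1, 0, so Σd² = 4 and rs = 1 - 24/120 = 0.8.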

What does it mean if rs is positive?
The ranks are positively correlated
When one variable is ranked highly so is the other variable
What does it mean if rs is negative?
Higher rankings in one variable are paired with lower rankings in the other