Correlation and linear regression Flashcards
What is correlation used for?
to test for association between variables
What is meant by regression?
Once correlation between two variables has been shown regression can be used to predict values of other dependent variables from independent variables.
The degree of correlation is summarised by what
the correlation coefficient (r)
(indicates how closely the points lie to a line drawn through the plotted data)
Describe the correlation between 2 variables if r=1, r=0 and r=-1
r = 1 - strong positive correlation (e.g. systolic blood pressure always increases with age)
r = 0 - no correlation
r = - 1 - strong negative correlation (e.g. systolic blood pressure always decreases with age)
What is the name given to the correlation coefficient when using parametric variables vs the name given when analysing non-parametric variables
parametric variables = Pearson’s correlation coefficient (r)
non parametric variables = Spearman’s correlation coefficient (p)
what is linear regression?
linear regression may be used to predict how much one variable changes when a second variable is changed
A linear regression equation may appear as follows:
y = a + bx
What do each of the above represent?
y = the variable being calculated
a = the intercept value, when x = 0
b = the slope of the line or regression coefficient. (how much y changes for a given change in x)
x = the second variable