Unit 11 Correlation vs Regression Flashcards
Correlation analysis examines…
The strength (intensity) of the relationship between two experimental variables.
Two things correlation analysis assesses
How much of the variation in one variable is explained by another variable
Whether the explained variation is greater than can be expected by chance alone, i.e. whether it is significant
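The two questions above can be sketched in Python. This is only an illustration under stated assumptions: the explained variation is taken to be r squared for a straight-line relationship, and significance is checked with a permutation test, which is one possible approach and not necessarily the test used in this unit. The function names, the data, the number of shuffles, and the seed are all made up for the example.

```python
import random


def r_squared(xs, ys):
    """Proportion of the variation in y explained by a linear relationship with x."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy * sxy / (sxx * syy)


def permutation_p_value(xs, ys, n_shuffles=2000, seed=0):
    """Share of random re-pairings whose r squared is at least the observed one.

    Shuffling y breaks any real link with x, so the shuffled values show
    what "chance alone" produces. n_shuffles and seed are arbitrary choices.
    """
    rng = random.Random(seed)
    observed = r_squared(xs, ys)
    shuffled = list(ys)
    hits = 0
    for _ in range(n_shuffles):
        rng.shuffle(shuffled)
        if r_squared(xs, shuffled) >= observed:
            hits += 1
    # Add 1 to both counts so the estimate is never exactly zero.
    return (hits + 1) / (n_shuffles + 1)


# Hypothetical data: y rises almost linearly with x.
x = [1, 2, 3, 4, 5, 6, 7, 8]
y = [2.1, 3.9, 6.2, 7.8, 10.1, 12.2, 13.8, 16.1]

explained = r_squared(x, y)
p_value = permutation_p_value(x, y)
print(f"explained variation (r^2): {explained:.3f}")
print(f"permutation p-value: {p_value:.4f}")
```

A small p-value here means that random re-pairings of the data rarely explain as much variation as the real pairing does.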
Regression Analysis…
Measures the form of the relationship and describes the functional relationship between variables
Where does regression usually apply?
When we have control of the X (independent) variable and can measure it essentially without error
The centroid is…
The equivalent of the mean for a double statistical series: the point whose coordinates are the mean of X and the mean of Y
The covariance is…
The covariance between X and Y (Sxy) is the equivalent of the variance (S²) for a double statistical series (two continuous variables, X and Y)
What does covariance describe?
Describes the dispersion of two or more quantitative variables
Is symmetric: the covariance Sxy equals the covariance Syx (mirrored across the diagonal of the covariance matrix)
❑ Variances are never negative (they are built from squared deviations)
❑ Covariances can be negative: they vary from -∞ to +∞
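Variances are found in…
The covariance matrix or dispersion matrix (S matrix), which describes dispersion around the mean; the variances lie on its diagonal and the covariances lie off the diagonal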
Covariance measures…
Covariance measures the joint dispersion of two quantitative variables around their centroid
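As a rough sketch of how the centroid, the covariance, and the S matrix fit together, the Python below computes each from paired data. It assumes the sample form of the covariance (dividing by n - 1); the function names and the data are invented for illustration.

```python
def centroid(xs, ys):
    """Return (mean of x, mean of y), the centroid of the paired data."""
    n = len(xs)
    return sum(xs) / n, sum(ys) / n


def covariance(xs, ys):
    """Sample covariance: joint deviations from the centroid, divided by n - 1."""
    x_bar, y_bar = centroid(xs, ys)
    joint = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    return joint / (len(xs) - 1)


def covariance_matrix(xs, ys):
    """2 x 2 S matrix: variances on the diagonal, covariances off it."""
    return [
        [covariance(xs, xs), covariance(xs, ys)],
        [covariance(ys, xs), covariance(ys, ys)],
    ]


# Hypothetical data: y falls as x rises, so the covariance is negative.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [10.0, 8.0, 6.0, 4.0, 2.0]

S = covariance_matrix(x, y)
print("centroid:", centroid(x, y))
for row in S:
    print(row)
```

The covariance of a variable with itself is its variance, which is why the diagonal entries cannot be negative while the off-diagonal entries can.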
Correlation is…
Correlation is a statistical technique used to determine the degree to which two variables are related
Two correlation coefficients are considered:
Pearson product moment correlation coefficient (r), employed with interval or ratio scaled variables (parametric)
Spearman rank order correlation coefficient (rho), employed with ordinal or ranked data (non-parametric)
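One way to see the difference between the two coefficients is to compute Spearman's coefficient as Pearson's r applied to ranks. The sketch below does that in plain Python, giving tied values the average of their ranks (a common convention, assumed here). The function names and data are made up for the example.

```python
import math


def pearson_r(xs, ys):
    """Pearson product moment correlation of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def average_ranks(values):
    """Rank values from 1 upward; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman_rho(xs, ys):
    """Spearman rank order correlation: Pearson's r computed on the ranks."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


# Hypothetical data: y always increases with x, but not along a straight line.
x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]

print("Pearson r:   ", round(pearson_r(x, y), 3))
print("Spearman rho:", round(spearman_rho(x, y), 3))
```

Because the ranks of x and y agree perfectly, the rank-based coefficient reaches 1, while Pearson's r stays below 1 because the relationship is curved.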
Pearson Correlation Coefficient (rxy) measures
Only the linear relationship between two quantitative variables
It is the covariance of standardized variables
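The statement that r is the covariance of standardized variables can be checked numerically. The sketch below standardizes each variable (subtract the mean, divide by the sample standard deviation), takes the covariance of the results, and compares it with the usual ratio Sxy / (Sx * Sy). The helper names and data are hypothetical, and the sample (n - 1) forms are assumed throughout.

```python
import math


def mean(values):
    return sum(values) / len(values)


def sample_sd(values):
    """Sample standard deviation (divides by n - 1)."""
    m = mean(values)
    return math.sqrt(sum((v - m) ** 2 for v in values) / (len(values) - 1))


def covariance(xs, ys):
    """Sample covariance (divides by n - 1)."""
    mx, my = mean(xs), mean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / (len(xs) - 1)


def standardize(values):
    """Rescale to mean 0 and standard deviation 1."""
    m, s = mean(values), sample_sd(values)
    return [(v - m) / s for v in values]


def pearson_r(xs, ys):
    """Pearson's r computed as the covariance of the standardized variables."""
    return covariance(standardize(xs), standardize(ys))


# Hypothetical data with a loosely linear trend.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
r = pearson_r(x, y)
r_direct = covariance(x, y) / (sample_sd(x) * sample_sd(y))

# A perfect but curved relationship (y = x squared) that r does not detect.
x_curve = [-2, -1, 0, 1, 2]
y_curve = [4, 1, 0, 1, 4]
r_curve = pearson_r(x_curve, y_curve)

print("r from standardized variables:", round(r, 4))
print("r from Sxy / (Sx * Sy):       ", round(r_direct, 4))
print("r for the curved data:        ", round(r_curve, 4))
```

The curved data set illustrates the "only linear" caveat: y is completely determined by x, yet r is essentially zero because the relationship is not a straight line.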
In correlation a positive relationship means…
Means that ‘individuals’ obtaining high scores on one variable tend to obtain high scores on a second variable.
The converse is also true, i.e. individuals scoring low on one variable tend to score low on a second variable.
In correlation a negative relationship means…
Means that ‘individuals’ scoring low on one variable tend to score high on a second variable
Conversely, individuals scoring high on one variable tend to score low on a second variable
Residuals are…
The vertical distances between the observed points and the values predicted by the fitted line.
They are the estimated error terms.
If the residuals are small relative to the total variation in Y, then there is a strong correlation.
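The link between residuals and the strength of the relationship can be sketched as follows. The code assumes an ordinary least-squares straight line (the cards do not name a fitting method) and compares two made-up data sets, one hugging the line and one scattered around it. All names and numbers are illustrative.

```python
def fit_line(xs, ys):
    """Least-squares slope and intercept for y = intercept + slope * x."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = my - slope * mx
    return slope, intercept


def residuals(xs, ys):
    """Observed y minus the y predicted by the fitted line, for each point."""
    slope, intercept = fit_line(xs, ys)
    return [y - (intercept + slope * x) for x, y in zip(xs, ys)]


def r_squared_from_residuals(xs, ys):
    """1 - (residual sum of squares / total sum of squares around the mean of y)."""
    my = sum(ys) / len(ys)
    ss_res = sum(e * e for e in residuals(xs, ys))
    ss_tot = sum((y - my) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


# Hypothetical data: same x values, two different amounts of scatter in y.
x = [1, 2, 3, 4, 5]
tight = [2.1, 3.9, 6.0, 8.1, 9.9]   # points close to a straight line
loose = [4.0, 2.0, 7.0, 5.0, 12.0]  # points scattered around a similar line

for name, y in (("tight", tight), ("loose", loose)):
    res = [round(e, 2) for e in residuals(x, y)]
    print(name, "residuals:", res, "r^2:", round(r_squared_from_residuals(x, y), 3))
```

The comparison is relative: what matters is how large the residuals are compared with the total spread of Y, not their absolute size.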