Correlation and Regression Flashcards
Correlation
A measure of the linear relationship between two variables. The correlation value characterizes the size and nature of the relationship.
Cannot infer causation from correlation, just nature and size of the relationship, and if they are related.
Can test the correlation between multiple variables at the same time.
Spurious relationship
A linear relationship between variables that is not logical, and usually due to a coincidence or the presence of a third common cause (confound variable). Cannot be determined by a statistical test.
r (correlation coefficient)
A means for determining the degree of linear relationship between two variables. NOT a ratio statistic
Explained variance (r squared)
An indicator of the proportion of variation in one variable (y) that is shared, or can be predicted or explained when the x values are known. Cannot indicate the direction of the relationship.
Also known as the coefficient of determination.
Regression
A calculation the provides a linear equation the predicts values of y for given values of x.
Used to describe the relationship between two variables mathematically and to enable researchers to make predictions.
Doesn’t usually predict the exact value of y, but reasonably close.
Regression Equation
The equation that best fits the scatterplot.