Correlation and Simple linear regression Flashcards
What does a correlation coefficient measure?
The linear association between 2 continuous variables
What does a correlation coefficient have values between?
-1 and +1
What does a correlation coefficient of -1 indicate?
Perfect negative association
What does a correlation coefficient of +1 indicate?
Perfective positive association
What type of statistic is a correlation?
Parametric
What are parametric tests?
Those that make assumptions about the parameters of the population distribution from which the sample is drawn. This is often the assumption that the population data are normally distributed.
Why do the two continuous variables need to be approximately
normally distributed?
Correlation is a parametric statistic
What is a non-parametric measure of association between ranks and thereby does NOT require this assumption?
Spearman rank correlation coefficient is a non-parametric measure of association between ranks. So does NOT require this assumption
Correlation assumes causation.
True or false
FALSE
Correlation does. not assume causation
What does exact linearity between variable y and variable x mean?
That one variable is a linear function of the second
What does the following equation mean?
Y= B0 + B1X
The intercept B0 is the value that y taken when x is zero.
- If the intercept is zero then y increases in proportion to x (i.e. double x then y doubles)
The slope B1 determines the change y when x changes by one unit
- It measures the gradient of the line.
Why might the linear relationship only apply to the expected value of y?
Other things influence y other than x
- (The expected value of y is the average over several instances with the same value of x).
-Any single y measurement might differ from the line.
What is the mathematical model of a linear relationship?
E[Y]=B0+B1X
Or
y= B0 + B1x+E
- where E = Y-E[Y] is the error (discrepancy between the expected and observed y)
What does a linear model stipulate?
A linear relationship between variable x and the expectation of y.
What are scatterplots useful for?
Understanding the bivariate distribution of two continuous variable.