correlation and regression Flashcards
1
Q
How to choose between regression and correlation?
A
- c = interdependent variables
- regression = normally distributed data - dependent and independent variables
2
Q
What is the difference betwen correlation and regression?
A
C = quantifies degree to which two variables are related
R = linear r finds best line that predicts Y from X
C = Correlation coefficient = how much one variable tends to change when the other one does
R= Casual
C = Doesn’t matter which is X and which is Y
C= calculates a correlation coefficient (R) from -1 to +1
R = calculates goodness of fit (r^2)
3
Q
What are the choices for correlation?
A
- Parametric (normal distribution = pearson) - one variable increases as the other increases
- non parametric = spearman
- one variable decreases as the other increases
4
Q
what is the pearson correlation coefficient r?
A
- determines whether there is a statistically significant correlation
- the sign of r (+ or -) indicates whether there is positive or negative correlation
- the closer r is to 1 , the stronger the correlation
5
Q
Regression-linear
A
- in regression analysis, the distance between the data point and the corresponding point on the line of best fit is minimised for all of the data points
- it assumes normal distribution for the values and errorrs of the y variable
- there is no error in the x variable
- the variability of the errors for the y varibale are constant for all x variables