Regression & Correlation Flashcards
What does correlation refer to?
Degree to which two quantitative variables are related
What is commonly used to measure correlation in quantitative parametric data?
Pearsons correlation coefficient
What does value of correlation coefficient ‘r’ vary between?
-1 to +1
Units of ‘r’?
None
When is correlation coefficient not valid?
If data is not independent (paired)
When can Fishers transformation be used?
To compare two correlation coefficients for hypothesis testing
What are Partial correlations?
Correlations between two variables after adjusting for a third variable
What is Spearmans correlation (rho)?
Non-parametric equivalent of Pearsons
What can Spearmans be used for?
To test association between two variables if at least one is ordinal or
If sample size is small despite being continuous variables
or if non-linearity is suspected or
if non-normal distribution is noted for both variables
What does Spearmans assume?
Difference between each pair of ordinal variables is the same i.e. the ranks are equidistant
If the difference between each pair of ordinal variables is not the same, how can one calculate correlation?
Kendalls Tau - appropriate measure of nonparametric correlation
What does regression statistics help with?
Helps predict what value one variable will be if given a particular value of the other variable
Explain the formula for simple linear regression
y = a +bx
B = regression coefficient
A = intercept on y axis
What can simple linear regression predict?
Probable score in Y axis from known score in X axis i.e. dependent variable can be predicted from value of independent variable
How does one determine the value of a and b for regression?
Using a scattergram and method of least squares
Explain the method of using a scattergram and method of least squares
Hypothetical straight line is constructed so that its vertical distance from various points of observations on a scattergram is kept to a minimum; this is called the residue.
The sum of the square of residues is kept to a minimum for a regression line of good fit
What happens in multiple linear regression?
Several independent variables together predict a single dependent variable
What type of technique is multiple regression?
Multivariate
What is the name of the independent variables in multiple regression?
Covariates
What is the name of covariates which may be highly correlated with each other?
Collinearity
Effect of collinearity?
May disturb the regression
When is regression coefficient useful?
Examine confounders