Regression and Correlation Flashcards
Correlation
- degree to which two quantitative variables are related
- does not suggest causation
Pearson’s correlation coefficient
- commonly used measure for quantitative parametric data
- correlation ranges from -1 to +1
- no units
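A minimal sketch of how Pearson's coefficient could be computed by hand, assuming two equal-length samples. The function name `pearson_r` and the data are made up for illustration.

```python
import math

def pearson_r(xs, ys):
    """Pearson's correlation coefficient for two equal-length samples."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products of deviations from the means
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sums of squared deviations for each variable
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Illustrative (made-up) data
hours = [1, 2, 3, 4, 5]
scores = [2, 4, 5, 4, 5]
r = pearson_r(hours, scores)

# Rescaling a variable (changing its units) leaves r unchanged
r_rescaled = pearson_r([h * 60 for h in hours], scores)
```

Because the deviations are divided by their own spread, the result stays between -1 and +1 and carries no units.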
Regression
- predicts the value of a dependent variable from the value of a correlated independent variable
Regression coefficient
- y = a + bx
- b is the regression coefficient (the slope of the line) and a is the intercept on the y-axis
Fisher’s transformation
- may be used to compare two correlation coefficients in hypothesis testing
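A hedged sketch of one common way to compare two correlation coefficients with Fisher's transformation, assuming the two samples are independent. The function name `compare_correlations` and the input values are hypothetical.

```python
import math

def compare_correlations(r1, n1, r2, n2):
    """Two-sided z-test for the difference between two independent correlations."""
    # Fisher's transformation: z = atanh(r) is approximately normally distributed
    z1 = math.atanh(r1)
    z2 = math.atanh(r2)
    # Standard error of the difference between the two transformed values
    se = math.sqrt(1 / (n1 - 3) + 1 / (n2 - 3))
    z = (z1 - z2) / se
    # Two-sided p-value from the standard normal distribution
    p = math.erfc(abs(z) / math.sqrt(2))
    return z, p

# Hypothetical inputs: r = 0.6 in a sample of 50, r = 0.3 in a sample of 60
z_stat, p_value = compare_correlations(0.6, 50, 0.3, 60)
```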
Partial correlations
- correlation between two variables after adjusting for a third variable
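As a sketch, a first-order partial correlation can be obtained from the three pairwise Pearson coefficients. The function name and the example values below are assumptions for illustration only.

```python
import math

def partial_correlation(r_xy, r_xz, r_yz):
    """Correlation between x and y after adjusting for a third variable z."""
    numerator = r_xy - r_xz * r_yz
    denominator = math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))
    return numerator / denominator

# Hypothetical pairwise correlations
adjusted = partial_correlation(r_xy=0.5, r_xz=0.6, r_yz=0.7)

# If z is unrelated to both x and y, adjusting for it changes nothing
unadjusted = partial_correlation(r_xy=0.5, r_xz=0.0, r_yz=0.0)
```

In this made-up case the adjusted value is much smaller than 0.5, which suggests that most of the apparent x-y association runs through z.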
Spearman’s correlation (rho)
- non-parametric equivalent of Pearson’s
- used to test the association of variables if at least one is ordinal (ranked)
- assumes ranks are equidistant
- if this is not true then Kendall’s tau will be used
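A minimal sketch of Spearman's rho, assuming it is computed as Pearson's coefficient applied to the ranks, with tied values given the mean of their ranks. The helper names and the data are made up.

```python
import math

def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of tied values starting at position i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def spearman_rho(xs, ys):
    """Pearson's coefficient computed on the ranks of xs and ys."""
    rx, ry = average_ranks(xs), average_ranks(ys)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    sxx = sum((a - mean_x) ** 2 for a in rx)
    syy = sum((b - mean_y) ** 2 for b in ry)
    return sxy / math.sqrt(sxx * syy)

# A curved but strictly increasing relationship (made-up data)
rho = spearman_rho([1, 2, 3, 4, 5], [1, 4, 9, 16, 25])
```

Because only the ordering matters, any strictly increasing relationship gives rho = 1 even when it is not a straight line.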
How to calculate the values of a and b in the regression calculation
- done using a scattergram and the ‘method of least squares’
- vertical lines are drawn from each point on the scattergram to the line of best fit
- these distances are called residuals; the least-squares line minimises the sum of their squares
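The least-squares calculation for y = a + bx can be sketched as follows. The function name `least_squares_fit` and the data are hypothetical.

```python
def least_squares_fit(xs, ys):
    """Return (a, b) for the least-squares line y = a + b*x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    b = sxy / sxx               # slope (regression coefficient)
    a = mean_y - b * mean_x     # intercept on the y-axis
    return a, b

# Illustrative (made-up) data
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
a, b = least_squares_fit(xs, ys)

# Vertical distance from each observed point to the fitted line
residuals = [y - (a + b * x) for x, y in zip(xs, ys)]
```

With an intercept in the model, the residuals of a least-squares line sum to zero (up to rounding), which is a quick sanity check on the fit.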
Multiple linear regression
- several independent variables together predict a single dependent variable
- multivariate technique
- the independent variables are called covariates
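A sketch of multiple linear regression with two covariates, assuming the coefficients are found by solving the normal equations. Statistical packages normally do this for you; the helper names here are made up, and the data are generated exactly from y = 1 + 2*x1 + 3*x2 so the expected answer is known.

```python
def solve_linear_system(matrix, rhs):
    """Solve matrix @ x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    # Work on an augmented copy so the inputs are not modified
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    # Back-substitution
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - s) / aug[i][i]
    return x

def fit_multiple_regression(rows, ys):
    """Least-squares coefficients [a, b1, b2, ...] for y = a + b1*x1 + b2*x2 + ..."""
    design = [[1.0] + list(row) for row in rows]  # leading 1 for the intercept
    p = len(design[0])
    # Normal equations: (X^T X) beta = X^T y
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(p)] for i in range(p)]
    xty = [sum(d[i] * y for d, y in zip(design, ys)) for i in range(p)]
    return solve_linear_system(xtx, xty)

# Made-up covariates (x1, x2) and an outcome built from y = 1 + 2*x1 + 3*x2
covariates = [(1, 2), (2, 1), (3, 4), (4, 3), (5, 5)]
outcome = [1 + 2 * x1 + 3 * x2 for x1, x2 in covariates]
coefficients = fit_multiple_regression(covariates, outcome)
```

The returned list holds the intercept first, then one coefficient per covariate.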
Collinearity
- occurs when two covariates are highly correlated with each other
- may distort the regression estimates
R2
- square of the correlation coefficient (r) in simple linear regression
- also called the coefficient of determination
- used to test goodness of fit of the final regression
- it is the proportion of total variation in the dependent variable that can be explained by the independent variable(s)
- measures how well the observed and predicted values of the dependent variable correspond to each other
- ranges from 0 to 1
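A minimal sketch of the coefficient of determination, assuming it is computed as 1 minus the ratio of residual variation to total variation. The data are made up, and the predictions come from the line y = 2.2 + 0.6x, which is the least-squares line for these particular points.

```python
def r_squared(observed, predicted):
    """Proportion of the variation in the observed values explained by the model."""
    mean_obs = sum(observed) / len(observed)
    # Variation left unexplained by the model (sum of squared residuals)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    # Total variation of the observed values around their mean
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1 - ss_res / ss_tot

# Illustrative (made-up) data and the fitted values from y = 2.2 + 0.6x
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
predicted = [2.2 + 0.6 * x for x in xs]
r2 = r_squared(ys, predicted)
```

For these points Pearson's r is about 0.775, and squaring it gives the same value as `r2`, as expected for a regression with a single independent variable.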
Linear regression
- dependent variable must be continuous
Logistic regression
- used if the dependent variable is binary
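As a sketch of the idea, a logistic model passes the linear predictor a + bx through the logistic function, so the output can be read as the probability of the binary outcome. The coefficients below are hypothetical and are not fitted to any data.

```python
import math

def predicted_probability(x, a, b):
    """Probability that the binary outcome equals 1 under a logistic model."""
    linear_predictor = a + b * x
    # The logistic function maps any real number into the interval (0, 1)
    return 1 / (1 + math.exp(-linear_predictor))

# Hypothetical coefficients chosen for illustration only
a, b = -4.0, 1.0
probabilities = [predicted_probability(x, a, b) for x in range(0, 9)]
```

With these assumed values the probability is exactly 0.5 at x = 4, where the linear predictor is zero, and it rises as x increases.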
Log-linear analysis
- accommodates only categorical data
Bernoulli random variables
- variables with dichotomous (two-valued) outcomes, such as the binary dependent variable in logistic regression