Regression Flashcards
What is regression analysis
Analysis allowing for adjustment of confounders
Types of regression to know
linear - continuous numeric outcome which can be used for curves despite being called linear
logistic - binary outcome, transformation of outcome is modelled
simple vs multivariable regression
simple has one predictor so no accounting for confounders, multivariable adjusts for other factors
Linear regression coefficients
y is outcome
a shows position of line (doesn’t always have practical meaning), shows as (constant) on SPSS
b is regression coefficient, measures association between predictor and outcome
b1, b2 etc are partial regression coefficients for multiple regression coefficients for each variable, assuming all other confounders remain constant
P-values for all coefficients
dummy variables use
Used for categoric confounders
One less dummy variable than number of categories
Set one as reference category
Dummy variable ‘b’ values are compared to reference variable
Archaic/Bayes information criterion
AIC/BIC, lower the better
Measures how well the model fits the data
Residuals
Differences between observed and predicted values for model
Minimising residuals overall finds line of best fit
Linear regression assumptions
continuous outcome
Predictor variables not dependent on each other - multicollinearity
Residuals normally distributed around 0
Residuals have constant variance
Multicollinearity measure
Variance inflation factor (VIF), e.g. 10 means 90% of variability explained by other predictors
Lower is better, >5 should be investigated
Homoscedasticity
Constant variance of residuals for all values of predictors and outcome, one of the necessary assumptions for linear regression
Transformation for logistic regression
Binary outcomes interpreted as odds then log(odds) used as this is linear, spss fits log(odds) in same way as linear regression
Interpreting logistic regression outcome
exponential of (b coefficient (in binary variables) x category value) gives to back transform, gives odds of outcome The odds ratio tells you how the odds of the outcome changes for every one unit increase in the predictor variable for numeric variable confounders
Logistic regression assumptions
No multicollinearity (VIF can be used to check) Binary variable No need to check residuals, don't have to have normal distribution