multiple linear and logistic regression Flashcards
1
Q
What is multiple linear regression?
A
- models the relationship beyween one dependent variable and multiple independent variables using a linear equation
2
Q
What are the underlying principles of logistic regression?
A
- models the probability of binary outcomes by using a logistic function to transform a linear combination of inderpendent variables
3
Q
Advantages of linear regression
A
- explains relationships between variables
- can predict values of the dependent variable
- accounts for the influence of multiple predictors simultaneously
4
Q
Advantages of logistic regression
A
- predicts categorical outcomes
- outputs possibilities for classifications
- handles non-linear relationships via the logit transformation
5
Q
When is multiple linear regression used in data analysis?
A
- when predicting a continuous dependent variable based on multiple continuous or categorical independent variables
6
Q
When is logistic regression used?
A
- when predicting a binary outcome based on one or more independent variables (mutiple predictor values
- independent variables are casually related to dependent variable
- hypothesis testing (same as t test)
- importance of multiple variables predicting another (R^2)
7
Q
How can you recognise and interpret output from multiple linear regression?
A
- look for coefficients that indicate the strength and direction of relationships between predictors and the dependent variable, r squared for model fit and p values for statistical significance
8
Q
how to recognise and interpret output from logistic regression?
A
- analyse the odds ratios to understand the effect of predictors
- look at p values for significance
- use the classification accuracy or confusion matrix to evaluate model performance
9
Q
What is a key conclusion you can state from multiple linear regression output?
A
- if independent variables significantly predict the dependent variable and how much variance they explain
10
Q
what conclusion of logistic regression output can you state?
A
- which predictors significantly influence the probability of the binary outcome and the odds associated with these predictors
11
Q
What is a cofounding variable?
A
- unequally distributed but has an affect on outcome
12
Q
What are the advantages of multiple linear regression?
A
- able to adjust for confounding variables
- examine the effect of multiple independent predictors on an outcome
- improves the amount of the variability you can explain in the dependent variable
- interaction effects
- perform multiple hypothesis tests
- more accurate predictions of outcome variable