Logistic Regression Continued Flashcards

Question

What is the table 2 fallacy?

Answer 1

Reports of multiple adjusted effect estimates from a single model. This practice, which remains common in published literature, can be problematic when different types of effect estimates are presented together in a single table. It is not obvious how to interpret coefficients * There are different reasons why a variable may show no effect * Keep in mind: * Confounders * Mediation * Colliders

Answer 2

A confounder

Answer 3

The association between the variable and outcome will be distorted

Answer 4

The correct association

Answer 5

FALSE Confounders do not lie on the causal pathway between the variable of interest and outcome

Answer 6

A mediator

Answer 7

The association between the variable of interest and outcome

Answer 8

A collider

Answer 9

The association between outcome and exposure

Answer 10

Hospital admission Adjusting for hospital admission can make two unrelated conditions appear associated

Answer 11

* Attempt to adjust for confounding * Include Variables that you suspect might cause the variable of interest or the outcome * Include potential confounders regardless of whether they are statistically significant * Be careful adjusting for variables that could be caused by the variable of interest * Mediators * Colliders No need to report coefficients from suspected confounders in model

Answer 12

* Be mindful of possible causal structures * Presence of mediators may mean that no association does not mean that variable is not a cause or would not be associated if intermediate factor not adjusted for * Colliders can distort associations * Be clear in interpretation that adjustment for other factors has occurred

Answer 13

Often appropriate to include all variables that may be predictive in model * You do not necessarily need to report effects of components * If you do avoid providing causal interpretation

Answer 14

Multiple logistic regression

Answer 15

1. A beta parameter and added to the other 2. assumption about the linear predictor 3. “linearly” 4. Additive

Answer 16

The effect of a change in one variable is the same regardless of the value the other variables take.

Answer 17

A variable which changes the effect of a variable of interest and is not on the causal pathway between the variable of interest and the outcome For example the effect of age on heart attack risk may be moderated by smoking status

Answer 18

Effect modification.

Answer 19

Effect modification

Answer 20

Whether treatment effects differs according patient characteristics E.g. is treatment more effective for smokers than non-smokers Here the effect is modified by smoking status

Answer 21

An interaction term

Answer 22

multiplied together

Answer 23

To test whether an effect modifier is statistically significant you must test whether the interaction term is significant Comparing p-values at different levels of the moderator will lead to increased type 1 error rates as multiple tests are carried out. This leads to concluding there is moderation, when there is not. When a categorical variable is involved in effect modification all levels must be tested simultaneously (likelihood ratio test, wald tests) Report confidence intervals and estimates for each level of effect modifier

Answer 24

Can be used to improve the fit of the model to the data This may be useful when adjusting for confounding to capture true relationship between confounder and outcome (rarely used) Also good for prediction when aim is to find model which best predicts unseen data Excessive use of interaction or non-linear terms can lead to overfitting or challenges in interpretation T

Answer 25

Variables act additively in logistic regression

Answer 26

The effect of a variable to be different depending on the value of other variables in the model e.g effect modification

Answer 27

p-value for the interaction term should

Answer 28

A logistic regression model The prediction is the value of their linear predictor

Answer 29

1. The linear predictor 2. The transformation

Answer 30

AIC and BIC Calibration Discrimination

Answer 31

The model is good

Answer 32

AIC (Akaike information criteria) and BIC (Bayesian information criteria) take into account the how well the model fits and the number of parameters BIC penalizes more heavily for more parameters in model Often the same conclusions are the same from both measures

Answer 33

The level of agreement between observed and predicted values – if we predict a 20% probability of an event for a group of people the observed frequencies should be close to 20%

Answer 34

By statistical tests, plots or linear regression

Answer 35

A slope and intercept – these are sometime referred to as calibration in the small and calibration in the large

Answer 36

Covariates

Answer 37

using plots comparing predicted probabilities to observed outcomes One approach is to group observations by predicted probabilities, then plot predicted frequencies vs observed frequencies from each group Alternatively the Lowess smoother can compare predicted probabilities to the outcome Calibration plots should be compared to the line with slope 1 and intercept zero – this line represents perfect calibration

Answer 38

Hosmer-Lemshow test Group observations into bins based on predicted values or covariate patterns, compare observed to predicted frequencies in the bins Lower p-value = poorer fit: A statistically significant result indicates evidence that differences between observed and predicted frequencies were unlikely to happen by chance. Can either select a number of bins e.g. 10 or if only categorical covariates have a bin for every pattern

Answer 39

p-value dependent on sample size – whether test is significant depends on goodness of fit and sample size p-value can vary with number of bins – no clear way to decide

Answer 40

Ability of model predictions to differentiate between those with and without the outcome

Answer 41

The predictions are higher for those with the outcome and lower for those without.

Answer 42

When there high and low predictions occur with similar frequencies for those with (or without) the outcome

Answer 43

C-statistic

Answer 44

Area under the curve (AUC)

Answer 45

C-statistic is the probability a randomly selected person who has a positive outcome has a higher predicted probability than a randomly selected patient who does not have a positive outcome.

Answer 46

By comparing all possible pairs of patients with different outcomes and comparing predicted probabilities.

Answer 47

No predictive ability

Answer 48

Perfect prediction

Answer 49

The definition of a c-statistic, they always agree

Answer 50

Sensitivity with specificity Sensitivity: True positive rate - Correct prediction for someone who has the outcome Specificity: True negative rate - Correct prediction for someone who does not have the outcome

Answer 51

By taking the area under the curve formed by varying a cutoff for classifying positives and negatives Sensitivity is plotted against 1 - specificity No predictive ability would result in the sensitivity being equal to 1– specificity

Answer 52

Sensitivity and specificity were always one

Logistic Regression Continued Flashcards

(78 cards)