Epi Methods 753 Flashcards
2 categories where 1 is reference group (typically “unexposed”)
Dichotomous Variable
Parameterization of variable into discrete categories
Categorical Variable
Categorical variable that assigns 0 or 1
Binary Variable
Categorical variable that doesn’t have ordering/order not of interest; collection of k-1 binary indicator variables
Nominal Variable
Categorical variable that has ordering/order of interest; collection of binary variables assigned score; step between categories constrained to be equal
Ordinal Variable
Test Ho that B=0, where B is coefficient for category score variable; if p<0.05 best estimate for step from one category to next is different from 0
Mantel Test for Trend
Variable can take any value between lower & upper limit
Continuous Variable
Divide continuous variable by factor; coefficient of variable affected
Rescaling
Subtract continuous variable by factor; intercept affected
Centering
Closely related to counterfactual; compare observed outcome to non-observed (counterfactual) outcome; estimate measures of causal effect by measures of association assuming exchangeability (differences due to confounding)
Potential Outcomes
Observed outcomes in unexposed are good stand-in for unobserved potential outcomes for exposed persons under no exposure & vice versa; not testable but met in expectation with randomization
Exchangeability Assumption
Comparison of pre-treatment covariates in exposed & unexposed groups; comparability doesn’t guarantee assumption met
Exchangeability Assessment
Relax exchangeability assumption to be conditional on covariates; assumes no unmeasured confounders
Conditional Exchangeability
Used to assess how average value of continuous outcome varies systematically with X’s; E[Y] = B0+B1X1+…; B1=average difference (cross-sectional) or change (longitudinal) in Y per 1-unit X1
Linear Regression
Used to assess how log odds binary outcome varies systematically with X’s; log(odds Y)=B0+B1X1+…; B1=difference in log(odds Y) per 1-unit X1; PrOR for cross-sectional or ROR for longitudinal
Logistic Regression
Risk or prevalence > 10%
OR Overestimates RR or PrR
Used to assess how log probability binary outcome varies systematically with X’s; log(Pr(Y=1))=B0+B1X1+…; B1=difference in log(prob Y) per 1-unit X1; PrR for cross-sectional or RR for longitudinal
Log-Binomial Regression
Path from E to O that starts with E & all arrows point in same direction
Causal Path
Any other path from E to O; unconditionally open backdoor paths are confounded vs. unconditionally closed backdoor paths are blocked at collider
Non-Causal Path
Covariate set that leaves all causal paths open & non-causal paths closed vs. does this without any extra variables
Sufficient vs. Minimally Sufficient
Variables only causally associated with exposure; decreases precision if put into model
Instrument
Not necessary for confounder control but may increase precision
Variables Associated with Outcome
Confounding is causal concept but collapsibility is statistical concept; depends on prevalence of outcome & type of measure of association
Problems with Collapsibility Definition
Stratify table by exposure, do not include outcome, & do not include p-values
Causal Inference Table 1
Give similar results when number of confounders is small & no confounders are continuous
Stratified Analysis vs. Regression
Expresses incomplete adjustment of confounding variables due to mismeasurement or misspecification
Residual Confounding
Prioritize accurate representation & interpretation of exposure but fit for confounders
Causal Inference Modeling
2 or more risk factors modify effect of each other with regard to occurrence/level of outcome; effect of E on O differs across strata of X; potential outcomes indexed by E only & estimated conditional on X (1 exchangeability assumption)
Effect Measure Modifier
Risk of O in presence of both E & X differs from what would be expected based on effect of E alone & X alone; potential outcomes indexed by both E & X (2 exchangeability assumptions)
Causal Interaction
Difference of risk differences (liner), ratio of odds ratios (logistic), or ratio of risk ratios (log-binomial)
Coefficient of Product Term
Difference of risk differences expressed as proportion of reference risk (RR00); R11-R01-R10+1
Relative Excess Risk due to Interaction (RERI)
P-value of Wald test for interaction coefficient; LRT (or F-test for linear models); underpowered & likely to return false positives
Test of Homogeneity
Used to identify potential associations with outcome; hypothesis generating; non-causal, potential for multiple comparisons, different across studies
Risk Factor Analysis
Stratify table by outcome, no p-values
Risk Factor Analysis Table 1
Prioritize interpretability & model fit of all covariates
Risk Factor Analysis Modeling
Assign predicted probability of condition based on baseline characteristics; use logistic regression then convert to probability
Prediction Model