Covariates and confounders Flashcards
Covariates
Baseline characteristics of participants that explain part of the variability in outcome
Why does this matter in clinical trials?
Imbalances between trial groups may bias estimate of treatment effect
• This could be positive (overestimate) or negative (underestimate)
• e.g. if drug X works in men not women and there are more men in the treatment than placebo group this may overestimate the overall treatment effect
Differences in treatment effect between subgroups may be clinically important
• e.g. if drug X works in men but not women this is useful information in healthcare
Confounders
- Variables related to both intervention and outcome but not on the causal pathway
- Distort the effect of the intervention on the outcome, create bias
Why does this matter in clinical trials?
• Imbalances between trial groups may bias estimate of treatment effect
• This could be positive (overestimate) or negative (underestimate)
• e.g. patients taking drug Y were more likely to develop abnormal liver function tests than patients taking placebo
• On investigation more patients in the treatment group drank alcohol to excess than those in the placebo group
• On subgroup analysis, stratified by alcohol consumption, there was no difference in liver function tests between treatment and placebo groups; alcohol was a confounder that had positively biased the estimate of the safety outcome
Managing covariates and confounders
Reduce bias of treatment effect
– Randomization
• Stratification by most influential covariates or confounders
– Adjust analysis for covariates or confounders
• Stratification or regression
• Reduces their impact on estimates of treatment effect
Identify significant covariates
– Subgroup analysis
• Identify subgroups of participants with different benefits/risks from treatment
Stratification
- Sorting participants into groups e.g. by confounders or covariates
- Can stratify at randomization – or if this fails during analysis
• Stratification in analysis
– Investigate effects between intervention and outcome within subgroups
• Covariate – relationship will persist
• Confounder – relationship will disappear
Regression
• Unadjusted analysis
– Estimate of treatment effect with no account taken of covariates or confounders
• Adjusted analysis
– Estimate of treatment effect taking into account covariates and confounders
• Ideally pre-specified to reduce potential bias by ‘fishing’
• Achieved using regression models
– Factors chosen for adjustment
• Strong predictors of outcome e.g. correlated with r>0.50
• Clear ‘large’ imbalance despite randomization
Binary clinical trial end points
Events
Disease onset or flare
• e.g. heart or asthma attack, epileptic seizure, arthritis flare, stroke, COVID infection
Healthcare contact or not, may specify:
• Scheduled or unscheduled
• GP, hospital, emergency room
Survival or death, may specify:
• Disease-free survival, overall survival
Composite endpoints – occurrence of not
• e.g. Major adverse cardiac events (MACE) – first occurrence of any of nonfatal stroke, nonfatal myocardial infarction or cardiovascular death
Risk
Probability of an event occurring over a pre-specified time interval
Absolute risk (AR)
• Number of people with event/total number of people
Relative risk (ratio)
- Absolute risk INTERVENTION group/absolute risk COMPARATOR group
- No difference – relative risk = 1, treatment better than control, relative risk <1
Absolute risk reduction (ARR)
- Absolute risk COMPARATOR group MINUS absolute risk INTERVENTION group
- No difference ARR = 0
Relative risk reduction (RRR)
AR COMPARATOR - AR INTERVENTION)/ARCOMPARATOR
• No difference RRR = 0
Number needed to treat
• Number treated (100)/absolute risk reduction (for percentage, number treated is 100)
95% confidence interval
95% chance that the true value is between these numbers
Descriptive vs inferential statistics
• Descriptive statistics – Summarize characteristics of a dataset • e.g. mean, standard deviation – No uncertainty • Inferential statistics – Calculated from descriptive statistics • Estimate of population values • Allow hypothesis testing – Uncertainty
Estimates
- Point estimate – single value estimate of a parameter (e.g. sample mean as point estimate of population mean)
- Interval estimate – a range of values within which the parameter is expected to lie (confidence intervals for example)