SURVIVAL ANALYSIS Flashcards
Right censored
When the true survival time is incomplete, after the follow-up period
- e.g., loss to follow-up, withdrawals, no event by end of study
true survival time >= observed
Left censored
When the event occurs before we had the chance to record it
true survival time <= observed
Interval censored
Event occurs between t1 and t2, but we are not sure when
Main differences between S(t) and h(t)
S(t) is survivor function. It gives the probability that a person survives longer than a specified time. (focused on not failing)
h(t) is hazard function (conditional failure rate). It gives the instantaneous potential for the event to occur, given that the individual has survived up to time t. (focused on failing)
What is the use of h(t)?
Identifying specific models (exponential, weibull, lognormal)
3 assumptions to censoring
- Random: random overall, i.e., subjects censored at time t are representative of all study subjects who remain at risk with respect to survival experience (censored failure rate = observed failure rate; much stronger assumption)
- Independent: random within subgroups (if we have only one group, then random = independent; useful for validity)
- Non-informative: distribution of time to event does not provide information on time to censorship, and vice-versa
Non parametric survival analysis?
- Life tables
- Kaplan Meier
Make no assumption about the distribution
Semi parametric survival analysis?
- Cox PH model
Assumes that the hazard functions are proportional
But leaves hazard rate unspecified (no intercept, it’s integrated to baseline hazard)
Parametric survival analysis?
Strong assumptions, with Weibull, exponential, Gompertz, lognormal
When is the Cox PH a no go?
When the effect of the exposure varies with time (i.e., interaction) - it violates the proportional hazards assumption
What is there to know about the baseline hazard in Cox PH?
It’s only a function of time (not of covariates) and it’s not directly estimated
What are the main characteristics of the hazard function?
- it’s a rate
- it’s always positive
- infinite upper bound
- can decrease, increase or stay constant
How do we go from S(t) to h(t)?
If h(t) is constant, then S(t) = exp(-lambda*t)
If not, the S(t) = exp(negative integral of the hazard function between 0 and t)
Assumptions that come with life tables?
- Censoring occurs uniformly (therefore we halve the time at risk of censored subjects)
With Kaplan-Meier, what happens with ties?
The censoring is assumed to occur right after death, which leads to slightly overestimating the survivor function
What do we use to test if Kaplan-Meier curves are different?
- Log-rang test
- Wilcoxon test
- Likelihood ratio test
What about CI and Kaplan-Meier?
We have a confidence interval for each estimate of S(t)
Log-rank test for Kaplan-Meier
- A chi2 test, with equal weight on every failure
- Good for: test differences that fit the proportional hazard model (so before using Cox); RCT with no confounders
- Bad for: h(t) crossing; needing to control for confounding
Wilcoxon test for Kaplan-Meier
- A log-rank test (therefore, a chi2 test) that weights strata by size, giving more weight to earlier time points
- More powerful if we don’t have proportional hazards