Survival Analysis I Flashcards

Question

How do you estimate the median survival time in Stata?

Answer 1

stci, median

Answer 2

A regression model estimating the hazard ratio for different covariates while controlling for confounders

Answer 3

Models that assume a specific distribution for survival times (e.g., Weibull, Exponential)

Answer 4

The instantaneous rate of event occurrence at a given time point

Answer 5

The hazard ratio between groups remains constant over time

Answer 6

Time-to-event analysis

Answer 7

People drop out before the end of follow-up or join late. Calculating incidence risk in the usual way therefore tends to underestimate the true incidence

Answer 8

Those followed for the whole study had a greater chance of experiencing the event, simply because they were followed for longer. Estimates of the event incidence are likely to be underestimated if you assume everyone was followed for the same length of time. If the pattern of censoring differs between groups, there is the potential for comparisons to be biased

Answer 9

If looking at the effect of SES on heart attack risk, the wealthier may be more likely to participate for the whole study period than those of low SES. This may result in underestimations in the lower SES group

Answer 10

As 'failures'

Answer 11

- Analysing the length of time until the occurrence of a binary event
- The data are positive (i.e., time to event cannot be <0) and the distribution is usually skewed (couldn't do a linear regression)
- Data are usually censored (people are usually event-free before censoring)

Answer 12

- Administrative censoring: person has not (yet) experienced the event at the time the study database is closed for analysis
- Loss-to-follow-up: person drops out before experiencing the event and before the study has ended
We don't know the mechanism by which someone's data has been censored, so both types are treated the same.

Answer 13

Maximum of one

Answer 14

- Survival times are independent of each other, given the values of the predictor variables
- This means the survival time of one individual (/observation) should not influence the survival time of another (/observation)
E.g., if someone has 2 heart attacks, the second one isn't counted. You only have one line of data per person; otherwise the assumption will be violated

Answer 15

- Only possible for an event to occur once (e.g., death)
- Underlying process is altered by the first event occurring (e.g., myocardial infarction). The predictors of the first may be different to those of the second.
If an event can occur multiple times (e.g., COVID-19), can consider "time to first event"

Answer 16

For americanisation

Answer 17

Subtracting the entry date from the event/censoring date
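
A minimal sketch of this subtraction, written in Python rather than Stata and using made-up dates. The variable names and the 365.25-day year are illustrative assumptions, not part of the course material.

```python
from datetime import date

# Hypothetical dates for one participant (illustrative only).
entry = date(2015, 3, 1)       # date of entry into the study
exit_date = date(2019, 3, 1)   # date of event or censoring, whichever came first

# Follow-up time = event/censoring date minus entry date.
followup_days = (exit_date - entry).days
followup_years = followup_days / 365.25  # assumed days-to-years conversion

print(followup_days, round(followup_years, 2))
```

A negative result here would point to a date-entry error, which is worth checking before any analysis.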

Answer 18

Have 2 columns instead of 1 (one column for time and another for the event)

Answer 19

People are followed for different lengths of time. We can use rates (death or incidence rates which cannot be expressed as a percentage) to consider PYs at risk

Answer 20

- Numerator = number of events (d)
- Denominator = total person-time at risk, e.g., PYs at risk (rather than no. of people)
- Rate per PY = number of events (d) / PYs at risk (py)
- Rate (/100 PYs) = d / py * 100
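
The rate formula can be sketched in a few lines of Python. The function name `rate_per` and the numbers (12 events over 480 PYs) are hypothetical, chosen only to show the arithmetic.

```python
def rate_per(events, person_years, per=100):
    """Event rate per `per` person-years: d / py * per."""
    if person_years <= 0:
        raise ValueError("person-years at risk must be positive")
    return events / person_years * per

# Hypothetical study: 12 events over 480 person-years at risk.
d, py = 12, 480.0
rate_per_py = rate_per(d, py, per=1)     # rate per person-year
rate_per_100 = rate_per(d, py, per=100)  # rate per 100 person-years

print(rate_per_py, rate_per_100)
```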

Answer 21

E.g., whether 100 or 1000 is used is down to individual judgement as long as it makes sense

Answer 22

Add up the time people were in the study for

Answer 23

- Assumes the rate is constant over time: the frequency an event occurs in the 1st year of follow-up is the same as the frequency in the 2nd, 3rd, 10th, ... year
- May be expressed relative to any period of time (per 100 PYs, per 1000 person-months, etc.) depending on the event's frequency
- Can compare rates in two groups with the IRR (rate in group 1 / rate in group 2)
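
As a rough illustration of the IRR, the sketch below divides one group's rate by the other's. The function name `irr` and the counts are hypothetical.

```python
def irr(events_1, pt_1, events_2, pt_2):
    """Incidence rate ratio: (rate in group 1) / (rate in group 2)."""
    rate_1 = events_1 / pt_1
    rate_2 = events_2 / pt_2
    return rate_1 / rate_2

# Hypothetical groups: 30 events in 1000 PYs versus 15 events in 1000 PYs.
ratio = irr(30, 1000.0, 15, 1000.0)
print(round(ratio, 2))
```

Because the ratio divides one rate by the other, the scaling unit (per PY, per 100 PYs, ...) cancels as long as both rates use the same one.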

Answer 24

One number that averages the frequency of the event over the time period. This may not be appropriate if analysing survival after surgery, where risk of death is highest a few days after surgery

Answer 25

By pulling them up a bit due to underestimations resulting from not considering differing follow-up periods

Answer 26

Multiplication of independent events

Answer 27

Year: 0.6
No. at risk: 21
No. events: 1
No. censored: 0
Prob. event at this time: 1/21 = 0.048
Prob. no event at this time (p(t)): 20/21 = 0.952
Prob. remaining event free up to and including this time: 1.00 x 0.952 = 0.952

Answer 28

Convention to not consider censoring until time 0.6. The event is considered at time 0.5

Answer 29

Cumulatively decreases as the sample size has reduced (the steps will get bigger)

Answer 30

0.952 x 0.950 = 0.904
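
The running product can be sketched as a small Kaplan-Meier routine in Python. This is a simplified illustration, not Stata's implementation: the function name `kaplan_meier` and the five-person dataset are hypothetical. Each step multiplies the previous survival probability by (no. at risk - no. events) / no. at risk, the same kind of factor as 20/21 above.

```python
def kaplan_meier(times, events):
    """Return [(time, n_at_risk, n_events, survival)] at each distinct event time.

    times  : follow-up time for each person
    events : 1 if the event occurred at that time, 0 if censored
    """
    data = sorted(zip(times, events))
    n_at_risk = len(data)
    survival = 1.0
    rows = []
    i = 0
    while i < len(data):
        t = data[i][0]
        d = c = 0  # events and censorings tied at time t
        while i < len(data) and data[i][0] == t:
            d += data[i][1]
            c += 1 - data[i][1]
            i += 1
        if d > 0:
            # People censored at t are still counted as at risk at t.
            survival *= (n_at_risk - d) / n_at_risk
            rows.append((t, n_at_risk, d, survival))
        n_at_risk -= d + c
    return rows

# Hypothetical data: 5 people, 3 events, 2 censored.
km = kaplan_meier([2, 3, 3, 5, 8], [1, 0, 1, 1, 0])
for t, n, d, s in km:
    print(t, n, d, round(s, 3))
```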

Answer 31

CIs to show the range within which the true probability is likely to lie at any given timepoint

Answer 32

No, they can start at the bottom (zero) and go up. The y-axis range is often altered to reduce white space. However, this can make things look more common than they are. Therefore, some journals want the scale to run from 0 to 1

Answer 33

A chi-squared test on a tabulation and a logistic regression to estimate predictors of death

Answer 34

Check no. of exclusions - Stata will drop those with a negative follow-up time (e.g., erroneously inputting a date so follow-up is -10 years). Need to check observations and failures in a cross-tab, as well as the longest time someone was followed for

Answer 35

Widen due to less certainty and less accurate estimations of the true population

Answer 36

sts list
Other commands will allow you to see how many were alive at 5 years, for example

Answer 37

Either the probability of having an event (curve goes upwards) or of remaining event-free (curve goes downwards)

Answer 38

When the no. of patients remaining under follow-up and event-free is small (<10?)

Answer 39

Should be shown at regular intervals under the x-axis

Answer 40

No - sometimes not enough data (e.g., only 5% experienced event by end of follow-up)
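
A small Python sketch of why this happens, assuming the usual definition of the median as the first time at which the estimated survival probability falls to 0.5 or below. The function name `median_survival` and both curves are hypothetical.

```python
def median_survival(km_steps):
    """km_steps: list of (time, survival probability) in increasing time order.

    Returns the first time at which survival is <= 0.5, or None if the
    curve never falls that far (median not estimable).
    """
    for t, s in km_steps:
        if s <= 0.5:
            return t
    return None

# Hypothetical curves (illustrative numbers only).
reaches_half = [(2, 0.8), (3, 0.6), (5, 0.3)]
stays_high = [(1, 0.98), (4, 0.95)]  # only 5% have had the event

print(median_survival(reaches_half))  # 5
print(median_survival(stays_high))    # None
```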

Answer 41

By treating observations as censored, we assume that, were people to have been followed after censoring, they would have experienced the same event rate as those not censored (i.e., all those who are censored are similar to each other). This may not be the case if censoring is due to some other event that happened to the patient

Answer 42

Informative and neat graphs through labels and legends

Answer 43

Takes whole follow-up period into account and does not require us to know anything about the shape of the survival curve or distribution of survival times

Answer 44

Sum of (O-E)^2 / E across the groups, which is then compared to a chi-squared distribution to obtain the relevant p-value
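
The calculation can be sketched in Python as below. This follows the (O-E)^2 / E form on this card. Statistical software may use a variance-based version of the test, so its output can differ slightly. The function name `logrank_chi2` and the six-person dataset are hypothetical, and the p-value line assumes two groups (1 degree of freedom).

```python
import math

def logrank_chi2(times, events, groups):
    """Approximate log-rank statistic: sum over groups of (O - E)^2 / E."""
    labels = sorted(set(groups))
    observed = {g: 0.0 for g in labels}
    expected = {g: 0.0 for g in labels}
    event_times = sorted({t for t, e in zip(times, events) if e == 1})
    for t in event_times:
        at_risk = {g: 0 for g in labels}
        d_at_t = {g: 0 for g in labels}
        for ti, ei, gi in zip(times, events, groups):
            if ti >= t:                # still under follow-up at time t
                at_risk[gi] += 1
            if ti == t and ei == 1:    # event exactly at time t
                d_at_t[gi] += 1
        n = sum(at_risk.values())
        d = sum(d_at_t.values())
        for g in labels:
            observed[g] += d_at_t[g]
            # Expected events if all groups shared the same risk at time t.
            expected[g] += d * at_risk[g] / n
    chi2 = sum((observed[g] - expected[g]) ** 2 / expected[g] for g in labels)
    return chi2, observed, expected

# Hypothetical data: group A fails early, group B fails late, no censoring.
times = [1, 2, 3, 4, 5, 6]
events = [1, 1, 1, 1, 1, 1]
groups = ["A", "A", "A", "B", "B", "B"]

chi2, obs, exp = logrank_chi2(times, events, groups)
# Upper tail of a chi-squared distribution with 1 df (two groups only).
p_value = math.erfc(math.sqrt(chi2 / 2))
print(round(chi2, 3), round(p_value, 3))
```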

Answer 45

sts test varname
Where varname refers to a grouping variable, such as one defined by age

Answer 46

When the risk of an event is consistently greater for one group than another

Answer 47

Survival curves should always be plotted. Never just do p-values - they are there to complement the graph

Answer 48

Assumes those who drop out aren't different to those who stayed. If those who dropped out were really sick, drop-out isn't random, and the death rate ends up being calculated from healthier participants (who will comprise the majority by the end of study follow-up). In the real world, there may be some differential drop-out. If more than 10% are lost to follow-up, this may be a concern.

Answer 49

- Censoring is unrelated to prognosis (non-informative censoring)
- Survival probabilities are the same for subjects recruited early and late
- Events happened at the times specified
- Independence