Ch 3. The Cox PH Model and Its Characteristics Flashcards

1
Q

Tests statistics typically used with ML estimates (2)

A
  • Wald statistic
  • likelihood ratio (LR statistic): makes use of the log likelihood statistic.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Testing the significance of interaction using Likelihood Ratio (LR) statistic

A

We need to compute the difference between the log likelihood statistic of the reduced model which does not contain the interaction term and the log likelihood statistic of the full model containing the interaction term.

In general, the LR statistic can be written in the form -2lnLR minus -2lnLF, where R denotes the reduced model and F denotes the full model.

LR = -2 ln LR - (-2 ln LF)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Z Wald Statistics: How to obtain p-value in model

A

p-value is obtained by dividing the coefficient of the product term by its standard error and then assuming that this quantity is approximately a standard normal or Z variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Statistical objectives of model (3)

A
  • test for significance of effect
  • point estimate of effect
  • confidence interval for effect
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Point estimate of effect

A

A point estimate of the effect of the treatment is provided in the HR column. This value gives the estimated hazard ratio (HR) for the effect of the treatment;

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Calculation of confidence interval for effect

A
  1. Compute a 95% confidence interval for the regression coefficient of the Rx variable (‘beta’1).
  2. Exponentiate the two limits obtained for the confidence interval for the regression coefficient of Rx

exp[ ‘beta’1 +/- 1.96 sqrt (Var • ‘beta’1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Confounding (HR)

A

Crude versus adjusted HR are meaningfully different.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Adjusted survival curves vs KM curves

A

Adjusted survival curves are mathe- matically different from Kaplan–Meier (KM) curves. KM curves do not adjust for covariates and, therefore, are not computed using results from a fitted Cox PH model. Adjusted: Adjusted for covariates Use fitted Cox model KM: No covariates No Cox model fitted

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

The formula for the Cox PH Model

A

The Cox model formula says that the hazard at time t is the product of two quantities.

The first of these, h0(t), is called the baseline hazard function.

The second quantity is the exponential expression e to the linear sum of ßiXi, where the sum is over the p explanatory X variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a time-independent variable?

A

A time-independent variable is defined to be any variable whose value for a given individual does not change over time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Property that makes the Cox model a semiparametric model

A

The baseline hazard, h0(t), is an unspecified fuction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a parametric model?

A

A parametric model is one whose functional form is completely specified, except for the values of the unknown parameters.

For example, the Weibull hazard model is a parametric model where the unknown parameters are lambda, p, and the ßi’s. Note that for the Weibull model, h0(t) is given by lambda*p*t^(p-1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Why is the Cox PH model is popular?

A

Cox PH model is ‘robust’: will closely approximate correct parametric model. (prefer parametric model if sure o correct model; e.g. use goodness-of-fit test.

Measure of effect: hazard ratio (HR) involves only ß’s, without estimating h0(t).

Can estimate h(t,X) and S(t,X) for cox model using a minimum of assumptions.

Cox model prefereed to logistic model:

  • uses more information: survival times and censoring
  • uses (0, 1) outcome and ignores survival times and censoring.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Why L is a partial likelihood (rather than a complete likelihood)

A

The term“partial” likelihood is used because the likelihood formula considers probabilities only for those subjects who fail, and d_oes not explicitly consider probabilities for those subjects who are censored._

Thus the likelihood for the Cox model does not consider probabilities for all subjects, and so it is called a “partial” likelihood.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Eq. partial likelihood

A

The partial likelihood can be written as the product of several likelihoods, one for each of, say, k failure times. Thus, at the f-th failure time, Lf denotes the likelihood of failing at this time, given survival up to this time.

L = L1 x L2 x L3 x … Lk = ΠLj (j = 1 , k)

Cox likelihood based on order of events rather than their distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Definition of ‘risk set’, R(t(f))

A

The set of individuals at risk at the jth failure

17
Q

Solution by iteration

A

The solutioni sobtained in a stepwise manner, which starts with a guessed value for the solution, and then successively modifies the guessed value until a solution is finally obtained.

18
Q

General definition of hazard ratio (HR)

A

In general, a hazard ratio (HR) is defined as the hazard for one individual divided by the hazard for a different individual.

The two individuals being compared can be distinguished by their values for the set of predictors, that is, the X’s

19
Q

General rule for (0, 1) exposure variable when there are product terms:

A
20
Q

Formula of Cox model hazard function and Cox model survival function

A
21
Q

Meaning of the PH Assumption

A

The PH assumption requires that the HR is constant over time, or equivalently, that the hazard for one individual is proportional to the hazard for any other individual, where the proportionality constant is independent of time.

PH: HR is constant over time, i.e., h-‘hat’- (t, X*) = constant x h-‘hat’- (t, X)

22
Q

Obtaining maximum likelihood estimates (score equations)

A

Once the likelihood is formulated, the question becomes:

which values of the regression parameters would maximize L?

The process of maximizing the likelihood is typically carried out by setting the partial derivative of the natural log of L to zero and then solving the system of equations (called the score equations).

23
Q

Defintion of time 0 (t0)

A

Starting time of the true survival time (starting point)

24
Q

Defining the starting point: Possible choices for time 0:

A
  • Study entry
  • Beginning of treatment
  • Disease onset
  • Disease diagnosis
  • Surgery
  • Point in calendar time
  • Birth
  • Conception
25
Q

Concept of left truncation at time t0

A

The subject is not observed from time 0 to t0.

If the subject has the event before time t0, then that subject is not included in the study.

If the subject has the event after time t0, the subject is included in the study but with the caveat that the subject was not at risk to be an observed event until time t0

26
Q

Types of left truncation (2)

A
  • Type 1 occurs if the subject has the event before t0 and thus is not included in the study. (e.g. exposure (E) under study causes individuals to die before they could enter the study, this could lead to a (selective) survival bias that would underestimate the effect of exposure)
  • Type 2 occurs if the subject survives beyond time t0 (i.e., t > t0). This is required in order for the subject to have his/ her survival time observed.

(e.g. developed the disease (time 0) prior to being diagnosed with the disease at time t0 and was observed after t0. Subject included in the study)

27
Q

Differences between left truncation / left censorship (4)

A

If a subject is left censored at time t, then that subject is:

  • (i) included in the study,
  • (ii) known to be event free at time 0,
  • (iii) known to be at risk for the event after time 0, and
  • (iv) known to have had the event before time t but with the exact time of event being unknown
28
Q

Approaches of measuring survival time (2)

A
  • Time-on-study
  • Age-at-follow-up until either an event or censorhip
29
Q

Time-on-Study vs. Age as Time Scale: (closed cohort vs open cohort)

Which one to use?

A

Key issue: Did all subjects first become at risk at their study entry?

Clinical trial:

  • Subjects start to be followed for the outcome after random allocation.
  • Reasonable to assume subjects start to be at risk upon study entry.

⇒ time-on-study as the time scale is typically appropriate.

Observational study:

  • Subjects already at risk prior to study entry
  • Unknown time or age when first at risk

Reasonable to assume that T = tr + t; where:
T = true survival time; tr = time at risk prior to study entry; t = observed time-on-study.

The individual’s true (i.e., total) survival time istherefore underestimated by the time-on-study information, i.e. the true survival time is left-truncated.

⇒ time-on-study follow-up times that ignores unkown delayed entry time may be questioned.

30
Q

Modified approach to use time-on-study vs. age as time scale

A