CT4 - Models & Mortality Flashcards

Question

What is the Marriage Model?

Answer 1

The marriage model is a time-inhomogeneous model under which an individual can be never married (B), married (M), divorced (D), widowed (W) or dead (Δ). A Markov jump process can be formulated on the state space *S = { B, M, D, W, Δ }*.

Answer 2

* **T_x** - future lifetime of a person aged *x*; **T₀** is usually written simply as **T**
* **ω** - limiting age, or maximum age a person can reach; typically restricted to between 100 and 120 in models for simplicity
* **F_x(t) = P[T_x ≤ t]** - distribution function of **T_x**, the probability that a life aged *x* dies by age *x + t*; **F₀(t)** is usually written as **F(t)**
* **S_x(t) = P[T_x > t]** - survival function of **T_x**, the probability that a life aged *x* survives for *t* years

Answer 3

The simple NCD model can be modified with a number of improvements. One such improvement is to have the following states:

* State 0: 0% discount
* State 1: 25% discount
* State 2: 40% discount
* State 3: 60% discount

The transition rules are as before, except that when there is a claim during the current year the discount status moves down two levels if there was also a claim in the previous year, or one level if the previous year was claim-free.

To form a Markov chain, State 2 needs to be split; otherwise the next move depends on history beyond the current state. Use a State 2⁻, reached after a single claim in State 3 in the previous year, and a State 2⁺, reached after a claim-free year in State 1.

(Transition graph not reproduced.)
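
The split-state chain can be written out as a transition matrix. The sketch below is only illustrative: the annual claim probability `p` is a hypothetical parameter that the card does not give, and the "as before" rules are assumed to mean that a claim-free year moves the policyholder up one level (or keeps them at State 3).

```python
def ncd_transition_matrix(p):
    """Return (states, P) for the NCD chain with State 2 split into 2+ and 2-.

    p is an assumed annual claim probability (hypothetical, not from the card).
    """
    q = 1.0 - p
    states = ["0", "1", "2+", "2-", "3"]
    P = [
        # to:  0    1    2+   2-   3
        [p,   q,   0.0, 0.0, 0.0],  # from 0: claim stays at 0, no claim moves to 1
        [p,   0.0, q,   0.0, 0.0],  # from 1: any claim falls to 0, no claim moves to 2+
        [0.0, p,   0.0, 0.0, q],    # from 2+: previous year claim-free, so down one level
        [p,   0.0, 0.0, 0.0, q],    # from 2-: claim in previous year, so down two levels
        [0.0, 0.0, 0.0, p,   q],    # from 3: a claim moves to 2-, no claim stays at 3
    ]
    return states, P


states, P = ncd_transition_matrix(0.2)
for name, row in zip(states, P):
    print(name, row)
```

Each row sums to one, and the next state depends only on the current (split) state, which is what makes the process a Markov chain.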

Answer 4

The restricted random walk is a simple random walk with boundary conditions. For example, if a gambler borrows money after losing everything, the lower barrier is a **reflecting barrier**; if they cash out as soon as they hit a target, the upper barrier is an **absorbing barrier**. More generally:

* An **absorbing barrier** is a value *b* such that *P( X_{n+s} = b | X_n = b ) = 1* for all *s > 0*, i.e. once *b* is reached, the system stops and remains in this state afterwards.
* A **reflecting barrier** is a value *c* such that *P( X_{n+1} = c + 1 | X_n = c ) = 1*, i.e. once *c* is reached, the random walk is pushed away.
* A **mixed barrier** is a value *d* such that *P( X_{n+1} = d | X_n = d ) = α* and *P( X_{n+1} = d + 1 | X_n = d ) = 1 - α*, where *α ∈ [0, 1]*, i.e. the barrier is an absorbing barrier with probability *α* and a reflecting barrier with probability *1 - α*.

(Transition graph not reproduced.)
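
As a rough illustration, the barrier definitions translate directly into rows of a transition matrix. The sketch assumes a lower reflecting barrier at `c`, an upper absorbing barrier at `b`, and a hypothetical up-step probability `p` in between; none of these values come from the card.

```python
def restricted_walk_matrix(c, b, p):
    """Transition matrix on states c, c+1, ..., b (requires b > c).

    c is a reflecting barrier, b an absorbing barrier, and p an assumed
    probability of stepping up from any interior state.
    """
    n = b - c + 1
    P = [[0.0] * n for _ in range(n)]
    P[0][1] = 1.0          # reflecting: from c the walk moves to c + 1 with certainty
    P[n - 1][n - 1] = 1.0  # absorbing: once at b the walk stays at b
    for i in range(1, n - 1):
        P[i][i + 1] = p        # step up
        P[i][i - 1] = 1.0 - p  # step down
    return P


for row in restricted_walk_matrix(0, 4, 0.5):
    print(row)
```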

Answer 5

Gompertz' Law: *μ_x = Bc^x*, where *B* and *c* are constants. Gompertz' Law is an exponential function of age and as such is usually appropriate at middle or older ages, say 35 to 90. The constants *B* and *c* can be found by taking values of *μ_x* at two different ages *x* and using them to set up simultaneous equations.
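
A minimal sketch of solving the two simultaneous equations: dividing one by the other eliminates *B*, leaving *c*. The function name and the numerical values are illustrative assumptions, not taken from any table.

```python
def fit_gompertz(x1, mu1, x2, mu2):
    """Solve mu_x = B * c**x for (B, c) given mu at two distinct ages."""
    # Dividing the two equations eliminates B: mu2 / mu1 = c**(x2 - x1).
    c = (mu2 / mu1) ** (1.0 / (x2 - x1))
    B = mu1 / c ** x1
    return B, c


# Illustrative values only, generated from B = 0.00005 and c = 1.09.
mu_40 = 0.00005 * 1.09 ** 40
mu_60 = 0.00005 * 1.09 ** 60
B, c = fit_gompertz(40, mu_40, 60, mu_60)
print(B, c)
```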

Answer 6

Makeham's Law: *μ_x = A + Bc^x*, where *A*, *B* and *c* are constants. Makeham's Law adds a constant term to the exponential function. This constant part of *μ_x* implies there is an element of the force of mortality that is not linked to age, for example accidental death. The constants *A*, *B* and *c* can be found by taking values of *μ_x* at three different ages *x* and solving the resulting equations.
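
The three equations are easiest to solve when the ages are equally spaced, which is an assumption made here for convenience: differencing removes *A*, and the ratio of the differences gives *c^h*. The function name and the values are illustrative only.

```python
def fit_makeham(x, h, mu1, mu2, mu3):
    """Solve mu_x = A + B * c**x given mu at ages x, x + h and x + 2h."""
    # Differences remove A: mu2 - mu1 = B c^x (c^h - 1), mu3 - mu2 = B c^(x+h) (c^h - 1).
    c_h = (mu3 - mu2) / (mu2 - mu1)
    c = c_h ** (1.0 / h)
    B = (mu2 - mu1) / (c ** x * (c_h - 1.0))
    A = mu1 - B * c ** x
    return A, B, c


# Illustrative values only, generated from A = 0.0005, B = 0.00005, c = 1.09.
mu = [0.0005 + 0.00005 * 1.09 ** age for age in (40, 50, 60)]
print(fit_makeham(40, 10, *mu))
```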

Answer 7

The curtate expectation of life for a life aged *x*, denoted by *e_x*, is *e_x = E[K_x]*, where *K_x* is the curtate future lifetime, i.e. the integer part of *T_x*.

Answer 8

The two-state Markov survival model assumes:

1. The Markov assumption
2. *_{dt}q_{x+t} = μ_{x+t} dt + o(dt)* for *t > 0*, where *o(dt)* is a small correction term *(one for which o(dt)/dt tends to zero as dt tends to zero)*
3. *μ_{x+t}* is constant for positive integers *x* and *0 ≤ t < 1*

Answer 9

Consider life *i* in an observation of *N* lives, where *1 ≤ i ≤ N*. Let:

* *x + a_i* - exact age at which observation of life *i* starts
* *x + b_i* - exact age at which observation of life *i* ends, where *0 ≤ a_i < b_i ≤ 1*
* *D_i* - variable indicating whether life *i* is observed to die during the observation: **1** if life *i* dies, **0** if not
* *V_i* - waiting time, the actual time for which life *i* is observed: the age when observation ceases less the age when it began (observation ends either on death or at the planned end of observation, which is its upper bound), so *0 < V_i ≤ b_i - a_i ≤ 1*
* *D_i = 0 ⇔ V_i = b_i - a_i*, i.e. if there is no death, the waiting time is the difference between the start and end of observation
* *D_i = 1 ⇔ 0 < V_i < b_i - a_i*, i.e. if there is a death, the waiting time is less than the difference between the start and end of observation

Answer 10

Let *f_i(d_i, v_i)* be the joint distribution of *(D_i, V_i)*, where *(d_i, v_i)* represents a single observation, i.e. a sample from *(D_i, V_i)*.

Answer 11

The likelihood function for *μ*, based on an observation of *(d, v)*, is *L(μ; d, v) = e^{-μv} μ^d*. The maximum likelihood estimate (MLE) and maximum likelihood estimator for *μ* are given respectively by *μ̂ = d / v* and *μ̃ = D / V*.
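
Maximising the log-likelihood *-μv + d log μ* gives total deaths divided by total waiting time. The sketch below checks this numerically on made-up data; the `(d_i, v_i)` pairs and function names are purely illustrative.

```python
import math


def mle_force_of_mortality(observations):
    """observations: list of (d_i, v_i) pairs; returns total deaths / total waiting time."""
    d = sum(d_i for d_i, _ in observations)
    v = sum(v_i for _, v_i in observations)
    return d / v


def log_likelihood(mu, observations):
    """log L(mu) = -mu * v + d * log(mu), with d and v the totals over all lives."""
    d = sum(d_i for d_i, _ in observations)
    v = sum(v_i for _, v_i in observations)
    return -mu * v + d * math.log(mu)


# Made-up data: (death indicator, waiting time in years) for five lives.
data = [(0, 1.0), (1, 0.5), (0, 0.75), (1, 0.25), (0, 1.0)]
mu_hat = mle_force_of_mortality(data)
print(mu_hat, log_likelihood(mu_hat, data))
```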

Answer 12

This empirical approach has limitations:

1. **Smoothness of *S*(*t*)**: Estimating *S*(*t*) using this empirical approach results in a step function, i.e. *S*(*t*) steps down at each *t* where a death was observed. However, *S*(*t*) can be smoothed using statistical techniques.
2. **Size of group**: It is difficult to find a sufficiently large population that can be observed easily. A large number of lives must be observed to provide an accurate estimate of *S*(*t*). The larger the population used, the less significant the steps become, i.e. each death has less of an impact on *S*(*t*), and so the smoother *S*(*t*) is expected to become.
3. **Time period of observation**: This estimation requires the same group of lives to be observed until the last person dies, so the observation period may be over 100 years. By the end of this period, the data is unlikely to be an appropriate basis on which to model a survival function for the current population, because mortality is likely to change in both level and shape over this period, e.g. through medical advances or improved diet. An alternative approach is to use the whole population, using different segments for each age.
4. **Censoring**: In order to use the approximation for *S*(*t*) above, we need to observe each life until it dies. In practice, some of the lives may disappear from the observation for reasons other than death, for example emigration, or we may not be able to ascertain their exact date of death. This type of issue is referred to as **censoring**.

Answer 13

1. **Right-censoring**: The observation finishes before the terminal event has necessarily happened for all subjects. For example, the investigation ends on a fixed date.
2. **Left-censoring**: The observation does not allow us to know when a subject entered into the state we are observing. For example, in bio-medical research it is usually recorded that a patient entered the hospital on a particular date and survived for a certain amount of time thereafter; however, the researcher does not necessarily know, or have on record, exactly when the symptoms of the disease first occurred.
3. **Interval-censoring**: The observation set-up only allows us to know whether the event fell within some interval of time. It does not allow us to say the exact date of the event. For example, women are tested for cervical cancer every three years. Therefore, if one of the regular tests reveals cancer, it is not possible to pinpoint the exact date the cancer developed, but it is possible to say that it fell within the three-year period prior to this test.
4. **Random censoring**: This occurs when a subject ceases to be observed at a random time, before they have experienced the terminal event. For example, a member of a pension scheme transfers their benefits out of the scheme. This transfer occurs at a time which could not have been known in advance, and means the observed lifetime of this member is censored.
5. **Informative censoring**: Censoring is informative if it gives information about the subject in relation to the terminal event. For example, if an employee of a company retires on the grounds of ill-health, one would expect them to have a higher rate of mortality than an employee of the same age who remains at the firm.
6. **Non-informative censoring**: Censoring is non-informative if it gives no information about the subject or event in question. For example, in survival analysis, if a person in the population emigrates, this does not provide any additional information about the expected lifetime of this person.
7. **Type I censoring**: This describes the situation when an observation is terminated at a particular point in time, so that the remaining subjects are only known not to have reached the terminal event. For example, in survival analysis we end the observation on a fixed date. In this case, the censoring time is often fixed, and the number of terminal events observed is a random variable.
8. **Type II censoring**: In contrast to type I censoring, with type II censoring the observation continues until a fixed proportion of the subjects have reached the terminal event. For example, an investigation into the safety of a drug may require a certain level of success to be observed before the drug can be marketed, e.g. recovery of 95% of the patients.

Answer 14

Suppose we observe a population of size *N*, and that we observe *m* deaths within this population before terminating our observation. The *m* deaths occur at *k* different times. (This allows for some deaths occurring at the same time, so clearly *k ≤ m*.) Denote the *i*th observed time of death by *t_i*, where *t₁ < t₂ < ... < t_{k - 1} < t_k* and *1 ≤ i ≤ k*. Let *n_i* represent the number of lives at risk and under observation just prior to time *t_i*, and *d_i* the number of deaths at time *t_i*. Let *c_i* represent the total number of lives censored between *t_i* and *t_{i + 1}*. The Kaplan-Meier estimate, or product-limit estimate, of the survival function *S(t)* is: *Ŝ(t) = ∏_{t_i ≤ t} (1 - d_i / n_i)*.
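
The product-limit estimate multiplies the factors *(1 - d_i / n_i)* over the death times up to *t*. A minimal sketch follows, using made-up data and the common convention (an assumption here) that a life censored at a death time is still counted as at risk at that time.

```python
def kaplan_meier(observations):
    """observations: list of (time, died) pairs, died=False meaning censored.

    Returns [(t_i, S_hat(t_i))] at each distinct death time t_i.
    """
    death_times = sorted({t for t, died in observations if died})
    surv = 1.0
    curve = []
    for t_i in death_times:
        # n_i: lives still under observation just before t_i.
        n_i = sum(1 for t, _ in observations if t >= t_i)
        # d_i: deaths observed exactly at t_i.
        d_i = sum(1 for t, died in observations if died and t == t_i)
        surv *= 1.0 - d_i / n_i
        curve.append((t_i, surv))
    return curve


# Made-up data: six lives, two of them censored (at times 3 and 7).
data = [(2, True), (3, False), (5, True), (5, True), (7, False), (9, True)]
for t, s in kaplan_meier(data):
    print(t, round(s, 4))
```

Censored lives never trigger a step in the curve; they only reduce *n_i* at later death times.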

Answer 15

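The **Cox model**, also known as the **Proportional Hazards model**, provides the following format for modelling the hazard function: *λ(t; z_i) = λ₀(t) exp(β z_iᵀ)*, where *λ₀(t)* is the baseline hazard, *z_i* is the vector of covariates for life *i* and *β* is the vector of regression parameters.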

Answer 16

A Markov chain is a stochastic process with discrete states operating in discrete time, in which the probabilities of moving from one state to another depend only on the present state of the process. If, in addition, the transition probabilities are independent of time (equivalently, the *l*-step transition probabilities depend only on the time lag *l*), the chain is said to be time-homogeneous.

Answer 17

* **Discrete state space, discrete time**: Markov chain, Markov jump chain, counting process, random walk. E.g. no claims bonus.
* **Discrete state space, continuous time**: counting process, Poisson process, Markov jump process, compound Poisson process (with discrete jump distribution). E.g. number of claims received, monitored continuously.
* **Continuous state space, discrete time**: general random walk, white noise. E.g. total amount insured on a certain type of policy, valued at the end of each month.
* **Continuous state space, continuous time**: compound Poisson process (with continuous jump distribution), Brownian motion, Ito process. E.g. value of claims arriving, monitored continuously.

Answer 18

State space = { Able, Ill, Dead }

* *V_i* = waiting time of life *i* in the able state
* *W_i* = waiting time of life *i* in the ill state
* *S_i* = number of transitions able to ill by life *i*
* *R_i* = number of transitions ill to able by life *i*
* *D_i* = number of transitions able to dead by life *i*
* *U_i* = number of transitions ill to dead by life *i*

*V = ΣV_i* etc.

Transition intensities: *μ = μ^{ad}, σ = μ^{ai}, ρ = μ^{ia}, ν = μ^{id}*

The likelihood function for these transition intensities is proportional to: *L(μ, ν, σ, ρ) = e^{-(μ+σ)v} e^{-(ν+ρ)w} μ^d ν^u σ^s ρ^r*
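
Because the likelihood factorises, maximising it term by term gives each intensity as the number of transitions divided by the total waiting time in the state the transition leaves. A short sketch with made-up totals (the function name and figures are illustrative assumptions):

```python
def sickness_model_mles(v, w, s, r, d, u):
    """MLEs of the four intensities from total waiting times and transition counts.

    v, w: total waiting time in the able and ill states.
    s, r, d, u: able->ill, ill->able, able->dead and ill->dead transition counts.
    """
    return {
        "mu": d / v,     # able -> dead
        "sigma": s / v,  # able -> ill
        "rho": r / w,    # ill -> able
        "nu": u / w,     # ill -> dead
    }


# Made-up totals for illustration only.
print(sickness_model_mles(v=1000.0, w=50.0, s=40, r=30, d=5, u=2))
```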

Answer 19

In the case of censoring, we need to be able to use an approximation for the required *_tq_x* to allow for the effect of censoring, since each life potentially contributes to a different part of the rate interval for *q_x*. Three assumptions can be used to relate *_{b_i - a_i}q_{x + a_i}* to *q_x*.

* For these calculations, the results are related by (2) < (1) < (3)
* UDD - assumes an increasing force of mortality
* Balducci assumption - assumes a decreasing force of mortality

Answer 20

The initial exposed to risk is required for the Binomial model, which treats lives as equivalent Bernoulli trials in order to estimate a probability. The central exposed to risk is equivalent to the Markov waiting time, and is used for the Poisson and two-state Markov models.

Answer 21

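Definition. The actuarial estimate estimates *q_x* for the Binomial model, allowing for censoring. It is defined by: *q̂_x = d / (E^c_x + ½d)*, where *d* is the observed number of deaths and *E^c_x* is the central exposed to risk.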

Answer 22

For normal human mortality all of the models are acceptable, due to the low force of mortality. Hence life tables, which have traditionally been produced using the Binomial model, have been used successfully for years. However, the Binomial model tends to be inappropriate in the following circumstances:

* where there is plenty of data: it is usually easier to calculate *E^c_x*, and therefore to use the Poisson or two-state model.
* where there is more than one state: the Binomial model does not easily extend to multiple states, so the two-state or Poisson models should be used.
* where *μ* is not small: the approximation used to calculate *E_x* for the actuarial estimate is invalid for larger *μ*. It also results in the loss of valuable information about the time of the transition to another state. The Poisson model is also inappropriate for higher *μ*, and only the two-state model is appropriate here.

Answer 23

To avoid adverse (anti-)selection by customers, companies must attempt to get all salient information relating to risk (rating) factors, particularly when competitors use the same information. Models need to be constructed by splitting the general population of insured customers into roughly homogeneous groups.

Main factors:

* age
* sex
* smoker status
* level of underwriting (i.e. medical required in advance)
* duration-in-force
* type of policy (through employer / mortgage etc.)
* weight v. height
* units of alcohol per week

Additional factors:

* sales channel (e.g. demographic for daytime TV v. direct mail etc.)
* policy size
* occupation (either directly through having a risky job, e.g. bomb disposal expert, or indirectly by being linked to lifestyle proclivities)
* known impairments / personal health history
* family health history

Answer 24

Common definitions of age used in observations:

* **Age *x* last birthday**: covers [*x, x + 1*]
* **Age *x* nearest birthday**: covers [*x - 0.5, x + 0.5*]
* **Age *x* next birthday**: covers [*x - 1, x*]

Three kinds of rate interval:

* **life year rate interval** - age label changes on a date that depends only on the life's birthday
* **calendar year rate interval** - age label changes at a set point in the year, e.g. 1 January
* **policy year rate interval** - age label changes on the policy anniversary

Age at death will be recorded based on the combination of these two aspects: the definition of age and the rate interval.

Answer 25

The **principle of correspondence** states that a life alive at time *t* should be included in the exposure for age *x* at time *t* if and only if, were that life to die immediately, it would be counted in the death data at age *x*. *I.e. a life should only be included in the exposed to risk if it would be included in the mortality figures for the same period.*

Answer 26

In order to calculate the central exposed to risk precisely, we require the following items of data for each life *i* included in the investigation:

* date of birth of the life
* date observation of the life commenced
* date observation of the life ceased
* an indicator to show whether observation of life *i* ceased due to death

In addition to this information, we require the definition of age to be specified. For each life we would calculate the following for each relevant age *x*:

1. the date at which the life entered the observation for age *x*, i.e. the latest of:
   * date at which the life reached age *x*
   * date observation commenced
2. the date at which the life left the observation for age *x*, i.e. the earliest of:
   * date at which the life reached age *x + 1*
   * date observation ceased
3. the difference between 1 and 2

Given this information, we can calculate the time each life spent under observation whilst aged *x*. This should be totalled across all lives in the investigation to give the total central exposed to risk.
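
The three steps can be sketched for a single life as below. The sketch assumes age is defined as age last birthday and measures exposure in days; the helper names and the example dates are hypothetical.

```python
from datetime import date


def add_years(d, years):
    """Return the anniversary of d after the given number of years."""
    try:
        return d.replace(year=d.year + years)
    except ValueError:
        # 29 February in a non-leap year: fall back to 28 February.
        return d.replace(year=d.year + years, day=28)


def exposure_days_at_age(birth, start, end, x):
    """Days under observation while aged x last birthday (0 if none)."""
    entry = max(add_years(birth, x), start)    # step 1: latest of the two dates
    exit_ = min(add_years(birth, x + 1), end)  # step 2: earliest of the two dates
    return max((exit_ - entry).days, 0)        # step 3: the difference, floored at 0


# Hypothetical life: born 15 June 1980, observed from 1 Jan 2020 to 31 Mar 2021.
birth, start, end = date(1980, 6, 15), date(2020, 1, 1), date(2021, 3, 31)
for x in (39, 40, 41):
    print(x, exposure_days_at_age(birth, start, end, x))
```

Summing the result over all lives, and dividing by the number of days in a year, gives the central exposed to risk at age *x* in years.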

Answer 27

As the Continuous Mortality Investigation Bureau (CMIB) compiles research based on figures provided by UK / Eire life insurance companies as at 1st January, it uses a **trapezium rule approximation** for the central exposed to risk. The use of the trapezium rule assumes that the population changes gradually (approximately linearly) over the course of the year, an assumption that may be invalid.
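
With census counts one year apart, the trapezium rule approximates the central exposed to risk over each year by the average of the opening and closing counts. A minimal sketch with made-up counts (the function name is an illustrative assumption):

```python
def central_exposure_trapezium(census_counts):
    """Approximate central exposed to risk (in life-years) from annual census counts.

    census_counts: numbers of lives at age x on successive 1 January census dates.
    """
    return sum(0.5 * (a + b) for a, b in zip(census_counts, census_counts[1:]))


# Made-up counts at three successive 1 January censuses (two years of exposure).
print(central_exposure_trapezium([1000, 1040, 1100]))
```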

Answer 28

The assumption that policy anniversaries are uniformly distributed throughout the year must be treated with care. This assumption may not hold for several reasons, such as:

* **policy anniversaries may be concentrated** around certain points of the year, for example just prior to the end of a tax year.
* **policy anniversaries may not be independent of birthdays**. For example, customers may take out insurance just prior to a birthday to obtain a lower premium.
* **employers** who provide life assurance through a group arrangement with the life insurer will **have the same policy anniversary for all employees**.

Answer 29

Breslow's approximation to the partial likelihood function

Answer 30

Model building can start with either:

* **null model** - start with no covariates and add one at a time
* **full model** - start with all covariates and then eliminate those that are likely to be insignificant

**Likelihood Ratio Statistic:** *LRS = -2(L_p - L_{p+q})*, where *L_p* is the maximised log-likelihood for the model with *p* parameters, and *L_{p+q}* is that for the model with *p + q* parameters.

If the additional *q* covariates have no effect, then LRS has an asymptotic chi-squared distribution with *q* degrees of freedom. For the additional *q* covariates to be considered an improvement, LRS must exceed the critical value at the chosen significance level (normally 5%) on *q* degrees of freedom.
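
A small sketch of the comparison, using the standard upper 5% points of the chi-squared distribution for 1 to 3 degrees of freedom. The log-likelihood values are made up, and the function name is an illustrative assumption.

```python
# Upper 5% critical values of the chi-squared distribution, keyed by degrees of freedom.
CHI2_5PCT = {1: 3.841, 2: 5.991, 3: 7.815}


def likelihood_ratio_test(loglik_p, loglik_pq, q):
    """Return (LRS, significant) for adding q covariates to a p-parameter model."""
    lrs = -2.0 * (loglik_p - loglik_pq)
    return lrs, lrs > CHI2_5PCT[q]


# Made-up maximised log-likelihoods: adding two covariates raises it from -250.0 to -246.5.
lrs, significant = likelihood_ratio_test(-250.0, -246.5, q=2)
print(lrs, significant)
```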

Answer 31

**Advantages**

* The graduated rates will progress smoothly provided the number of parameters is small
* Good for producing standard tables
* Can easily be extended to more complex formulae, provided optimisation can be achieved
* Can fit the same formula to different experiences and compare parameter values to highlight differences between them

**Disadvantages**

* It can be hard to find a formula that fits well at all ages without having lots of parameters
* Care is required when extrapolating: the fit is bound to be best at ages where we have lots of data and can often be poor at extreme ages.

Answer 32

* long runs of deviations of the same sign, caused by overgraduation
  * solution: grouping of signs test
* a few large deviations balanced by more numerous small deviations
  * solution: individual standardised deviations test
* graduated rates may be too high or too low across the whole range, but not by enough to show up in the chi-squared test
  * solution: groups of signs
* results of graduation not smooth
  * solution: test using third-order differences of the graduated rates; if the graduation is smooth, these will be small in comparison to the rates themselves

Answer 33

For a continuously differentiable function *f(x),*

Answer 34

* The nature of the existing sickness data the company possesses. The model can only be as complex as the data will allow it to be.
* Whether the company has made any previous attempts to model sickness rates among its employees, and how successful they were.
* The complexity of the model, e.g. whether it should be stochastic or deterministic. More complex models will be costlier to prepare and run, and eventually there may be diminishing returns to additional complexity.
* General trends in sickness at the national level may need to be built in.
* The definition of sickness and level of benefits payable under the scheme.
* Does the company plan to change the characteristics of the employees? For example, does it plan to recruit more mature persons?
* The ease of communication of the model.
* The budget and resources available for the construction of the model.
* Capability of staff. Will outside consultants be required?
* By whom will the model be used? Will they be capable of understanding and using it?
* Does the model need to interface with models of other aspects of the company's business (e.g. taking data from other systems)?
* The independence of sickness rates should be taken into account, e.g. in the event of an epidemic, claims cannot be considered independent.

Answer 35

We do not need to know the general shape of the hazard/distribution.

Answer 36

Let *S* be the state space. We say that *{π_j | j ∈ S}* is a stationary probability distribution for a Markov chain with transition matrix *P = (p_ij)* if the following hold for all *j ∈ S*: *π_j = Σ_{i∈S} π_i p_ij*, *π_j ≥ 0*, and *Σ_{j∈S} π_j = 1*.
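
A stationary distribution satisfies *π = πP* with non-negative entries summing to one. For a small chain it can be approximated by repeatedly multiplying a starting distribution by *P*. This is only a sketch: it assumes the chain is irreducible and aperiodic so that the iteration converges, and the two-state matrix is made up.

```python
def stationary_distribution(P, iterations=10_000, tol=1e-12):
    """Approximate pi with pi = pi P by repeated multiplication (power iteration)."""
    n = len(P)
    pi = [1.0 / n] * n  # start from the uniform distribution
    for _ in range(iterations):
        new = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
        if max(abs(a - b) for a, b in zip(new, pi)) < tol:
            return new
        pi = new
    return pi


# Made-up two-state chain; solving pi = pi P by hand gives (5/6, 1/6).
P = [[0.9, 0.1],
     [0.5, 0.5]]
print(stationary_distribution(P))
```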

Answer 37

* **using parametric graduation** - appropriate when lots of data is available on large populations
* **using standard tables** - if useful data is scarce and an appropriate standard table exists
* **using graphical graduation** - where there is little relevant data and no standard table *(e.g. for compiling data on newly discovered animals or newly demarcated populations)*

Answer 38

**(Modified) individual standardised deviations test**

Under the null hypothesis (the same as for the chi-squared test) we would expect the individual standardised deviations *z_x* to be distributed Normal(0, 1):

* only about 1 in 20 of the *z_x* should exceed 1.96 in absolute value, i.e. lie above 1.96 or below -1.96
* none should exceed 3 in absolute value, i.e. lie above 3 or below -3
* about two thirds of the *z_x* should lie between -1 and +1

If these do not hold, then there is a problem with either individual outliers or the sample as a whole.
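
The three checks are simple counts over the standardised deviations. A rough sketch, with made-up *z_x* values and an illustrative function name; it summarises the data but does not replace a formal test.

```python
def isd_summary(z):
    """Summarise standardised deviations against the Normal(0, 1) rules of thumb."""
    n = len(z)
    return {
        "share_within_1": sum(abs(v) <= 1 for v in z) / n,       # expect about 2/3
        "share_beyond_1_96": sum(abs(v) > 1.96 for v in z) / n,  # expect about 1/20
        "count_beyond_3": sum(abs(v) > 3 for v in z),            # expect none
    }


# Made-up standardised deviations for ten ages.
z = [0.3, -1.2, 0.8, 2.1, -0.4, 0.1, -0.9, 1.5, -0.2, 0.6]
print(isd_summary(z))
```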

Answer 39

The Gompertz–Makeham law states that the death rate is the sum of an age-independent component (the Makeham term, named after William Makeham) and an age-dependent component (the Gompertz function, named after Benjamin Gompertz), which increases exponentially with age. Where external causes of death are rare (laboratory conditions, low mortality countries, etc.), the age-independent mortality component is often negligible. In this case the formula simplifies to a Gompertz law of mortality. The Gompertz–Makeham law works well from about 30 to 80 years of age. At more advanced ages, some studies have found that death rates increase more slowly – a phenomenon known as the **late-life mortality deceleration** – but other studies disagree.

Answer 40

A Poisson process with rate *λ* is a continuous-time integer-valued process *N_t, t ≥ 0* with the following properties:

* *N₀ = 0*;
* *N_t* has independent increments;
* *N_t* has stationary increments, each having a Poisson distribution, as follows, for *s < t, n = 0, 1, 2, ...*: *P[N_t - N_s = n] = (λ(t - s))^n e^{-λ(t - s)} / n!*
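
The increment *N_t - N_s* has a Poisson distribution with mean *λ(t - s)*, so its probabilities depend only on the length of the interval. A short numerical sketch (the function name and the rate are illustrative assumptions):

```python
import math


def increment_pmf(n, lam, s, t):
    """P[N_t - N_s = n] for a Poisson process with rate lam, where s < t."""
    mean = lam * (t - s)
    return math.exp(-mean) * mean ** n / math.factorial(n)


# Stationarity: intervals (0, 1.5) and (4, 5.5) have the same length, so the same pmf.
print(increment_pmf(2, 2.0, 0.0, 1.5), increment_pmf(2, 2.0, 4.0, 5.5))
```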

Answer 41

A proportional hazards (PH) model is a model which allows investigators to assess the impact of risk factors, or covariates, on the hazard of experiencing an event. In a PH model the hazard is assumed to be the product of two terms, one which depends only on duration, and the other which depends only on the values of the covariates. Under a PH model, the hazards of different lives with covariate vectors *z₁* and *z₂* are in the same proportion at all times, for example in the Cox model: *λ(t; z₁) / λ(t; z₂) = exp(β z₁ᵀ) / exp(β z₂ᵀ)*, which does not depend on *t*.
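
In the Cox model the baseline hazard cancels in the ratio, leaving a quantity that depends only on the covariates and the regression parameters. A minimal sketch with made-up parameter values (the function name and covariate coding are illustrative assumptions):

```python
import math


def hazard_ratio(beta, z1, z2):
    """Ratio of Cox hazards for covariate vectors z1 and z2; the baseline hazard cancels."""
    return math.exp(sum(b * (a - c) for b, a, c in zip(beta, z1, z2)))


# Made-up parameters: first covariate is a smoker indicator, second is a treatment indicator.
beta = [0.5, -0.2]
print(hazard_ratio(beta, [1, 0], [0, 0]))  # smoker versus baseline life
```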

Answer 42

Cox's model ensures that the hazard is always positive. It allows the general 'shape' of the hazard function for all individuals to be determined by the data, giving a high degree of flexibility, while an exponential term accounts for differences between individuals. This means that if we are not primarily concerned with the precise form of the hazard, we can ignore the shape of the baseline hazard and estimate the effects of the covariates from the data directly. The Cox model is included as standard in most statistical modelling software, unlike parametric models, which normally require manual coding and setup.

Answer 43

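The sum of the first *n* terms of a geometric series with first term *a* and common ratio *r ≠ 1* is *S_n = a(1 - r^n) / (1 - r)*. Where *|r| < 1*, the sum to infinity is *a / (1 - r)*.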

Answer 44

The general form is μ_x = αX + exp(βX) , where αX takes the form α₀ + α₁x + α₂x² +... and βX takes the form β₀ +β₁x + β₂x² +....

Answer 45

**Strengths of Binomial model**

* avoids numerical solution of equations
* can be generalised to give the Kaplan-Meier estimate

**Weaknesses of Binomial model**

* the need to compute an initial exposed-to-risk is a pointless complication if census-type data are available
* not so easily generalised as the two-state or Poisson models to processes with more than one decrement, and not so easily generalised as the two-state model to increments
* the estimate of *q_x* has a higher variance than that of the two-state or Poisson models (though the difference is very small unless mortality is very high)

Answer 46

A stochastic process *X_t* operates with state space *S*. Prove that if the process has independent increments, it satisfies the Markov property.

(86 cards)