CT4 - Models & Mortality Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What is a G Test?

A

G-tests are likelihood-ratio or maximum likelihood statistical significance tests that are increasingly being used in situations where chi-squared tests were previously recommended.

The G test is undertaken in the same way as a chi-squared test, with the same degrees of freedom.

If Oi is the observed frequency in a cell, E is the expected frequency on the null hypothesis, and the sum is taken over all non-empty cells, G is calculated as:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How is a Pearson Residual Calculated?

A

For the Binomial mortality model (and thus approximately for the Poisson model), the Pearson residual is given by:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the GLM Link Functions for the following DBNs?

  • normal
  • exponential
  • poisson
  • binomial
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Forward Selection Procedure for Levels of Nested Generalised Linear Models (GLMs)

A

Let Di be the deviance of the ith level model, where D0 is the deviance of the null model (G score, chi-squared or other with approx chi-squared dbn).

In turn test (Di - Di+1) against required significance level (normally 5%) in a chi-squared test with d.f. = 1.

If change in deviance is significant (i.e. p-value < 0.05), then this supports moving to the next level of model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define a Symmetric Simple Random Walk

A

(Used to model processes, such as the movement of stock market share prices)

A simple random walk operates on discrete time and has a discrete state space (i.e. set of all integers)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define a Compound Poisson Process

A

A compound Poisson process is a continuous-time (random) stochastic process with jumps.

A compound Poisson proces operates on continous time. It has a discrete or continous state space depending on whether Yj are continuous or discrete.

Examples:

  • Aggregate claims for an insurance portfolio (number of claims, each of potentially differing size)
  • Total rainfall (number of drops, each of differing size)
  • Total biomass (number of patches of organism, each patch of differing size)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the 12 Key Steps of Modelling

A

1. Objectives: Develop a clear set of objectives to model the relevant system or process, define its scope. This includes reviewing regulatory guidance.

  1. Plan and validate: Plan the model around the chosen objectives, ensuring that the model’s output can be validated i.e. checked to ensure it accurately reflects the anticipated output from the relevant system or process.
  2. Data: Collect and analyse the necessary data for the model. This will include assigning appropriate values to the input parameters and justifying any assumptions made as part of the modelling process.
  3. Capture the real world system: The initial model should be described so as to capture the main features of the real world system. The level of detail in the model can always be reduced at a later stage. Choose parameters
  4. Expertise: Involve the experts on the relevant system or process. They will be able to feedbacks on the validity of the model before it is developed further. Discuss worst cases and probabilities.
  5. Choose a computer program: Decide on whether the model should be built using a simulation package or a general purpose language.
  6. Write the model: Write the computer program for the model.
  7. De-bug the program. (e.g. test against particular scenarios)
  8. Test the model output: Test the reasonableness of the output from the model. The experts on the relevant system or process should be involved at this stage.
  9. Review and amend: Review and carefully consider the appropriateness of the model in light of making small changes to the input parameters.
    • e.g. test median outcomes against business plans
    • e.g. check probabilities assigned to worst case scenarios
  10. Analyse output: Analyse the output from the model. Test sensitivity to small changes.
  11. Document and communicate: Communicate and document the results and the model.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the Benefits of Modelling?

A
  • Compressed timeframe
    (e. g. for financial planning, real world figures unfold over a lifetime, need to test over a shorter period)
  • Ability to incorporate randomness
    (in stochastic models)
  • **Scenario testing **
    (Can combine parameters in realistic scenarios - e.g. relating to interest rate rises)
  • Greater control over experimental conditions
  • Cost control
    (upstream testing cheaper than assessment after implementation)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the Limitations of Modelling?

A
  • Time and cost
    complex models are expensive to create
  • Multiple runs required
    for stochastic models, gives an indication of the dbn of outputs, but shows the impact of changes to inputs, rather than optimising outputs
  • Validation and verification
    Complex models are hard to relate back to the real world in sanity checking
  • Reliance on data input
    rubbish in, rubbish out
  • Inappropriate use
    Scope and purpose of model must be understood by client to avoid misuse
  • Limited scope
    models can’t cover all future eventualities, such as changes in legislation
  • Difficulty intepreting outputs
    results usually useful only relative to other results, not in absolute real-world terms
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Continuous, Discrete and Mixed Type Processes

A

In a stochastic (random) process, a family of random variables Xt : t ∈ τ is disclosed over time, t

In a discrete process, τ = {0, 1, 2, … }
= Z (natural, or counting numbers)
e.g. number of claims made

In a continuous process, T = R (real numbers), or [0,∞]
e.g. stock price fluctuations

A counting process Nt, is the number of events by time period t, where Nt is a non-decreasing integer and N0 = 0

An example of a mixed type process, including both continuous and discrete elements, is the market price of coupon-paying bonds - the bonds change price at specific times in response to coupons being paid, but also change continually due to market.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the Markov Property?

A

A process with the Markov property will be one in which the future development of the process may be predicted on the basis of the current state of the system alone, without reference to it past history.

If a stochastic process Xt is defined on a state space S and time set t ≥ 0, the Markov property is expressed mathematically as:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Prove the Chapman-Kolmogorov Equations for n-step transitions

A

The Chapman-Kolmogorov Equations provide a way of calculating n-step transition probabilities through matrix multiplication of single-step transitions. The proof is as follows:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do you Calculate the
n-Step probability using the Chapman Kolmogorov equations?

A

If αj0 is the initial probability distribution,

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Conditions for a Stationary Distribution in a Markov Chain

A

In a Markov chain, the distribution of Xn may converge to a limit π, such that

P(Xn= j | X0= i) → πj

Regardless of the starting point - this is known as a stationary distribution

  • πj = Σπipij *for all i ∈ S
  • A Markov chain with a finite state space has at least one stationary probability distribution}
  • An irreducible Markov chain with a finite state space has a unique stationary probability distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Time Inhomogeneous / Homogeneous Markov Chains

A

A Markov chain is called time-homogeneous if transition probabilities do not depend on time.

Where they do depend on time (e.g. risk for drivers at a particular age, or at a particular length of time after qualifying), a Markov Chain is called time-inhomogeneous.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Simple No Claims Discount (NCD) Model

A

No Claims Discount (NCD)

Policy holders start at 0% level

No claim in year moves up one level (or stays at top)

Claim in year moves down one level (or stays at bottom)

e.g:

Consider a no claims discount (NCD) model for car-insurance premiums. The insurance
company offers discounts of 0%, 30% and 60% of the full premium, determined by the following rules:

  1. All new policyholders start at the 0% level.
  2. If no claim is made during the current year the policyholder moves up one discount level, or remains at the 60% level.
  3. If one or more claims are made the policyholder moves down one level, or remains at the 0% level.

Transition graph shown below:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q
# Define Transition Intensities *q<sub>ii</sub>(t), q<sub>ij</sub>(t)*
 for Markov Jump Processes
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Define Kolmogorov’s Forwards Equation for Markov Jump Processes

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Define Kolmogorovs Backwards Equation for Markov Jump Processes

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Distribution of Holding Times (Markov Jump Process)

A

Poisson process characterised by unit upward jumps - hence path fully characterised by times between jumps.

Distribution of Holding Times is Exponential with Parameter λ, in case of Poisson, and μ, in case of non-Poisson Markov Process.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Define Residual and Current Holding Times for Markov Jump Processes

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the Survival Model?

A

The survival model is a two state Markov chain, with a state space of S = { A, D } (alive / dead). There is single transition rate μ(t) - identified with the force of mortality at age t

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is the Sickness-Death Model?

A

An extension of the survival model, with three states - healthy (H), sick (S) or dead (D). S = { H, S, D }

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is the Long Term Sickness Model?

A

The so-called long term care model is a time-inhomogeneous model where the rate of transition out of state S (sickness) will depend on the current holding time in state S.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is the Marriage Model?

A

The marriage model is a time-inhomogeneous model under which an individual can be either never married (B), married (M), divorced (D), widowed (W) or dead Δ. A Markov jump process can be formulated on the state space

S = { B, M, D, W, Δ }

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

<span>Terminology for Model of Lifetime or Failure</span>

Define: Tx, ω, Fx(t), Sx(t)

A

Tx - future life time of a person aged x - T0 usually given simply as T

ω - limiting age, or maximum age person can reach - typically restricted between 100 and 120 in models for simplicity

Fx(t) - distribution function of Tx- F0(t)** usually given asF(t)**

Fx(t) = P[Tx ≤ t] - probability of death for a life aged x by age x + t

Sx(t) = P[Tx > t] - survival function of Tx, representing the probability that a life aged x survives for t years

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Consistency Conditions for Model of Lifetime or Failure

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Define the Force of Mortality (Hazard Rate)

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Modified No Claims Discount (NCD) Model

A

The simple NCD model can be modified with a number of improvements.

One such improvement is to have the following states:

  • State 0: 0% discount
  • State 1: 25% discount
  • State 2: 40% discount
  • State 3: 60% discount

The transition rules are as before except that when there is a claim during the current year, the discount status
moves down either two levels if there was a claim in the previous year, or one level if the previous year was
claim-free.

To form a Markov chain, State 2 needs to be split, otherwise there is a reliance on historical state past the immediately previous step.

Have a State 2-, following single claim in previous year in State 3, and a State 2+, following no claims in State 1

Transition Graph Below:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Restricted Random Walk Model

A

The restricted random walk is a simple random walk with boundary conditions

E.g. in the case where a gambler borrows money having lost everything, then the barrier is a reflecting barrier, of they cash out as soon as they hit a target, the upper barrier is an absorbing barrier.

More generally:

An absorbing barrier is a value b such that:

  • P( Xn+s = b | Xn = b ) = 1 for all s > 0*
    i. e. once b is reached, the system stops and remains in this state afterwards

A reflecting barrier is a value c such that:

P( Xn+1 = c+1 | Xn = c ) = 1

i.e. once c is reached, the random walk is pushed away.

A mixed barrier is a value d such that:

P( Xn+1 = d | Xn = d ) = α and

P(Xn+1 = d + 1 | Xn = d ) = 1 - α

α∈[0,1]

i.e. barrier is an absorbing barrier with probability α and a reflecting barrier with probability 1 - α

Transition graph below:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

PDF of Tx(t) - fx(t)

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What is Gompertz’ Law?

A

Gompertz’ Law μx = Bcx where B and c are constants.

Gompertz’ Law is an exponential function and as such is usually appropriate to middle or older ages, say 35 to 90. The constants B and c can be found by taking values of μx, for two different x, and using them to set up simultaneous equations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What is Makeham’s Law?

A

Makeham’s Law μx = A + Bcx where A, B and c are constants.

Makeham’s Law adds a constant term to the exponential function. This constant part of μx implies there is an element of the force of mortality that is not linked to age, for example accidental death. The constants A, B and c can be found by taking values of μx, for three different x, and solving the resulting equations to find A, B and c.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Complete Expectation of a Life Aged X

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Curtate Future Lifetime of a Life Aged X

A
36
Q

Curtate Expectation of a Life Aged X

A

The curtate expectation of a life aged x, denoted by ex is ex = E[Kx].

37
Q

Variances of Complete / Curtate Future Lifetime

A
38
Q

Assumptions of a Two State Markov Survival Model

A

The two state Markov Survival Model assumes:

  1. The Markov Assumption
  2. dtqx+tx+tdt+o(dt) for t > 0 where o(dt) is a small correction term (when differentiated this correction term goes to zero)
  3. μx+t is constant for positive integers x and 0 ≤ t < 1
39
Q

Definitions for Deaths and Waiting Time in Two State Markov Survival Model -

Define: ai, bi, Di, Vi

A

Consider life i in an observation of N lives, where 1 ≤ i ≤ N, let:

  • x + ai - exact age at which observation of life i starts
  • x + bi - exact age at which observation of life i ends
  • 0 ≤ ai < bi ≤ 1
  • Di - variable indicating whether life i is observed to die during observation - 1 if i dies, 0, if not
  • V<em>i</em> - waiting time - actual time of observation for life i - age when observation ceases less age when it began (ends either due to death or to end of observation - which is its upper bound)
  • **0 < Vi ≤ bi - ai ≤ 1 **
  • Di = 0 ⇔ Vi = bi - ai - i.e. if no death, waiting time is difference between start and end of observation
  • Di = 1 ⇔ 0 < Vi < bi - ai - i.e. if death, waiting time is less than difference between start and end of observation
40
Q

Probability Distributions for Deaths and Waiting Time in Two State Markov Survival Model

A

Let fi(di, vi) be the joint dbn of (Di, Vi), where we use (di, vi) to represent a single observation, a sample from (Di, Vi)

41
Q

MLEs for Deaths and Waiting Time in Two State Markov Survival Model

A

Likelihood function for μ, based on observation of (d, v) is

L(μ; d, v) = e-μvμd

Maximum Likelihood Estimate (MLE) and Maximum Likelihood Estimator for μ given by respectively:

42
Q

Non-Parametric Estimation of Survival Function - Limitations

A

This empirical approach has limitations:

  1. Smoothness of S(t): Estimating S(t) using this empirical approach would result in a step function i.e. S(t) would step down for each t where a death was observed. However, S(t) can be smoothed using statistical techniques.
  2. Size of group: It is difficult to find a sufficiently large population to observe easily. A large amount of lives must be observed to provide an accurate estimate of S(t). The larger the population used the less significant the steps become, i.e. each death has less of an impact on S(t), and so the smoother S(t) is expected to become.
  3. Time period of observation: This estimation requires the same group of lives to be observed until the last person dies. That means the observation period may be over 100 years. By the end of this period, the data is unlikely to be an appropriate basis on which to model a survival function for the current population. This is because the level of mortality is likely to change in both level and shape over this period e.g. medical advances, improved diet etc. An alternative approach is to use the whole population - using different segments for each age.
  4. Censoring: In order to use the approximation for S(t) above, we need to observe each life until it dies. In practice, some of the lives in the observation may disappear from the observation for reasons other than death, for example, emigration. Or, we may not be able to ascertain their exact date of death. This type of issue is referred to as censoring.
43
Q

Forms of Censoring

A
  1. Right-censoring: The observation finishes before the terminal event has necessarily happened for all subjects. For example, the investigation ends on a fixed date.
  2. Left-censoring: The observation does not allow us to know when a subject entered into the state we are observing. For example, in bio-medical research it is usually recorded when a patient entered the hospital at a particular date, and that he survived for a certain amount of time thereafter; however, the researcher does not necessarily know, or have on record, when exactly the symptoms of the disease first occurred.
  3. Interval-censoring: The observation set-up only allows us to know whether the event fell within some interval of time. It does not allow us to say the exact date of the event. For example, women are tested for cervical cancer every three years. Therefore, if one of the regular tests reveals cancer, it is not possible to pin-point the exact date the cancer developed but it is possible to say that it fell within the three-year period prior to this test.
  4. Random censoring: This occurs when a subject ceases to be observed at a random time, before they have experienced the terminal event. For example, a member of a pension scheme transfers their benefits out of the scheme. This transfer would occur at a time which could not have been pre-empted and means the observed lifetime of this member is censored.
  5. Informative censoring: Censoring is informative if it gives information about the subject in relation to the terminal event. For example, if an employee of a company retires on the grounds ill-health, one would expect them to have a higher rate of mortality than an employee of the same age who remains at the firm.
  6. Non-informative censoring: Censoring is non-informative if it gives no information about the subject or event in question. For example, in survival analysis, if a person in the population emigrates, this does not provide any additional information about the expected lifetime of this person.
  7. Type I censoring: This describes the situation when a observation is terminated at a particular point in time, so that the remaining subjects are only known not to have reached the terminal event. For example, in survival analysis we end the observation on a fixed date. In this case, the censoring time is often fixed, and the number of terminal events observed is a random variable.
  8. Type II censoring: In contrast to type I censoring, with type II censoring the observation continues until a fixed proportion of the subjects have reached the terminal event. For example, an investigation into the safety of a drug may require a certain level of success to be observed before the drug can be marketed e.g. recovery of 95% of the patients.
44
Q

Kaplan Meier Estimate (Product Limit Estimate)

A

Suppose we observe a population of size N. Imagine we observe m deaths within this population before terminating our observation. The m deaths occur at k different times. (This allows for some deaths occuring at the same time and clearly k ≤ m.)

Denote the i th observed time of death by ti, where t1 < t2 < …< tk - 1 < tk and 1 ≤ i ≤ k.

Let ni represent the number of lives at risk and under observation just prior to time ti, and di, the number of deaths at time ti.

Let ci represent the total lives censored between ti and ti + 1.

The Kaplan-Meier estimate or product limit estimate of the survival function S(t) is:

45
Q

Nelson-Aalen Estimate

A
46
Q

Cox Model (Proportional Hazards)

A

The Cox model, also known as the Proportional Hazards model, provides the following format for modelling the hazard function:

47
Q

Define Time Homogeneous Markov Chain

A

A Markov chain is a stochastic process with discrete states operating in discrete time in which the probabilities of moving from one state to another are dependent only on the present state of the process.

EITHER

If the transition probabilities are also independent of time.

OR

If the l-step transition probabilities are dependent only on the time lag, the chain is said to be time-homogeneous.

48
Q

Categorise Stochastic Models by State Space / Time Space into Continous and Discrete

A

State Space Discrete + Time Space Discrete

  • Markov chain, Markov jump chain, Counting Process, Random Walk
  • E.g. No claims bonus

SS Discrete + T Continuous

  • Counting Process, Poisson Process, Markov jump process, Compound Poisson Process (with Discrete Jump DBN)
  • e.g Number of claims received monitored continuously

SS Continuous + T Discrete

  • General random walk, White noise
  • e.g. Total amount insured on a certain type of policy valued at end of each month

SS Continuous + T Continuous

  • Compound Poisson (with continuous jump dbn), Brownian Motion, Ito Process
  • e.g. Value of claims arriving monitored continously
49
Q

Prove The Forward Kolmogorov Differential Equations for a Two State Markov Model from First Principles

A
50
Q

Forwards/Backwards Kolmogorov Equations for
Two State Markov Model

A
51
Q

Illness-Death Markov Model Define Vi, Wi, Si, Ri, Di, Ui, Likelihood Function and MLE’s

A

State Space = { Able, Ill, Dead }

  • Vi *= Waiting time of life i in the able state
  • Wi *= Waiting time of life i in the ill state.
  • Si *= Number of transitions able to ill by life i.
  • Ri *= Number of transitions ill to able by life i.
  • Di *= Number of transitions able to dead by life i.
  • Ui *= Number of transitions ill to dead by life i.

V = ΣVi etc.

Transition Intensities:

μ = μad, σ = μai, ρ = μia, ν = μid

Likelihood function for these transition intensities is proportional to:

L(μ, ν, σ, ρ) = e-(μ+σ)νe-(ν+ρ)wμdνuσsρr

52
Q

Illness Death Model - MLE Distributions

A
53
Q

MLE for Poisson Model of Mortality

A
54
Q

Proof of Poisson MLE Expectation and Variance

A
55
Q

MLE for Binomial Model of Mortality

A
56
Q

Proof of Binomial MLE Expectation and Variance

A
57
Q

Adjusting for Censoring - tqx in terms of qx:

  • Constant force of Mortality
  • Uniform Distribution of Deaths
  • Balducci Assumption
A

In case of censoring, need to be alble to use an approximation for required tqx to make up for effect of censoring, meaning that each life potentially contributes to a different part of the qx.

Three assumptions can be used to relate bi-aiqx+ai to qx.

  • For these calculations, the results are related by (2) < (1) < (3)
  • UDD - assumes increasing force of mortality
  • Balducci assumption - assumes decreasing force of mortality
58
Q

Initial and Central Exposed to Risk

A

Initial exposed to risk is required for binomial model, that requires equivalent Bernoulli trials to calculate probability.

Central exposed to risk is equivalent to Markov waiting time, used for Poisson and 2 state Markov.

59
Q

Actuarial Estimate
(of qx for Binomial)

A

Definition. The actuarial estimate estimates qx for the Binomial model, allowing for censoring. It is
defined by:

60
Q

Which Mortality Model to Use: Binomial, Poisson or Two-State?

A

For normal human mortality all of the models are acceptable, due to the low force of mortality. Hence, life tables, which have traditionally been produced using the Binomial model, have been successfully used for years.

However, the Binomial model tends to be inappropriate in the following circumstances:

  • where there is plenty of data: it is usually easier to calculate Ecx, and therefore use the Poisson or two-state model.
  • there is more than one state: the Binomial model does not easily extend to multiple states. Hence the two-state or Poisson models should be used.
  • *μ *is not small: The approximation used to calculate Ex for the actuarial estimate is invalid for larger μ. It also results in the loss of valuable information about the time of the transition to another state. The Poisson model is also inappropriate for higher and only the two-state model is appropriate here.
61
Q

Key Ratings Factors Used by Insurance Companies in Policy Pricing

A

To avoid adverse (anti-)selection by customers, companies must attempt to get all salient information relating to risk (ratings) factors, particularly when competitors use the same information.

Models need to be constructed splitting the general population of insured customers into roughly homogenous groups.

Main Factors:

  • age
  • sex
  • smoker status
  • level of underwriting (i.e. medical required in advance)
  • duration-in-force
  • type of policy (through employer / mortgage etc)
  • weight v. height
  • units of alcohol per week

Additional Factors:

  • sales channel (e.g. demographic for daytime tv v. direct mail etc)
  • policy size
  • occupation (either directly through having a risky job - e.g. bomb disposal expert, or indirectly by being linked to lifestyle proclivities)
  • known impairments / personal health history
  • family health history
62
Q

Give the Common Definitions for Age Used in Policies

What are the Three Main Kinds of Rate Interval?

A

Common definitions of age used in observations:

  • Age x last birthday:
    covers [x, x + 1]
  • Age x nearest birthday:
    covers [x - 0.5, x + 0.5]
  • Age x next birthday:
    covers[x - 1, x]

Three kinds of rate interval:

  • life year rate interval
    age label changes on date dependent on life’s birthday only
  • calandar year rate interval
    age label changes at a set point in the year - e.g. 1 January
  • policy year rate interval
    age label changes on policy anniversary

Age at death will be recorded based on the combination of these two aspects of age and policy rate interval.

63
Q

Principle of Correspondence (for estimates of μ and q)

A

The principle of correspondence states that:

a life alive at time t should be included inthe exposure for age x at time t if and only if, were that life to die immediately, it would be counted in the death data at age x.

I.E. - a life should only be included in the exposed to risk if it would be included in mortality figures for the same period.

64
Q

Calculating Central Exposed to Risk

A

In order to calculate the central exposed to risk, precisely we require the following items of data for each life, i, included in the investigation:

  • date of birth of life
  • date observation of life commenced
  • date observation of life ceased
  • an indicator to show whether observation of life i ceased due to death

In addition to this information, we require the definition of age to be specified.

For each life we would calculate the following for each relevant age x:

  1. the date at which the life entered the observation for age x , i.e. the latest of:
    • date at which the life reached age x
    • date observation commenced
  2. the date at which the life left the observation for age x i.e. the earliest of:
    • date at which the life reached age x+1
    • date observation ceased
  3. the difference between 1 and 2

Given this information, we can calculate for each life, by calculating the time spent under observation,
whilst age x. This should be totalled across the investigation, to calculate the total central exposed to risk.

65
Q

Census Approximation of Waiting Times

A

As the Continuous Mortality Investigation Bureau (CMIB) compiles research based on figures provided by UK / Eire life insurance companies compiled for 1st January, it uses a trapezium rule approximation for approximating the central exposed to risk.

The use of the trapezium rule assumes that population changes gradually over the course of the year - an assumption that may be invalid

66
Q

Estimators of qx and μx - which x do they estimate for?

A
67
Q

Why Policy Anniversaries May Not Be Uniformly Distributed Throughout the Year

A

The assumption that policy anniversaries are uniformly distributed throughout the year must be treated with care. This assumption may not hold for several reasons. Such as:

  • policy anniversaries may be concentrated around certain points of the year. For example just prior to the end of a tax year.
  • policy anniversaries may not be independent of birthdays. For example, customers may take out insurance just prior to a birthday to obtain a lower premium.
  • employers who provide life assurance through a group arrangement with the life insurer, will **have **the same policy anniversary for all employees.
68
Q

Breslow’s approximation to the partial likelihood function

A

Breslow’s approximation to the partial likelihood function

69
Q

Likelihood Ratio Test & Statistic for Nested Cox Models

A

Can start either with:

  • null model - start with no covariates and add one at a time
  • full model - start with all covariates and then eliminate ones that are likely to be insignificant

Likelihood Ratio Statistic:

LRS = - 2(Lp - Lp+q), where Lp is maximised log-likelihood for model with p parameters, and Lp+q is for model with p + q parameters

If additional q covariates have no effect, then LRS has asymptotic chi-squared dbn with q degress of freedom.

For q to be considered and improvement, LRS must be over threshold value for confidence level (normally 5%) on q degrees of freedom.

70
Q

Derive tpx in terms of μx

A
71
Q

Advantages and Disadvatages of Paramateric Graduation of Mortality Data

A

Advantages

  • The graduated rates will progress smoothly provided the number of parameters is small
  • Good for producing standard tables
  • Can easily be extended to more complex formualae, provided optimisation can be achieved
  • Can fit the same formula to different experiences and compare parameter values to highlight differences between them

Disadvantages

  • It can be hard to find a formula to fit well at all ages without having lots of parameters
  • Care is required when extrapolating: the fit is bound to be best at ages where we have lots of data and can often be poor at extreme ages.
72
Q

Problems not Picked up by
Chi-Square Goodness of Fit

A
  • long runs of deviation of same signs caused by undergraduation
    • solution > grouping of signs test
  • a few large deviations balanced by more numerous small deviations
    • solution > individual standardised deviation tests
  • graduated rates may be too high or low across the whole range, but not by enough to show up in chi-square test
    • solution > groups of signs
  • results of graduation not smooth
    • solution > test using third order differences of graduated rates - if graduation smooth, will be small in comparison to values
73
Q

Define Logarithmic Derivative

A

For a continuously differentiable function f(x),

74
Q

Scenario Example for Questions on General Issues in Real World Actuarial Modelling

A large company wishes to construct a model of sickness rates among its employees to use in evaluating the present and future financial health of its sick pay scheme.

Outline factors which the company should take into consideration when developing
the model.

A
  • The nature of the existing sickness data the company possesses. The model can only be as complex as the data will allow it to be.
  • Whether the company has made any previous attempts to model sickness rates among its employees, and how successful they were.
  • The complexity of the model – e.g. whether it should be stochastic or deterministic. More complex models will be costlier to prepare and run, but eventually there may be diminishing returns to additional complexity.
  • General trends in sickness at the national level may need to be built in.
  • The definition of sickness and level of benefits payable under the scheme.
  • Does the company plan to change the characteristics of the employees? For example, does it plan to recruit more mature persons?
  • The ease of communication of the model.
  • The budget and resources available for the construction of the model.
  • Capability of staff. Will outside consultants be required?
  • By whom will the model be used? Will they be capable of understanding and using it?
  • Does the model need to interface with models of other aspects of the company’s business (e.g. taking data from other systems)?
  • The independence of sickness rates should be taken into account e.g. in the event of an
    epidemic claims cannot be considered independent.
75
Q

What is the main advantage of a semi-parametric model over a fully parametric one?

A

We do not need to know the general shape of the hazard/distribution.

76
Q

What is a Stationary Probability Distribution (w.r.t. Markov Chains)?

A

Let S be the state space.
We say that {π j | j∈S} is a stationary probability distribution for a Markov chain with transition matrix P if the following
hold for all j∈S :

77
Q

What are the Main Approaches to Graduating Raw Data?

A
  • using parametric graduation - appropriate when there is lots of data available is available on large populations
  • using standard tables - if useful data is scarce and an appropriate standard table exists
  • using graphical graduation - where there is little relevant data and no standard table (e.g. for compiling data on newly discovered animals or newly demarcated populations)
78
Q

(Modified) individual standardised deviations test

A

(Modified) individual standardised deviations test

Under the null hypothesis (same as for the chi-squared test) we would expect individual deviations to be distributed Normal (0,1)

  • Only 1 in 20 of the zx should lie above 1.96 or below -1.96 in absolute value
  • none should lie above 3 or below 3 in absolute value
  • about two thirds of the zx should lie between −1 and +1

If these are not true, then there is a problem with either individual outliers or the sample as a whole.

79
Q

Gompertz-Makeham
Law of Mortality

A

The Gompertz–Makeham law states that the death rate is the sum of an age-independent component (the Makeham term, named after William Makeham) and an age-dependent component (the Gompertz function, named after Benjamin Gompertz), which increases exponentially with age.

Where external causes of death are rare (laboratory conditions, low mortality countries, etc.), the age-independent mortality component is often negligible. In this case the formula simplifies to a Gompertz law of mortality.

The Gompertz–Makeham law works well from about 30 to 80 years of age.

At more advanced ages, some studies have found that death rates increase more slowly – a phenomenon known as the late-life mortality deceleration – but other studies disagree.

80
Q

What is a Poisson Process?

A

A Poisson process with rate λ is a continuous-time integer-valued process Nt, t ≥ 0 with the following properties:

  • N0 = 0;
  • Nt has independent increments;
  • Nt has stationary increments, each having a Poisson distribution, as follows, for s < t, n = 0, 1, 2, …
81
Q

What is Meant by a Proportional Hazards Model?

A

A proportional hazards (PH) model is a model which allows investigators to assess the impact of risk factors, or covariates, on the hazard of experiencing an event.

In a PH model the hazard is assumed to be the product of two terms, one which depends only on duration, and the other which depends only on the values of the covariates.

Under a PH model, the hazards of different lives with covariate vectors z1 and z2 are in the same proportion at all times, for example in the Cox model:

82
Q

What are the Advantages of Using a Cox Model?

A

Cox’s model ensures that the hazard is always positive. Standard software packages often include Cox’s model.

Cox’s model allows the general ‘shape’ of the hazard function for all individuals to be determined by the data, giving a high degree of flexibility while an exponential term accounts for differences between individuals.

This means that if we are not primarily concerned with the precise form of the hazard, we can ignore the shape of the baseline hazard and estimate the effects of the covariates from the data directly.

The Cox model is included as standard in most statistical modelling software, unlike parametric models, which normally require manual coding and setup.

83
Q

Sum of a Geometric Series of n terms - a + ax + ax2 +…axn-1

A

The sum of the first n terms of a geometric series is, where |r|<1

84
Q

General Polynomial Graduation Formula

Describe the general form of the polynomial formula used to graduate the most recent standard tables produced for use by UK life insurance companies.

A

The general form is

μx = αX + exp(βX) ,

where αX takes the form

α0 + α1x + α2x2 +…

and βX takes the form

β01x + β2x2 +….

85
Q

Strengths and Weaknesses of the Binomial for Mortality Models

A

Strengths of Binomial model

  • avoids numerical solution of equations
  • can be generalised to give the Kaplan-Meier estimate

Weaknesses of Binomial model

  • need to compute an initial exposed-to-risk is a pointless complication if census-type data are available
  • not so easily generalised as two-state or Poisson models to processes with more than one decrement, and not so easily generalised as two-state model to increments
  • estimate of qx has a higher variance than that of the two-state Poisson models (though the difference is very small unless mortality is very high)
86
Q

Proof that Independent Increments Implies Markov Property

A

A stochastic process X(t) operates with state space S. Prove that if the process has independent increments it satisfies the Markov property.