Section A: Stochastic Reserving Flashcards

1
Q

Verrall: In what situations would you want to adjust your model for expert knowledge?

A
  • Change in payment patterns
  • New legislation that limits benefits
    • Decreases the potential for loss development and development factors must be adjusted.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Verrall: What is the benefit of a Bayesian model over the Mack or Bootstrap models to predict reserves?

A

The Bayesian approach can incorporate expert opinion into the model naturally without compromising the underlying assumptions.

  • i.e. incorporates expert knowledge & its easy to implement

Two key areas where expert knowledge is applied:

  • Expected losses in the BF method
  • Selected individual LDFs in the Chain Ladder method
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Verrall: What makes the implementation of Bayesian methodology easy?

A

A common problem with Bayesian methods is the difficulty deriving the posterior distribution as this may be multi-dimensional.

MCMC makes this easier by using the conditional distribution of each parameter, given all other parameters, thus making the simulation a univariate distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Verrall: Describe the MCMC methodology.

A
  • MCMC methods simulate the posterior distribution of a random variable.
  • Breaks down the process using simulations
  • Each of the simulations use the conditional distribution of each parameter given all the other parameters making the simulation a univariate distribution.
  • By considering each parameter in turn, this creates a Markov Chain
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Verrall: What is the difference between the Chain Ladder and BF method for deriving reserves?

A
  • The BF method uses an external estimate for the “level” of each row (e.g. loss ratio, expected loss, etc.)
  • The CL method uses the data in each row
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Verrall: Stochastic Reserving for the Chain-Ladder Technique - Indicated the mean, variance, advantages and the disadavantages of the model.

Mack’s Model

A

E[Di,j] = λjDi,j-1

Var(Di,j) = σj2Di,j-1

​Advantages

  • parameter estimates and prediction errors (reserve ranges) are easy to get (e.g. only need a spreadsheet)
  • easy to implement

Disadvantages

  • since the distribution isn’t specified, there is no predictive distribution
  • must estimate additional parameters to calculate the variance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Verrall: Stochastic Reserving for the Chain-Ladder Technique - Indicate the mean, variance, advantages and the disadavantages of the model.

Over-Dispersed Poisson Model

A

E[Ci,j] = xiyj

Var(Ci,j) = φxiyi

*xi = expected ultimate loss for year i

*yj = % of ultimate losses emerging in development period j

Note: over-dispersed means proportional NOT equal

​Advantages

  • doesn’t break down model if there are negative incremental values
  • same reserve estimate as chain ladder
  • more stable than lognormal model of Kremer

Disadvantages

  • connection to chain ladder is not immediately apparent
  • must estimate additional parameters to calculate the variance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Verrall: Stochastic Reserving for the Chain-Ladder Technique - Indicated the mean, variance, advantages and the disadavantages of the model.

Over-Dispersed Negative Binomial

A

E[Ci,j] = (λj-1)Di,j-1

Var(Ci,j) = φλjj-1)Di,j-1

NOTE: The reserve estimates are the same as the CL method.

→All LDFs must be > 1 (no overall negative development) or variance will be negative

​Advantages

  • doesn’t break down model if there are some negative incremental values
  • same reserve estimate as chain ladder
  • mean is exactly the same as the CL method

Disadvantages

  • Column sums of the incremental losses must be positive or the variance will be negative
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Verrall: Stochastic Reserving for the Chain-Ladder Technique - Indicated the mean, variance, advantages and the disadavantages of the model.

Normal Approximation to the Negative Binomial

A

E[Ci,j] = (λj-1)Di,j-1

Var(Ci,j) = φjDi,j-1

​Advantages

  • allows the possibility of negative incremental losses
  • same reserve estimate as chain ladder
  • mean is exactly the same as the CL method

Disadvantages

  • Additional parameters must be estimated to calculate variance (same issue as Mack method)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Verrall: What is the RMSEP?

A

Root Mean Square Error of Prediction

Mean Squared Error of Prediction (MSEP) is how we calculate the prediction intervals and is also known as the prediction variance.

MSEP = Prediction Variance = Process Variance + Estimation Variance

RMSEP = √MSEP = Prediction Error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Verrall: What is the difference between the standard error and prediction error?

A
  • SE is the square root of the estimation variance
    • only accounts for parameter estimation error
  • Prediction error is concerned with the variability of the forecast and accounts for both
    • uncertainty in the parameter estimation (estimation variance)
    • variability in the data being forecast (process variance)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Verrall: What are the advantages of Bayesian methods when it comes to prediction error?

A
  1. Full predicitve distribution can be found using simulation methods
  2. The RMSEP can be obtained directly by calculating the standard deviation of the distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Verrall: What are two ways the actuary can intervene in the estimation of the development factors for the chain-ladder method?

A
  1. Change a development factor in some of the rows based on external information
  2. Use a 5 year volume-weighted average rather than using all the data in the triangle (all year average)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Verrall: What prior distribution is used in the Bayesian Model for the BF method?

A

Since the BF method assumes expert opinion in each row, we specify the prior distribution as a gamma distribution; xi ~ GAM(⍺i, ßi):

E[xi] = ⍺ißi = Mi

Var(xi) = ⍺i / ßi2 = Mi / ßi

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Verrall: Credibility-Weighted Bayesian Model for the BF method

Zi,j = ?

E[Cij] = ?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Verrall: How can the variance of the model be adjusted for xi?

A

ßi can be used to alter the variance of xi:

  • if we choose prior distributions with large variances (small betas), we have low confidence (no prior knowledge) in our parameter estimates
    • result is close to the CL method
  • if we choose prior distributions with small variances (large betas), we have high confidence (prior knowledge) in our estimates
    • result is close to the BF method
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Verrall: Fully Stochastic BF model formulas

A
  • Run a reverse chain ladder to calculate the gammas and the expected incremental losses
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Verrall: Summarize the steps needed for defining a stochastic version of the BF technique.

A

Step 1: Estimate column parameters

Step 2: Incorporate prior information into the distributions for the parameters xi

Step 3: Use xi to determine ɣi

Step 4: Calculate the expected incremental losses using the gammas

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Shapland: The goal of the ODP bootstrap model is to provide a range of possible outcomes rather than a point estimate. A point estimate is still required by ASOP 36 as we book a point estimate. That being said, list 3 reasons why stochastic reserving is beneficial.

A
  • Could book the first moment of the distribution instead of using a deterministic approach for point estimate (as defined by ASOP 43)
  • SEC looking for more reserving risk information in 10-K reports
  • Rating agencies are looking to validate & improve their models and welcome input from company actuaries regarding unpaid claim distributions
  • Support ERM’s dynamic risk models
  • Accounting framework Solvency II and IFRS moving towards using unpaid claim distributions rather than point estimates
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Shapland: Briefly describe the objectives of the “Using the ODP Bootstrap Model: A Practitioner’s Guide” monograph.

A
  1. Provide practical details on GLM’s - benefits of GLM that can tailor to statistical features of the data versus algorithms which force the data to fit to a static method.
  2. Promote adoption of using unpaid claim distributions by showing how the ODP bootstrap modelling can be used in practice.
  3. Show how to deal with practical issues such as incomplete triangles or negative incremental losses/counts.
  4. Show how stochastic reserving can be similar to deterministic reserving by weighting different models
    • deterministic weights different methods rather than models
  5. Illustrate the advantage of using a complete set of risk estimation tools (both stochastic models and deterministic methods) to arrive at an actuarial best estimate.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Shapland: What is model risk?

A

Model Risk

The risk that the chosen model is not the same as the on that generates future losses.

→ can be addressed by weighting several models together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Shapland: What are the two key assumptions that need to be made in order to make a projection of ultimate losses for the chain-ladder method?

A
  1. The same development factor is applied to all accident years based on a volume weighted average (or some other average).
  2. Each accident year has a parameter that represents its level (its current cumulative loss).
    • Ultimate Claims = Cumulative Loss x CLDF
    • Variation of this assumption is to assume homogeneity in the exposures underlying the losses. This results in using an average loss by development period which can be used to estimate the next period losses. The issue with this is that CL method assumes AY’s are NOT homogenous.
    • The BF and CC method do assume homogeneity (loss ratio, average loss, etc.) by incorporating future expected results into the reserve estimate.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Shapland: What are the advantages of a Bootstrap Model?

A
  • Generates a distribution of possible outcomes rather than a single point estimate
    • Provides more information about the results which can be used in capital modeling
  • Can be modified to align with the statistical features of the data underlying the analysis
  • Can reflect the fact that insurance loss distributions tend to be skewed to the right.
    • the sampling process doesn’t need a distribution assumption and the underlying data is already skewed which is what the model is reflecting
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Shapland: Provide an overview of the Over-Dispersed Poisson Model.

A
  • Incremental claims q(w,d) are modeled directly using a GLM
  • GLM structure uses:
    • log link function
    • ODP error distribution

Steps:

  1. Use the GLM model to estimate parameters
  2. Use bootstrapping (sampling residuals with replacement) to estimate the total distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Shapland: List the formulas that are used to parameterize the GLM.

A

E[q(w,d)] = mw,d

Var(q(w,d)) = ømzw,d

ln(mw,d) = ηw,d

ηw,d = ⍺w + Σd=2ßd

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Shapland: What does the power Z represent in the parameterized GLM?

A

z is used to specify the error distribution

Error Distribution

z = 0 → Normal Distribution

z = 1 → Poisson Distribution

z = 2 → Gamma Distribution

z = 3 → Inverse Guassian Distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Shapland: Set up a GLM model for a 3 x 3 triangle.

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Shapland: Explain how you would solve the GLM model.

A

Solve the ⍺ and ß parameters of the Y = X + A matrix equation that minimizes the squared difference between the vector of the log of actual incremental losses (Y) and the log of expected incremental losses, the Solution Matrix.

  • Can use the iteratively weighted least squares (as in excel files), maximum liklihood or Newton-Raphson to solve for the parameters in the A vector.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Shapland: What formulas do you need to solve for the fitted incrementals in the GLM model?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Shapland: Discuss the usefulness of the Simplified GLM framework.

A
  • The fitted incremental values using the Poisson error distribution assumption are the same as the incremental losses using volume-weighted average LDFS.

Simplified GLM Method

  1. Use cumulative claim triangle to calculate LDFs
  2. Develop losses to ultimate
  3. Calculate the expected cumulative triangle by using the last cumulative diagonal, dividing backwards successively by each volume weighted LDF.
  4. Calculate the expected incremental triangle from the cumulative triangle
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Shapland: What are the advantages of the Simplified GLM Framework?

A
  1. GLM can be replaced with a simpler development factor approach and still be based on the underlying GLM framework.
  2. Using age-to-age factors serves as a “bridge” to the deterministic framework which allows the model to be easily explained to others.
  3. We can use link ratios to get a solution where there are negative incremental losses. A GLM with log-link function may not have a solution.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Shapland: Unscaled Pearson Residuals

rw,d = ?

A

rw,d = [q(w,d) - mw,d] / √mzw,d

Unscaled Pearson Residual AY,d = (Actual LossAY,d - Expected LossAY,d) / Expected Incurred LosszAY,d

  • Pearson residuals are desirable since they are calculated consistently with the scale parameter
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Shapland: Scale Parameter

Ø =

A

ø = Σr2w,d / (N-p)

N = # of incremental values

p = # of parameters (⍺, ß, hetero-adj parameters)

For a simple GLM:

p = # AY’s + # LDFs = 2*(number of AYs) - 1

34
Q

Shapland: What assumption is necessary about residuals for bootstrapped samples?

A

Sampling with replacement assumes that the residuals are independent and identically distributed (iid) and does not require residuals to be normally distributed. (pg 12)

35
Q

Shapland: Sampled incremental loss for a bootstrap model

q*(w, d) = ?

A

q*(w, d) = r* · √(mzw,d) + mw,d

IncLosssimAY,d = residualsim · √E[IncLoss]z + E[IncLoss]

Note:

If mw,d < 0, use the following:

q*(w, d) = r* · √abs(mzw,d) + mw,d

36
Q

Shapland: Standardized Pearson Residuals

rHw,d = ?

A

rHw,d = rw,d · fHw,d

fHw,d = [1 / (1 - Hw,d)]1/2

where fHw,d → Hat Matrix adjustment factor

37
Q

Shapland: Describe the process outlined in Shapland to create a distribution of point estimates via the ODP bootstrap method.

A

1) Create a sample triangle of incremental losses using sampled standardized Pearson residuals, r*, and expected incremental losses from the model, mw,d

q*(w, d) = r*√mzw,d + mw,d

2) Calculate the cumulative triangle and LDFs from the simulated triangle
3) Calculate the point estimate of unpaid losses for the sampled data
4) Run steps 1-3 for many samples to get a distribution of point estimates

Note: These steps ignore process variance. We can add process variance to future incremental values using a Gamma distribution.

38
Q

Shapland: How can you add process variance to future incremental values in a bootstrap model?

A

qsim(w, d) ~ Gamma(mean = miterw,d, var = ø·miterw,d)

qsim(w, d) ⇒ Simulated IncLossw,d with process variance

miterw,d is the expected future incremental for this iteration, calculated from the sampled bootstrap triangle

Note: Here miterw,d is difference for each iteration and different than the mw,d used to calculate the triangle of residuals

39
Q

Shapland: What are two important points about the ODP bootstrap process (sampling residuals) as described by England & Verrall?

A
  1. Sampling of residuals should exclude zero-value residuals
    • typically in corners of triangle
    • these cells will have some variance but since we don’t know what that variance is we sample from the remaining residuals
  2. The hat matrix adjustment factor is a replacement for, and improvement on, the degrees of freedom factor.

(pg. 13)

40
Q

Shapland: Standardized Pearson Scale Parameter

øH = ?

A

øH = Σ(rw,d)2 / N

Note: In the bootstrap model, use the unscaled Pearson scale parameter, ø. The standardized Pearson parameter could be used to approximate the scale parameter.

41
Q

Shapland: The ODP bootstrap model can be used to model both paid and incurred losses. However, if modeling incurred losses we end up getting possible outcomes for IBNR not unpaid losses.

What are 2 possible approaches for modeling the unpaid loss distribution using incurred losses?

A
  1. Model incurred losses and then convert the ultimate values to a payment pattern
  • Run paid data model in parallel with the incurred data model and use random payment pattern from each iteration from paid model to convert ultimate values from the incurred method to a payment pattern
  • Benefit is that OS are included so the ultimate estimates are improved
  • Also creates a distribution of IBNR and Case Reserves
  1. Modeling paid losses and case reserves separately
  • Apply the bootstrap model to the Munich chain ladder model
  • Advantages over first approach is not running paid losses twice and including the correlation between paid and outstanding losses
42
Q

Shapland: A common issue with using ODP bootstrap model is that the distribution of the most recent AY’s can produce volatile results compared to more mature AY’s.

Discuss 2 ways you could alleviate this problem.

A

BF Method

  • Incorporate BF model by using a priori loss ratios for each AY with standard deviations for each loss ratio and an assumed distribution
  • During simulation, for each iteration, simulate a new a priori loss ratio

Cape Cod Method

  • Apply the Cape Cod algorithm to each iteration of the bootstrap model
43
Q

Shapland: There are two limitations of the CL model and therefore the ODP bootstrap of the CL model.

What are the two limitations?

How can you overcome these issues?

A

Limitations

  1. Does not measure nor adjust for calendar year effects
  2. May include a significant number of parameters and therefore over-fits the model to the data

Solutions

  1. Reduce the number of parameters in the model
  2. Add calendar year effects by adding another parameter (CY trends or diagonal) to the model (ɣ)
44
Q

Shapland: List the pros and cons of using fewer parameters in the generalized ODP model.

A

Pros

1) Helps avoid potentially over-parameterizing the model
2) Allows the ability to add parameters for calendar-year trends
3) Can be used to model data shapes other than data in triangle form

→e.g. missing incremental losses in first few diagonals

Cons

1) GLM must be solved for each iteration of the bootstrap model, slowing simulations
2) The model is no longer directly explainable to others using age-to-age factors

45
Q

Shapland: How do you handle negative incremental values in the GLM framework (GLM & ODP Bootstrap)?

A
  1. Modify the log-link function
    * use this when the sum of the incremental loss in the development column is > 0
  2. Add a constant
    * when entire development column is less than zero
46
Q

Shapland: Negative Incremental Values

Modified Log-Link

(GLM Bootstrap)

A

When the sum of the incremental losses in the development column is greater than zero, we can modiy the log-link triangle calculations:

q(w, d) > 0

ln[q(w, d)]* = ln[q(w, d)]

q(w, d) = 0

ln[q(w, d)]* = 0

q(w, d) <0

ln[q(w, d)]* = -ln[abs(q(w, d))]

47
Q

Shapland: Negative Incremental Values

Negative Development Periods (Constant)

(GLM Bootstrap)

A

When the sum of the incremental losses in the development triangle are < 0:

1) Shift incremental losses up by the size of the largest negative incremental value, Ψ.

q+(w, d) = q(w, d) - Ψ

2) Calculate the log-link triangle, run the GLM and then caluclated the fitted incremental values, mw,d+.
3) Shift the fitted incremental losses back down by Ψ:

mw,d = mw,d+ + Ψ

*Taking out the negatives before running the GLM bootstrap, then adding back the negatives when you have the projected incremental losses

48
Q

Shapland: Negative Incremental Values

Simplified GLM Adjustments

(ODP Bootstrap)

A

When fitted incrementals (mw,d) are negative, make the following adjustments to the formulas for residuals and sumulated incremental values:

rw,d = [q(w, d) - mw,d] / √abs(mzw,d)

q*(w, d) = r* · √abs(mzw,d) + mw,d

49
Q

Shapland: Negative values can impact the process variance in the simulation process. How can this be resolved?

A

Negative Values - Process Variance

When the fitted incremental (mw,d) is negative, there are two options to simulate incrementals from a Gamma distribution with absolute values:

  • Change the sign of the simulated value

-Gamma[mean = |mw,d|, var = ø·|mw,d|]

⇒BUT, this results in a skewed left distribution

  • Shift the entire distribution to have a mean of mw,d

Gamma[mean = |mw,d|, var = ø·|mw,d|] + 2mw,d

⇒this keeps the distribution skewed to the right

50
Q

Shapland: What is the issue with negative incremental values during the simulation?

A

Negative incremental losses may cause extreme outcomes for some iterations.

Example

They may cause cumulative values in an early development column to sum to near zero and the next column to be much larger.

→This results in extremely large LDFs and central estimates for an iteration.

51
Q

Shapland: What can you do to address the issue with extreme outcomes from negative incremental values during simulation?

A

There are 3 options:

1) Remove extreme iterations from the results
* BUT only remove truly unreasonable iterations
2) Recalibrate the model after identifying the sources of negative incremental losses
* e.g. remove a row with sparse data when the product was first written
3) Limit incremental losses to zero
* Replace negative incremental loss in original data with zero incremental loss

52
Q

Shapland: Another practical issue that arises with the ODP bootstrap models is:

Non-Zero Sum of Residuals

What is this and how can it be dealt with in the model?

A
  • Residuals calculated in a bootstrap model are just error terms, so they should be iid with a mean of zero

→BUT usually the average of all residuals will be non-zero

  • Since residuals are random observations, a non-zero sum of all residuals is not necessarily incompatible with the true distribution
  • To set the average residual to zero, one option is to add a constant to all residuals:

r*i = ri - rbar

53
Q

Shapland: Another practical issue that arises with the ODP bootstrap models is:

Using a N-Year Weighted Average

What is this and how can it be dealt with in the model?

A

Common for actuaries to use weighted averages that are less than all years in CL or related methods. GLM’s can also be adjusted to consider the data in the most recent diagonals.

With GLM Framework

  • Exclude the first few diagonals and only use N+1 diagonals to parameterize the model (data is now a trapezoid instead of a triangle)
  • Run bootstrap simulations and only sample residuals for the trapezoid that’s used to parameterize the model

With Simplified GLM

  • Calculate N-year average LDFs
  • Run bootstrap simulation, sampling residuals for the entire triangle in order to calculate cumulative values
  • Use N-year average factors to project future expected values for each iteration
54
Q

Shapland: Another practical issue that arises with the ODP bootstrap models is:

Missing Values

What is this and how can it be dealt with in the model?

A

Examples: Missing the oldest diagonals (if data were lost) or missing values in the middle of the triangle

Calculations Affected

LDFs, fitted triangle (if missing the latest diagonal), residuals, degrees of freedom

Solution 1: Estimate the missing values from surrounding values

Solution 2: Modify LDFs to exclude the missing value, no residual for missing value → Don’t resample from missing values

Solution 3: If missing value is on the latest diagonal, estimate the value or use value in the 2nd to last diagonal to fill the triangle, and use judgement to determine their effect on the resulting distribution.

55
Q

Shapland: Another pactical issue that arises with the ODP bootstrap models is:

Outliers

What is this and how can it be dealt with in the model?

A

There may be outliers that are not representative of the variablility of the dataset in the future, so we may want to remove them.

  • Outliers could be removed and treated as missing values
  • Identify outliers and exclude from LDFs and residual calculations, but resample the corresponding incremental when simulating triangles

Remove outliers cautiously and only after understanding the data.

56
Q

Shapland: Another practical issue that arises with the ODP bootstrap models is:

Heteroscedasticity

What is this and why is it a problem?

A

Heteroscedasticity

When Pearson residuals have different levels of variability at different ages.

Why heteroscedasticity is a problem:

ODP bootstrap model assumes standardized Pearson residuals are IID.

  • With heteroscedasticity, we can’t take the residuals from one development period and use them in other development periods

​Considerations when assessing heteroscedasticity:

  • Account for the credibility of the observed data
  • Account for the fact that there are fewer residuals in older development periods
57
Q

Shapland: Another practical issue that arises with the ODP bootstrap models is:

Heteroscedasticity

How can you adjust for heteroscedasticity in your model?

A
  1. Stratified Sampling
  2. Calculating hetero-adjustment (or variance) parameters
  3. Calculate non-constant scale parameters
58
Q

Shapland: Adjusting for Heteroscedasticity

Stratified Sampling

A

Option 1: Stratified Sampling

  1. Organize development periods by groups with homogenous variances
  2. For each group: Sample with replacement only from residuals in that group

BUT: Some groups only have a few residuals in them, which limits amount of variability in possible outcomes

59
Q

Shapland: Adjusting for Heteroscedasticity

Standard Deviation

(Calculating hetero-adjustment parameters)

A

Option 2: Hetero-Adjustment Variance Parameters

*Apply to standardized residuals (q M8)

60
Q

Shapland: Adjusting for Heteroscedasticity

Scale Parameter

A

*Uses unscaled residuals to calculate the ø, then applies the h factor to the standardized residual

61
Q

Shapland: What are the pros and cons of adjusting for heteroscedasticity using hetero-adjustment factors?

A

Pro:

Can resample with replacement from the entire triangle

Con:

Adds parameters which affects the degrees of freedom and the scale parameter

62
Q

Shapland: Another practical issue that arises with the ODP bootstrap models is:

Heteroecthesious Data - Partial First Development Period

What is this and how can you adjust for it?

A

Heteroecthesious Data - Partial First Development Period

This occurs when the first development period has a different exposure period length than the other columns

⇒e.g. the first column has 6 months of data but the rest have 12 months

Adjustments

  • Reduce the latest accident year’s future incremental losses to be proportional to the level of earned exposure in the first period.
    • the LDF will extrapoloate the 6 months to a full year (e.g. 6-18 months); this needs to be reduced back into 6 months
  • Then simulate process variance (or reduce the incremental losses after process variance step)
63
Q

Shapland: Another practical issue that arises with the ODP bootstrap models is:

Heteroecthesious Data - Partial Last CY Period Data

What is this and how can you adjust for it?

A

Adjustments

a) Annualize exposures in the last partial diagonal
b) Calculate the fitted triangle and residuals
c) During the ODP bootstrap simulation, calculate and interpolate the LDF’s from the fully annualized sample triangles
d) Adjust the last diagonal from the sample triangles to de-annualize incremental losses on the last diagonal
e) Project future values by multiplying the interpolated LDFs with the new cumulative values
f) Reduce the future incremental values for the latest accident year to remove future exposure

64
Q

Shapland: Another practical issue that arises with the ODP bootstrap models is:

Exposure Adjustment

What is this and how can it be dealt with in the model?

A

Issue: Exposures have changed significantly over the years such as rapidly growing line or the line is in runoff.

Adjustment

  • If EE exist, divide all the claims data by exposures for each accident year to run the model with pure premiums
  • After the process variance step, multiply the result by accident year exposures to get total claims
65
Q

Shapland: Another practical issue that arises with the ODP bootstrap models is:

Tail Factors

What is this and how can it be dealt with in the model?

A

Issue: development may go beyond the triangle and tail factors are often used to extrapolate development.

Adjustment

  • Add tail factor to the ODP bootstrap by assuming the tail factor parameter follows a distribution
  • Once this is added, other considerations such as process variance, hetero-adjustment factors, etc. can be extended to include tail factors.

Considerations: The tail factor may be a cumulative LDF so you may need to break down into age-to-age factors going out say 5 years or so.

66
Q

Shapland: Another practical issue that arises with the ODP bootstrap models is:

Level of Extreme Simulations

What is this and how can it be dealt with in the model?

A

Issue: the number of data points used to parameterize the ODP bootstrap is limited which makes it difficult to determine whether the most extreme observation is a 1-in-100 or a 1-in-1000 year event.

Adjustment

Parametric bootstrapping is a way to overcome a lack of extreme residuals in the ODP bootstrap model.

Steps:

1) Fit a parameterized distribution to the residuals
2) Resample residual from the distribution instead of observed residuals.

67
Q

Shapland: What do you do if there are a significant number of outliers?

A
  • This may indicate the model is a poor fit.
  1. For the GLM bootstrap, you could choose new parameters or change the error distribution (z)
  2. For the ODP bootstrap, you could use an L-year weighted average to get a better fit.
    • KEEP in mind - ODP bootstrap doesn’t use a specific distribution for residuals (assumes IID)
    • Thus, a large number of outliers could mean the residuals are highly skewed. If skewness is real, then outliers should be included in the fitting process to replicate the true nature of the residuals.
68
Q

Shapland: The key diagnostic tests are designed for 3 purposes. What are they?

What is the main objective of examining diagnostic tests?

A
  1. Test the assumptions of the model
  2. Gauge the quality of the model fit to the data
  3. Help guide adjustments of the model parameters to improve the fit of the model

Objective

Find a set of models and parameters that results in the most realistic and most consistent simulations based on the statistical features of the data.

69
Q

Shapland: What are residuals graphs used for? What are you looking for?

A

Residual graphs help test the assumption that residuals are IID.

Plots to look at:

  • Residual vs. Development Period - looks for heteroscedasticity
  • Residuals vs. AY Period
  • Residuals vs. Payment Period - calendar year
  • Residuals vs. Predicted

→Look for issues with trends, look at the variance between residuals by AY and development period (does the first few periods show different variance compared to the middle or last development or AY’s), the mean line should be around zero

70
Q

Shapland: Discuss the following diagnostics test:

Normality Test

A

The normality test compares residuals to the normal distribution. If residuals are close to normal, you should see the following:

  • Normality plot with residuals in line with the diagonal line (normally distributed)
  • High R2 value and p-value greater than 5%

Note: In the ODP bootstrap, residuals don’t need to be normally distributed.

This test is good for comparing parameters and gauging skewness.

71
Q

Shapland: Goodness of Fit Test

Formulas for AIC and BIC

A
72
Q

Shapland: Diagnostic Tests

Identifying Outliers

A

Box-Whisker plots can be used to identify outliers:

  • Box shows the 25th & 75th percentile
  • Whiskers extend to the largest values within 3 times the inter-quartile range
  • Values outside whiskers are outliers
73
Q

Shapland: Diagnostic Test

How should outliers be handled?

A
  • If outliers represent scenarios that can’t be expected to happen again, then it makes sense to remove them.
  • Use extreme caution when removing outliers because they may represent realistic extremes that should be kept in the analysis.
74
Q

Shapland: Diagnostic Test - Reviewing Model Results

Estimated Unpaid Results

A

When reviewing the estimated unpaid loss results, you should look at the following:

  • SE should increase from the oldest AY to most recent years
  • SE for all AY’s should be larger than any individual year
  • CoV should decrease from the oldest to most recent years due to independence in incremental payment stream
  • A reversal of CoV in recent years could be due to:
    • increasing parameter uncertainty in more recent years
    • model may over-estimate uncertainty in recent years; we may want to switch to CC or BF model
  • Minimum/maximum simulations should be reasonable

Key: SE, CoV & Min/Max

75
Q

Shapland: What methods can be used to combine the results from multiple models?

A

Run models with the same random variables

1) Simulate random variables for each iteration
2) Use same set of random variables for each model
3) Use model weights to weight incremental values from each model for each iteration by accident year

Run models with independent random variables

1) Run each model separately with different random variables
2) Use weights to randomly select a model for each iteration by AY so that the result is a weighted mixture of models

76
Q

Shapland: Estimating Cash Flow Results

What are you looking for?

A

A model’s output may be reviewed by calendar year or future diagonal.

Simulation of unpaid losses by calendar year have the following characteristics:

  • Standard Error of CY unpaid losses decrease as calendar year increases in future
  • Coefficient of variation increases as CY increases

⇒This is because the final payments projected farthest out will be the smallest and most uncertain

77
Q

Shapland: Estimated Ultimate Loss Ratios

What are you looking for?

A
  • Estimated ultimate loss ratios by accident year are calculated using all simulated values, not just the future unpaid values
    • if we are interested in the remaining volatility in the loss ratio, then add unpaid losses with paid losses and divide by premium by year
  • Represents the complete variability in loss ratio for each accident year
  • Loss ratio distributions can be used for projecting pricing risk
78
Q

Shapland: Results for entire business unit can be estimated after each business segment has been analyzed and weighted into best estimates. This is done by aggregating. In order to estimate the possible outcomes for the company as a whole, a correlation of results is needed between segments.

Briefly discuss issues with correlation methods.

A
  • Both location mapping and re-sorting methods use residuals of incremental future losses to correlate segments

→both tend to create overall correlations close to zero

  • For reserve risk, the correlation that is desired is between total unpaid amount for two segments so there may be a disconnect
79
Q

Shapland: Correlation between Business Segments

Re-Sorting

A

Uses algorithms such as Copula or Iman-Conover to add correlation

Advantages

  • Data triangles can be different shapes/sizes by segment
  • Can use different correlation assumptions
  • Different correlation algorithms may have other beneficial impacts on the aggregate distribution
    • For example, you can use a copula with a heavy tail distribution to strengthen the correlation between segments in the tail, which is important for risk-based capital modeling
80
Q

Shapland: Correlation between Business Segments

Location Mapping

A

For each iteration, sample the residuals from the residual triangles using the same locations for all business segments.

Advantages

  • Method is easily implemented - data doesn’t require an estimated correlation matrix and preserves the correlation of the original residuals

Disadvantages

  • All segments need to have the same size data triangles with no missing data
  • Correlation of original residuals is used, so we can’t test other correlation assumptions