Meyers Flashcards

1
Q

Three possible explanations when a model does not accurately predict the distribution of outcomes for test data

A
  1. Insurance process is too dynamic to be captured by single model
  2. Could be other models that better fit data
  3. Data used to calibrate the model is missing crucial information needed to make a reliable prediction
2
Q

3 tests to validate models

A
  1. histogram
  2. p-p plot
  3. K-S statistic
3
Q

histogram

A
  • if percentiles are uniformly distributed, height of bars should be equal
  • for small sample, not perfectly level
  • if level, model is appropriate
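
The histogram check can be sketched in a few lines. This is a minimal illustration, assuming the predicted percentiles are on a 0-100 scale; the function name `percentile_histogram`, the choice of ten bins, and the sample data are all hypothetical and not taken from the source.

```python
def percentile_histogram(percentiles, n_bins=10):
    """Count how many predicted percentiles (0-100 scale) fall in each equal-width bin."""
    counts = [0] * n_bins
    width = 100.0 / n_bins
    for p in percentiles:
        # Clamp so that a percentile of exactly 100 lands in the last bin.
        idx = min(int(p // width), n_bins - 1)
        counts[idx] += 1
    return counts


# Hypothetical percentiles for 20 insurers, evenly spread: bars are level.
level = [2.5 + 5 * i for i in range(20)]
print(percentile_histogram(level))  # each bin holds 2

# Hypothetical light-tailed model: outcomes pile up in the extreme bins.
light_tailed = [1, 2, 3, 4, 5, 6, 50, 94, 95, 96, 97, 98, 99]
print(percentile_histogram(light_tailed))
```

Level bars are consistent with an appropriate model; tall bars at both ends suggest the predictive distribution is too narrow.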
4
Q

p-p plot

A
  • tests for stat significance of uniformity
  • plot expected percentiles on x and sorted predicted percentiles on y -> if predicted percentiles are uniformly dist, plot lies along 45 degree line

ie model is appropriate if p-p plot lies along 45 degree line

expected percentiles e = 100*{1/(n+1),…,n/(n+1)} (same 0-100 scale as the predicted percentiles)
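
As a rough sketch of how the p-p plot points are built, assuming percentiles are expressed on a 0-100 scale (the function name `pp_points` and the four sample values are hypothetical):

```python
def pp_points(predicted_percentiles):
    """Return (expected, sorted predicted) pairs for a p-p plot on a 0-100 scale."""
    n = len(predicted_percentiles)
    expected = [100.0 * i / (n + 1) for i in range(1, n + 1)]
    return list(zip(expected, sorted(predicted_percentiles)))


# Hypothetical predicted percentiles for four insurers.
points = pp_points([80.0, 20.0, 60.0, 40.0])
for e, p in points:
    print(f"expected={e:.1f}  predicted={p:.1f}")
```

In this toy sample every point sits exactly on the 45 degree line, which is the pattern an appropriate model would approximate.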

5
Q

K-S statistic

A

D = max|p_i - f_i|, where p_i are the sorted predicted percentiles

f_i = 100*{1/n,…,n/n}

  • can reject hypothesis that set of percentiles is uniform @ 5% level if D > critical value = 136/sqrt(n)
  • critical values appear as 45 degree bands that run parallel to y=x
  • Meyers deems model validated if passes K-S test
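
The test above can be sketched as follows, using the formula for D and the 5% critical value exactly as quoted on this card. The function names and both sample data sets are hypothetical; percentiles are assumed to be on a 0-100 scale.

```python
import math


def ks_statistic(predicted_percentiles):
    """D = max |p_i - f_i| with sorted p_i and f_i = 100 * i / n (0-100 scale)."""
    p = sorted(predicted_percentiles)
    n = len(p)
    return max(abs(p_i - 100.0 * i / n) for i, p_i in enumerate(p, start=1))


def ks_critical_value(n):
    """5% critical value quoted on the card: 136 / sqrt(n)."""
    return 136.0 / math.sqrt(n)


def passes_ks(predicted_percentiles):
    """True when uniformity cannot be rejected at the 5% level."""
    n = len(predicted_percentiles)
    return ks_statistic(predicted_percentiles) <= ks_critical_value(n)


# Hypothetical samples: 50 evenly spread percentiles versus 50 bunched in the tails.
even = [100.0 * (i - 0.5) / 50 for i in range(1, 51)]
tails = [1.0] * 25 + [99.0] * 25
print(ks_statistic(even), ks_critical_value(50), passes_ks(even))
print(ks_statistic(tails), passes_ks(tails))
```

The evenly spread sample stays well inside the critical value, while the sample bunched in the tails is rejected.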
6
Q

Validating Mack: results

A
  • incurred data
  • on the histogram, percentiles show little uniformity: actual outcomes fall into the smallest and largest percentiles more often than expected -> Mack produces a distribution that is light tailed
  • in the p-p plot, predicted percentiles form an S shape -> light tailed because actual outcomes fall into percentiles that are lower than expected in the left tail and higher than expected in the right tail
  • D > critical value
7
Q

Validating ODPB: results

A
  • paid data
  • actual outcomes fall in the lower percentiles more often than expected -> implies both models (Mack and ODP bootstrap) produce expected loss estimates that are biased high when modeling paid losses
  • with higher expected loss estimates, the left tail of the predictive distribution becomes lighter
  • D > critical value
8
Q

Possible reasons for observations for paid and incd data ie Mack and ODPB results

A
  • the insurance loss environment has experienced changes that are not yet observable
  • there may be other models that can be validated
9
Q

Bayesian models for Incurred loss data

A

-Mack model underestimates variability of predictive distribution which leads to light tails

Leveled Chain Ladder (LCL)

Correlated Chain Ladder (CCL)

10
Q

Leveled Chain Ladder (LCL)

A
  • treats level of AY as random ie independence between AY -> model will predict more risk
  • sigma is larger for earlier DPs where more claims open and more variability
11
Q

Correlated Chain Ladder (CCL)

A
  • allows for correlation between AYs -> model will predict more risk than LCL
  • should result in a larger standard deviation for the predictive distribution (heavier tails), which should make the percentiles of outcomes more uniform than under LCL
12
Q

LCL results

A
  • produce higher std dev than Mack
  • has S shape and some points lie outside K-S bounds, but improvement over Mack, & D is closer to critical value
13
Q

CCL results

A
  • produce higher std dev than Mack
  • CCL produced higher std dev for each AY than LCL
  • CCL has S shape and all points within bounds and D is smaller than critical value -> model validates against data and exhibits uniformity
14
Q

Bayesian models for Paid loss data

A

-CCL model produced estimates that were biased high

Correlated Incremental Trend (CIT)

Leveled Incremental Trend (LIT)

15
Q

Correlated Incremental Trend (CIT)

A
  • introduces a payment year trend; the distribution is skewed right and allows for negative values (model should be based on incremental paid losses)
  • sigma is smaller for earlier DPs
  • opposite of LCL because the model uses incremental losses
16
Q

Leveled Incremental Trend (LIT)

A

-similar to CIT but does not have AY correlation

17
Q

results for CIT and LIT

A
  • CIT and LIT produce estimates that are biased high
  • neither show noticeable improvement over ODP and Mack
18
Q

Changing Settlement Rate (CSR)

A
  • claims are reported and settled faster due to technology, and the CIT model might not fully reflect this change
  • allows for changing settlement rates, which can reflect a speedup in claim settlement for more recent AYs
  • modeled on cumulative paid losses, since the payment year trend is no longer considered
19
Q

CSR results

A
  • histogram is nearly level
  • p-p plot closely tracks with y=x
  • indicates that incurred data recognized speedup in claims settlement rate which led to good fit with CCL
20
Q

total risk

A

total risk = process risk + parameter risk

21
Q

process risk vs parameter risk

A
  • process risk = avg var of outcomes from expected result
  • parameter risk = var due to many possible parameters in posterior dist of parameter
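
A small numeric sketch of the split, assuming three equally likely parameter sets with made-up means and variances (all values are hypothetical and only illustrate the arithmetic):

```python
from statistics import fmean, pvariance

# Hypothetical posterior sample: each tuple is (mean, variance) of the outcome
# under one parameter set; all parameter sets are equally likely.
parameter_sets = [(100.0, 25.0), (120.0, 36.0), (140.0, 49.0)]

means = [m for m, _ in parameter_sets]
variances = [v for _, v in parameter_sets]

process_risk = fmean(variances)    # average variance of outcomes around their expected value
parameter_risk = pvariance(means)  # variance of the expected value across parameter sets
total_risk = process_risk + parameter_risk

print(process_risk, parameter_risk, total_risk)
```

In this made-up example the spread of the means across parameter sets contributes most of the total.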
22
Q

Meyers found what risk is close to total risk for several insurers

A

parameter risk

23
Q

model risk

A
  • model risk = risk that we did not select right model
  • model risk is special case of parameter risk
24
Q

if p-p plot shows S curve

A

demonstrates more high and low percentiles than expected

45 degree = uniformly distributed
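
To see why a too-narrow predictive distribution gives this pattern, here is a simulation sketch. The normal distributions and their standard deviations are hypothetical stand-ins, not the models from the source.

```python
import random
from statistics import NormalDist

random.seed(0)  # fixed seed so the illustration is reproducible

model = NormalDist(mu=0.0, sigma=1.0)  # hypothetical predictive distribution (too narrow)
n = 1000
outcomes = [random.gauss(0.0, 2.0) for _ in range(n)]  # hypothetical "actual" outcomes

# Percentile of each outcome under the model's predictive distribution.
percentiles = sorted(100.0 * model.cdf(x) for x in outcomes)

low = sum(p < 10 for p in percentiles) / n
high = sum(p > 90 for p in percentiles) / n
print(f"share below 10th percentile: {low:.2f}")
print(f"share above 90th percentile: {high:.2f}")
```

If the model were appropriate, each share would be near 0.10. Here both are noticeably larger, so the sorted percentiles sit below the 45 degree line on the left and above it on the right, forming the S shape.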

25
Q

CCL: distribution for ultimate loss

A

start with the given loss C(w,d) and calculate u(w,d) for the first AY using:

u(1,d)=alpha(1)+beta(d)

for later AYs, calculate the parameters of the distribution for ~C by correlating with the prior AY:

~u(w,d)=alpha(w)+beta(d)+rho*(ln(C(w-1,d))-u(w-1,d))

C(w-1,d) and u(w-1,d) come from the previous step

~C(w,d) is simulated from a lognormal with log mean ~u(w,d) and log std dev σ(d)
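
The recursion above can be sketched for a single development period. This is an illustration only: the function name `simulate_ccl_column` and every parameter value are hypothetical, and the first AY is simulated here rather than taken from observed data.

```python
import math
import random


def simulate_ccl_column(alpha, beta_d, rho, sigma_d, rng):
    """Simulate C(w, d) for w = 1..len(alpha) at one development period d.

    alpha   : level parameters alpha(1), ..., alpha(W)
    beta_d  : development parameter beta(d)
    rho     : accident-year correlation parameter
    sigma_d : log standard deviation sigma(d)
    """
    losses, log_means = [], []
    for w, alpha_w in enumerate(alpha):
        mu = alpha_w + beta_d
        if w > 0:
            # Carry forward the previous accident year's deviation from its log mean.
            mu += rho * (math.log(losses[w - 1]) - log_means[w - 1])
        log_means.append(mu)
        losses.append(rng.lognormvariate(mu, sigma_d))
    return losses, log_means


# Hypothetical parameter values, chosen only for illustration.
rng = random.Random(42)
losses, log_means = simulate_ccl_column(
    alpha=[8.0, 8.1, 8.2], beta_d=0.0, rho=0.5, sigma_d=0.05, rng=rng
)
print(losses)
print(log_means)
```

With rho = 0 the correlation term drops out and each AY's log mean is simply alpha(w) + beta(d).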

26
Q

CSR: distribution for ultimate losses

A

u(w,d)=alpha(w)+beta(d)*(1-gamma)^(w-1)

C(w,d) is simulated from lognormal dist with log mean=u(w,d) and log std dev=σ(d)

27
Q

CSR: how gamma parameter impacts claims payment pattern

A

development period portion of the log mean formula is beta(d)*(1-gamma)^(w-1)

with gamma > 0, its absolute value gets smaller for later AYs

the larger this portion is, the larger the log mean, resulting in higher simulated losses

if beta(d) is negative (and gamma > 0), the portion becomes less negative for later AYs, so the log mean is larger

higher simulated losses at a given development period indicate a speedup in settlement rate for more recent AYs
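
A quick numeric check of this effect, assuming a negative beta(d) and the same alpha for every AY (the function name `csr_log_mean` and all values are hypothetical):

```python
def csr_log_mean(alpha_w, beta_d, gamma, w):
    """Log mean u(w, d) = alpha(w) + beta(d) * (1 - gamma)^(w - 1)."""
    return alpha_w + beta_d * (1.0 - gamma) ** (w - 1)


# Hypothetical values: beta(d) < 0 at an early development period, same alpha for every AY.
alpha_w, beta_d = 8.0, -1.0

for gamma in (0.0, 0.05):
    log_means = [csr_log_mean(alpha_w, beta_d, gamma, w) for w in range(1, 6)]
    print(gamma, [round(u, 4) for u in log_means])
```

With gamma = 0 the log mean is identical for every AY. With gamma = 0.05 it rises with each later AY, which corresponds to more of the loss being paid by that development period.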

28
Q

LCL compared to MACK

A

-compared to Mack, model uses random level parameters instead of fixed level parameters for most recent cumulative loss for the AY

29
Q

CCL compared to MACK

A
  • model uses random level parameters instead of fixed level parameters for most recent cumulative loss for the AY
  • model incorporates correlation between AY
30
Q

procedure used by CCL to create loss distribution for ultimate losses

A
  1. use loss triangle and prior distributions for CCL model, run MCMC script to estimate posterior distributions
  2. create sample sets of parameters from posterior distributions
  3. for each parameter set, simulate the ultimate losses, iterated for each AY
    - simulated loss C(w-1,10) is used to calculate u(w) for the next AY; C(w,10) is then simulated with u(w) and sigma(10)
  4. distribution and any summary statistics are calculated off of total ultimate losses across AY
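
Steps 2 to 4 can be sketched as a loop over parameter sets. This is a simplified illustration: the "posterior" draws are random stand-ins rather than real MCMC output, AY 1 is assumed fully developed, and the function and key names are hypothetical.

```python
import math
import random
from statistics import fmean, stdev


def simulate_total_ultimate(params, first_ay_ultimate, rng):
    """Simulate total ultimate loss for one posterior parameter set (steps 3-4).

    params holds alpha (levels for AY 1..W), beta10, rho and sigma10.
    AY 1 is assumed fully developed, so its ultimate is the observed value.
    """
    alpha, beta10 = params["alpha"], params["beta10"]
    rho, sigma10 = params["rho"], params["sigma10"]
    prev_loss = first_ay_ultimate
    prev_mu = alpha[0] + beta10
    total = first_ay_ultimate
    for alpha_w in alpha[1:]:
        # Log mean for this AY uses the previous AY's simulated (or observed) ultimate.
        mu = alpha_w + beta10 + rho * (math.log(prev_loss) - prev_mu)
        loss = rng.lognormvariate(mu, sigma10)
        total += loss
        prev_loss, prev_mu = loss, mu
    return total


rng = random.Random(1)

# Stand-in for step 2: hypothetical "posterior" draws. A real analysis would
# take these from the MCMC output of step 1, which is not reproduced here.
posterior_sample = [
    {
        "alpha": [8.0 + rng.gauss(0, 0.05) for _ in range(3)],
        "beta10": 0.0,
        "rho": rng.uniform(-0.5, 0.5),
        "sigma10": 0.05,
    }
    for _ in range(2000)
]

totals = [simulate_total_ultimate(p, 3000.0, rng) for p in posterior_sample]
print(f"mean={fmean(totals):.0f}  sd={stdev(totals):.0f}")
```

Because each simulated total comes from a different parameter set, the spread of `totals` reflects both parameter risk and process risk.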
31
Q

incorporating expert knowledge about expected losses in CCL

A

prior distribution for level parameters and logelr parameter can be specified to be more restrictive instead of using vague priors so that model better reflects expected losses
