Shapland Flashcards
3 advantages of Bootstrap Model
- Generates a distribution of possible outcomes as opposed to a single point estimate.
Provides more information about potential results and can be used for capital modeling.
- Can be modified to match the statistical features of the data under analysis.
- Can reflect the fact that insurance loss distributions are generally skewed right. This is because the sampling process does not require a distribution assumption.
Model reflects the level of skewness in the underlying data.
3 reasons for more focus by actuaries on unpaid claims distributions
- SEC is looking for more reserving risk information from publicly traded companies
- Major rating agencies have dynamic risk models for rating and welcome input from company actuaries about reserve distributions
- Companies use dynamic risk models for internal risk management and need unpaid claim distributions
Briefly describe the ODP Model
Incremental claims q(w,d) are modelled directly using a GLM.
Link function: Log
Distribution: ODP
Steps:
1. Use the model to estimate parameters
2. Use bootstrapping (sampling residuals with replacement) to estimate total distribution
Calculate E(q(w,d)) and V(q(w,d)) using ODP GLM Model
E(q(w,d)) = m_w,d
ln(m_w,d) = n_w,d
n_w,d = a_w + sum(b_d)
V(q(w,d)) = phi*m^z_w,d
z = 0 if Normal error distribution
z = 1 if Poisson error distribution
z = 2 if Gamma error distribution
Explain the GLM Model Setup for a 3x3 triangle
See image
Briefly explain how to solve for the parameters in the GLM model
Solve for a and b parameters of the Y = X*A matrix equation that minimizes the squared difference between vector of the log of actual incremental losses (Y) and the log of expected incremental losses (Yhat).
Use Maximum Likelihood or the Newton-Raphson method.
Calculate fitted incremental using GLM Model
ln(E(IncLoss_AY,d)) = ln(m_w,d) = n_w,d = a_w + sum(b_d)
E(IncLoss_AY,d) = m_w,d = exp(n_w,d)
Explain what is the simplified GLM Method and its steps
Fitted (expected) incremental values using a Poisson error distribution are the same as incremental losses calculated from volume-weighted average LDFs.
Steps:
1. Use cumulative claim triangle to calculate LDFs
2. Develop losses to ultimate
3. Calculate expected cumulative triangle
4. Calculate expected incremental triangle from cumulative triangle
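A minimal sketch of the four steps, assuming a hypothetical 3x3 cumulative paid triangle (`None` marks future cells; numbers are illustrative only):

```python
# Illustrative 3x3 cumulative paid triangle (hypothetical values).
cum = [
    [100.0, 150.0, 165.0],
    [110.0, 168.0, None],
    [120.0, None,  None],
]
n = len(cum)

# Step 1: volume-weighted average LDFs from the cumulative triangle.
ldfs = []
for d in range(n - 1):
    num = sum(cum[w][d + 1] for w in range(n) if cum[w][d + 1] is not None)
    den = sum(cum[w][d] for w in range(n) if cum[w][d + 1] is not None)
    ldfs.append(num / den)

# Step 2: develop each year's latest diagonal to ultimate.
ult = []
for w in range(n):
    latest_d = n - 1 - w
    val = cum[w][latest_d]
    for d in range(latest_d, n - 1):
        val *= ldfs[d]
    ult.append(val)

# Step 3: expected cumulative triangle, dividing ultimates back down the LDFs.
fitted_cum = []
for w in range(n):
    row = [ult[w]]
    for d in range(n - 2, -1, -1):
        row.append(row[-1] / ldfs[d])
    fitted_cum.append(list(reversed(row)))

# Step 4: expected incremental triangle from the cumulative one.
fitted_inc = [[row[0]] + [row[d] - row[d - 1] for d in range(1, n)]
              for row in fitted_cum]
```

For a Poisson error distribution these fitted incremental values match the GLM's fitted values, which is the point of the simplification.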
3 advantages of the simplified GLM framework
- GLM can be replaced with simpler link ratio approach while still being grounded in the underlying GLM framework.
- Using age-to-age ratios serves as a “bridge” to the deterministic framework and allows the model to be more easily explained to others.
- We can still use link ratios to get a solution if there are negative incremental values, whereas the GLM with a log link might not have a solution.
Calculate Unscaled Pearson residual
r_w,d = (q(w,d) - m_w,d) / sqrt(m^z_w,d)
Unscaled Pearson residual = (Actual IncLoss - Expected IncLoss)/sqrt(Expected IncLoss^z)
Calculate Scale Parameter
phi = sum(unscaled residuals ^2)/(N-p)
N = # incremental values in triangle
p = #AYs + #LDFs + #hetero groups - 1
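The residual and scale-parameter formulas above can be sketched together (actual and fitted values are hypothetical; z = 1 for the ODP case):

```python
import math

# Hypothetical actual and fitted (expected) incremental losses, z = 1 (ODP).
actual = [100.0, 50.0, 15.0, 110.0, 58.0, 120.0]
fitted = [99.06, 50.94, 15.00, 108.94, 56.03, 119.00]

# Unscaled Pearson residuals: r = (q - m) / sqrt(m^z)
resid = [(q - m) / math.sqrt(m) for q, m in zip(actual, fitted)]

# Scale parameter: phi = sum(r^2) / (N - p)
N = len(resid)        # number of incremental values in the triangle
p = 3 + 2 - 1         # e.g. 3 AY params + 2 LDF params - 1, no hetero groups
phi = sum(r * r for r in resid) / (N - p)
```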
State the assumption about residuals necessary for bootstrapped samples
Residuals are independent and identically distributed (iid).
Note: no particular distribution is necessary. Whatever distribution the residuals have will flow into the simulated data.
Calculate sampled incremental loss for bootstrap model
q*(w,d) = r* * sqrt(m^z_w,d) + m_w,d
SimIncLoss(AY,d) = SimResidual * sqrt(E(IncLoss)^z) + E(IncLoss)
If m_w,d is negative, take the absolute value inside the square root.
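A minimal sketch of the sampling formula (the residual pool and fitted values are hypothetical; z = 1):

```python
import math
import random

random.seed(42)

# Hypothetical pool of standardized residuals to resample from.
residual_pool = [-1.2, -0.4, 0.0, 0.3, 0.8, 1.5]
fitted = [99.06, 50.94, 15.0]   # m_w,d for one row (illustrative)

# q*(w,d) = r* * sqrt(|m|^z) + m  -- abs() guards a negative fitted value
sampled = [random.choice(residual_pool) * math.sqrt(abs(m)) + m
           for m in fitted]
```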
Calculate Standardized Pearson Residuals
rH_w,d = r_w,d * fH_w,d
fH_w,d = sqrt(1 / (1-H_w,d)) = hat matrix adjustment factors
Explain the steps to create a distribution of point estimates
- Create a sample triangle of incremental losses using sample standardized Pearson residuals, r*, and expected incremental from model m_w,d.
- Calculate cumulative triangle and LDFs for simulated triangle.
- Calculate point estimate of unpaid losses for sampled data.
- Run steps 1-3 for many samples to get a distribution of point estimates.
Note: these steps ignore process variance. We can add process variance to future incremental values using a Gamma distribution.
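The steps above can be sketched as a minimal simulation loop (the fitted triangle and residual pool are hypothetical; z = 1, and process variance is ignored as the note says):

```python
import math
import random

random.seed(0)

# Hypothetical fitted incremental triangle m_w,d (None = future cells)
# and a pool of standardized residuals to resample from.
fitted_inc = [
    [99.06, 50.94, 15.00],
    [108.94, 56.03, None],
    [119.00, None, None],
]
resid_pool = [-1.1, -0.5, -0.2, 0.1, 0.4, 1.2]
n = len(fitted_inc)

point_estimates = []
for _ in range(1000):
    # 1. Sample a pseudo-triangle: q* = r* * sqrt(m) + m (z = 1).
    sample = [[random.choice(resid_pool) * math.sqrt(m) + m
               if m is not None else None for m in row]
              for row in fitted_inc]
    # 2. Cumulate the sample and compute volume-weighted LDFs.
    cum = []
    for row in sample:
        total, c = 0.0, []
        for v in row:
            if v is None:
                c.append(None)
            else:
                total += v
                c.append(total)
        cum.append(c)
    ldfs = []
    for d in range(n - 1):
        num = sum(cum[w][d + 1] for w in range(n) if cum[w][d + 1] is not None)
        den = sum(cum[w][d] for w in range(n) if cum[w][d + 1] is not None)
        ldfs.append(num / den)
    # 3. Chain-ladder point estimate of unpaid losses for this sample.
    unpaid = 0.0
    for w in range(n):
        latest = cum[w][n - 1 - w]
        ultimate = latest
        for d in range(n - 1 - w, n - 1):
            ultimate *= ldfs[d]
        unpaid += ultimate - latest
    # 4. Collect into the distribution of point estimates.
    point_estimates.append(unpaid)
```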
Explain how to add process variance to future incremental values in bootstrap model
qsim(w,d) follows Gamma (mean = m_w,d, var = phi*m_w,d)
m_w,d is the expected future incremental for this iteration, calculated from sampled bootstrap triangle.
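A sketch of the (mean, variance) Gamma parametrization (phi and m are hypothetical values):

```python
import random

random.seed(1)

def gamma_mean_var(mean, var, rng=random):
    # Python's gammavariate takes (shape k, scale theta):
    # mean = k*theta, var = k*theta^2  =>  theta = var/mean, k = mean^2/var
    theta = var / mean
    k = mean * mean / var
    return rng.gammavariate(k, theta)

phi = 1.8     # hypothetical scale parameter
m = 56.03     # hypothetical expected future incremental for this iteration
simulated = gamma_mean_var(m, phi * m)
```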
Calculate Standardized Pearson Scale Parameter
phiH = sum(rH_w,d ^2) / N
In the bootstrap model, use unscaled Pearson scale parameter.
Standardized Pearson scale parameter could be used to approximate scale parameter.
Explain the Bootstrapping BF Model
With the ODP bootstrap model, iterations for the latest few AYs can result in more variance than expected.
Incorporate BF model by using a priori loss ratios for each AY with standard deviations for each loss ratio and an assumed distribution.
During simulation, for each iteration simulate a new a priori loss ratio.
Explain the Bootstrapping Cape Cod Model
With the ODP bootstrap model, iterations for the latest few AYs can result in more variance than expected.
Apply the Cape Cod algorithm to each iteration of the bootstrap model.
3 pros of using fewer parameters in generalizing ODP model
- Helps avoid potentially over-parameterizing the model
- Allows ability to add parameters for calendar-year trends
- Can be used to model data shapes other than data in triangle form (e.g. missing incremental in first few diagonals)
2 cons of using fewer parameters in generalizing ODP model
- GLM must be solved for each iteration of the bootstrap model, slowing simulations.
- Model is no longer directly explainable to others using age-to-age factors
Explain how to correct for negative incremental values using the modified log-link.
When the sum of incremental losses in a development column is positive, modify the log-link triangle calculations:
ln(q(w,d)) = -ln(abs(q(w,d))) if q(w,d) is negative
Explain how to correct for negative incremental values using the negative development periods.
When the sum of the incremental losses in a development column is negative:
- Shift all incremental losses up by the size of the largest negative incremental value (psi):
q+(w,d) = q(w,d) - psi, where psi is negative
- Calculate the log-link triangle, run the GLM and calculate fitted incremental values m+_w,d
- Shift the fitted incremental losses back down by psi:
m_w,d = m+_w,d + psi
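A sketch of the shift (column values and fitted results are hypothetical; the GLM fit itself is elided):

```python
# Hypothetical incremental development column whose sum is negative.
col = [-8.0, -3.0, 2.0]

psi = min(col)                     # largest negative increment (psi < 0)
shifted = [q - psi for q in col]   # q+ = q - psi, all values now >= 0

# ... run the log-link GLM on the shifted triangle to get m+_w,d ...
m_plus = [1.2, 4.9, 9.9]           # hypothetical fitted values

m = [v + psi for v in m_plus]      # shift fitted back: m = m+ + psi
```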
Explain how to correct for negative incremental values using simplified GLM adjustments
When fitted incremental values (m_w,d) are negative, make the following adjustments to the formulas for residuals and simulated incremental values:
r_w,d = (q(w,d) - m_w,d) / sqrt(abs(m^z_w,d))
q*(w,d) = r* * sqrt(abs(m^z_w,d)) + m_w,d
i.e., switch m_w,d to abs(m_w,d) inside the square roots.
Explain how to correct for negative values during simulation using process variance
When a fitted incremental value (m_w,d) is negative, there are 2 options to simulate the incremental value from a Gamma distribution using absolute values:
1. Change the sign of the simulated value:
-Gamma(mean = abs(m_w,d), var = phi*abs(m_w,d))
BUT this results in a left-skewed distribution
2. Shift the entire distribution to have a mean of m_w,d:
Gamma(mean = abs(m_w,d), var = phi*abs(m_w,d)) + 2*m_w,d
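Both options can be sketched with Python's `random.gammavariate` (m and phi are hypothetical; note both options preserve the mean m, differing only in skew direction):

```python
import random

random.seed(2)

def simulate_negative_m(m, phi, option, rng=random):
    # m < 0: draw from Gamma(mean=|m|, var=phi*|m|), then adjust.
    a = abs(m)
    k = a / phi          # shape:  mean^2 / var = a^2 / (phi*a)
    theta = phi          # scale:  var / mean  = (phi*a) / a
    g = rng.gammavariate(k, theta)
    if option == 1:
        return -g        # sign change -> mean -|m| = m, but left-skewed
    return g + 2 * m     # shift -> mean |m| + 2m = m, keeps right skew
```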
Explain the problem with negative incremental values during simulation
Negative incremental may cause extreme outcomes for some iterations.
Example:
They may cause cumulative values in an early development column to sum to near zero and the next column to be much larger.
This results in extremely large LDFs and central estimates for an iteration.
3 options to address negative incremental values during simulation
- Remove extreme iterations from results
BUT only remove truly unreasonable iterations
- Recalibrate the model after identifying the sources of negative incremental values (ex: remove a row with sparse data when the product was first written)
- Limit incremental losses to zero (replace negative incremental in original data with a zero incremental loss)
Explain how to account for non-zero sum of residuals
Residuals calculated in the bootstrap model are just error terms, so they should be iid with a mean of zero.
But, usually the average of all residuals will be non-zero.
Since residuals are random observations, a non-zero sum of all residuals is not necessarily incompatible with true distribution.
To set the average residual to zero, one option is to add a constant to all residuals:
r* = r - rbar
3 solutions to handle missing values
Ex: missing oldest diagonals (if data was lost) or missing values in middle of the triangle.
Calculations affected: LDFs, fitted triangle, residuals, degrees of freedom
Solutions:
1. Estimate missing value from surrounding values
2. Modify LDFs to exclude missing value, no residual for missing value (do not resample from missing values)
3. If missing value on latest diagonal, estimate value or use value in second to last diagonal to get filled triangle, using judgment.
2 ways to handle outliers
There may be outliers that are not representative of the variability of the dataset in the future, so we may want to remove them.
- Outliers could be removed and treated as missing values.
- Identify outliers and exclude them from LDF and residual calculations, but resample the corresponding incremental values when simulating triangles.
Remove outliers cautiously and only after understanding data since they may represent realistic extremes that should be kept in analysis.
Explain heteroscedasticity and why it is a problem.
When Pearson residuals have different levels of variability at different ages.
The ODP bootstrap model assumes standardized Pearson residuals are iid. With heteroscedasticity, we cannot take residuals from one development period and use them in other development periods.
2 considerations when assessing heteroscedasticity
- Account for credibility of observed data
- Account for the fact that there are fewer residuals in older development periods.
Explain how to use stratified sampling to adjust for heteroscedasticity
Group periods together with similar residual variance and only sample residuals from the corresponding group for the model.
- Organize development periods by groups with homogeneous variances
- For each group, sample with replacement only from residuals in that group
BUT: some groups only have a few residuals in them, which limits the amount of variability in possible outcomes
Explain how to use standard deviation to adjust for heteroscedasticity
Group residuals with similar variances. Divide the total residual standard deviation by the standard deviation for each group to get each group’s hetero factor. Multiply residuals by the factor for each group to get all residuals to the same variance level. Then, sample residuals from the entire triangle and back out the adjustment.
- Calculate hetero-adjustment factors by group
hi = stddev(union of all rH_w,d) / stddev(union of rH_w,d in group i)
- Adjust residuals in each group
riH_w,d = rH_w,d * hi
- Resample residuals and back out hi
q*(w,d) = (r*/hi) * sqrt(m^z_w,d) + m_w,d
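The standard-deviation approach can be sketched as follows (residual groups are hypothetical):

```python
import statistics

# Hypothetical standardized residuals grouped by development age.
groups = {
    1: [-1.4, -0.6, 0.2, 0.9, 1.6],   # early ages: higher variance
    2: [-0.5, -0.1, 0.3, 0.6],
    3: [-0.2, 0.1],                   # older ages: lower variance
}

all_resid = [r for g in groups.values() for r in g]
total_sd = statistics.pstdev(all_resid)

# h_i = stddev(all residuals) / stddev(residuals in group i)
h = {i: total_sd / statistics.pstdev(g) for i, g in groups.items()}

# Multiply each group's residuals by h_i to bring all groups
# to the same variance level before resampling.
adjusted = {i: [r * h[i] for r in g] for i, g in groups.items()}
```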
Explain how to use scale parameter to adjust for heteroscedasticity
- Calculate overall scale parameter
phi = sum(residuals^2) / (N - p)
- Calculate scale parameter by group
phi_i = (N/(N-p)) * sum(residuals^2 in group i) / n_i
where n_i is the number of residuals in group i
- Calculate hetero-adjustment factor hi
hi = sqrt(phi/phi_i)
1 pro and 1 con of adjusting for heteroscedasticity using hetero-adjustment factors
Pro: can resample with replacement from entire triangle
Con: adds parameters, affecting degrees of freedom and scale parameter
Explain what first development period heteroecthesious data is and how to adjust for it.
This occurs when first development period has a different exposure period length than other columns
Ex: 6 months in the first column and 12 months in the rest
Adjustment:
Reduce the latest accident year's future incremental losses to be proportional to the level of earned exposure in the first period
Then simulate process variance (or reduce after process variance step)
Explain what last period heteroecthesious data is and how to adjust for it.
Partial last calendar period data.
- Annualize exposures in last partial diagonal
- Calculate fitted triangle and residuals
- Using ODP bootstrap simulation, calculate and interpolate LDFs from the fully annualized sample triangles
- Adjust last diagonal of the sample triangles to de-annualize incremental on the last diagonal
- Project future values by multiplying the interpolated LDFs with the new cumulative values
- Reduce future incremental values for the latest accident year to remove future exposure
Explain how to adjust for exposures changing significantly over the years (ex: rapidly growing line or line in runoff)
If earned exposures exist, divide all claims data by exposures for each accident year to run the model with pure premiums.
After process variance step, multiply the result by accident year exposures to get total claims.
Explain parametric bootstrapping
Purpose: way to overcome a lack of extreme residuals in an ODP bootstrap model
- Fit parametrized distribution to the residuals
- Resample residuals from distribution instead of the observed residuals
Explain the purposes of bootstrap diagnostics
Purpose:
Find a set of models and parameters that results in the most realistic and most consistent simulations based on statistical features of data.
- Test assumptions in model
- Gauge the quality of model fit to data
- Help guide adjustments of the model parameters to improve fit of the model
How can we use graphs to test assumption that residuals are IID
Plots to look at:
1. Residuals vs Development Period (look for heteroscedasticity)
2. Residuals vs Accident Period
3. Residuals vs Payment Period
4. Residuals vs Predicted
Look for issues with trends
Plot relative std dev of residuals and range of residuals to further test for heteroscedasticity
Explain the Normality Test
Compares residuals to the normal distribution. If residuals are close to normal, you should see:
1. Normality plot with residuals in line with diagonal line (normally distributed)
2. High R^2 and p-value greater than 5%
Note: in ODP bootstrap, residuals do not need to be normally distributed
Calculate AIC and BIC to test normality of residuals
AIC = 2p + n*(ln(2*pi*RSS/n) + 1)
BIC = nln(RSS/n) + pln(n)
Smaller values indicate that residuals fit a normal distribution better.
AIC and BIC add a penalty for more parameters.
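A sketch of the two formulas, where RSS is the residual sum of squares and p counts model parameters (residual values are hypothetical):

```python
import math

def aic_bic(residuals, p):
    # Smaller AIC/BIC -> residuals fit a normal distribution better;
    # both add a penalty that grows with the parameter count p.
    n = len(residuals)
    rss = sum(r * r for r in residuals)   # residual sum of squares
    aic = 2 * p + n * (math.log(2 * math.pi * rss / n) + 1)
    bic = n * math.log(rss / n) + p * math.log(n)
    return aic, bic
```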
Explain how to identify outliers graphically
Use a box-whisker plot:
The box spans the 25th to 75th percentiles
Whiskers extend to the largest values within 3 times the inter-quartile range
Values outside whiskers are outliers
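The whisker rule can be sketched as follows (data values are hypothetical; `statistics.quantiles` uses its default exclusive method):

```python
import statistics

def outliers(values, k=3.0):
    # Box-whisker rule: whiskers extend k x IQR beyond the quartile box;
    # anything outside the whiskers is flagged as an outlier.
    q1, q2, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    lo, hi = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lo or v > hi]
```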
Explain how to review the estimated-unpaid model results
- Standard error should increase from oldest to most recent years
- Standard error for all years should be larger than any individual year
- Coeff of variation should decrease from oldest to most recent years due to independence in incremental payment stream.
- A reversal in coeff of variation in recent years could be due to:
a) Increasing parameter uncertainty in more recent years
b) Model may overestimate uncertainty in recent years; we may want to switch to the BF or Cape Cod model
- Min/Max simulations should be reasonable
Explain 2 methods to combine results of multiple models
- Run models with same random variables
a) Simulate random variables for each iteration
b) Use same set of random variables for each model
c) Use model weights to weight incremental values from each model for each iteration by accident year
- Run models with independent random variables
a) Run each model separately with different random variables
b) Use weights to randomly select a model for each iteration by accident year so that the result is a weighted mixture of models
2 characteristics of estimated cash flow results
- Std error of calendar year unpaid decreases as calendar year increases in future
- Coeff of variation increases as calendar year increases
This is because the final payments projected farthest out will be the smallest and most uncertain
Explain how to estimate ultimate loss ratio
Estimated ultimate loss ratios by accident year are calculated using all simulated values, not just the future unpaid values.
Represents the complete variability in loss ratio for each accident year.
Loss ratio distributions can be used for projecting pricing risk.
2 issues with correlation methods
Both location mapping and re-sorting methods use residuals of incremental future losses to correlate segments.
Both tend to create overall correlations of close to zero.
For reserve risk, the correlation that is desired is between total unpaid amounts for two segments so there may be a disconnect.
Explain how to account for correlation between segments (2)
- Re-sorting
Use algorithms such as a Copula or Iman-Conover to add correlation
- Location Mapping
For each iteration, sample residuals from residual triangles using the same locations for all segments
3 advantages to account for correlation between segments using Re-sorting
- Data triangles can be different shapes/sizes by segment
- Can use different correlation assumptions
- Different correlation algorithms may have other beneficial impacts on aggregate distribution
Ex: can use a copula with heavy tail distribution to strengthen the correlation between segments in tails, which is important for risk-based capital modeling.
3 advantages to account for correlation between segments using location mapping
- Method is easily implemented
- Does not require an estimated correlation matrix
- Preserves the correlation of original residuals
2 disadvantages to account for correlation between segments using location mapping
- All segments need to have same size data triangles with no missing data
- Correlation of original residuals is used, so we cannot test other correlation assumptions
List potential data issues associated with applying ODP bootstrap model
- Negative incremental values
- Non-zero sum of residuals
- Using n-year weighted average
- Missing values
- Outliers
- Heteroscedasticity
- Heteroecthesious data
- Exposures changing over time
- Lack of extreme residuals
Briefly describe 4 diagnostic tests to evaluate the GLM model for reasonableness
- Standard error should increase from older to more recent AYs, because the std error should follow the size of the increasing unpaid loss reserve.
- Total, all-year std error should be larger than the standard error for any individual year.
- CV should decrease moving from older to more recent AYs, with a possible exception in the most recent year (where parameter uncertainty might overpower process uncertainty). This is because older years have a smaller loss reserve and few claim payments remaining.
- CV of total reserve should be less than any individual year because the model assumes accident years are independent.
Describe 2 practical limitations of a log-link GLM bootstrap modeling framework.
- It cannot handle negative incremental values
- GLM bootstrap must be solved at each iteration making it time consuming.