Shapland Flashcards
Notation
w = AY from 1 to n
d = development period from 1 to n
k = diagonal
c(w,d) = cumulative losses
q(w,d) = incremental losses
f(d) = factor applied to c(w,d) to estimate q(w,d+1)
F(d) = factor applied to c(w,d-1) to estimate c(w,d)
ODP model
Uses a GLM to model incremental claims using a log link function and an ODP distribution
Linear predictor = alpha_w + SUM(beta_i), where the sum runs over development periods i = 2 through d
with alpha_1 > 0 and beta_1 = 0
Development Factors
F(d) = SUM(c(w,d)) / SUM(c(w,d-1))
Ultimate Claims
c(w,n) = c(w,d) * PROD(F(i)) for i = d+1 to n
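As a worked sketch of these two formulas, here is a small numpy example using a hypothetical 4x4 cumulative triangle (the values are illustrative only):

```python
import numpy as np

# Hypothetical 4x4 cumulative triangle; np.nan marks unobserved future cells
c = np.array([
    [100.0, 180.0, 220.0, 240.0],
    [110.0, 200.0, 245.0, np.nan],
    [120.0, 215.0, np.nan, np.nan],
    [130.0, np.nan, np.nan, np.nan],
])
n = c.shape[0]

# Volume-weighted age-to-age factors: F(d) = SUM(c(w,d)) / SUM(c(w,d-1)),
# summed over the accident years with both columns observed
F = []
for d in range(1, n):
    rows = ~np.isnan(c[:, d])
    F.append(c[rows, d].sum() / c[rows, d - 1].sum())

# Ultimate claims: latest observed cumulative times the product of the remaining factors
ultimates = [c[w, n - 1 - w] * np.prod(F[n - 1 - w:]) for w in range(n)]
print(np.round(F, 4), np.round(ultimates, 1))
```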
Expected Value and Variance of Incremental Claims
E(q(w,d)) = m_w,d
Var(q(w,d)) = scale parameter * m_w,d^z
m_w,d = exp(linear predictor)
Z = 0 -> Normal
Z = 1 -> ODP
Z = 2 -> Gamma
Z = 3 -> Inverse Gaussian
Estimating the 𝛼 & 𝛽 Parameters Using a GLM
- Apply the log link function to the incremental loss triangle
- Specify the GLM using a system of equations with 𝛼 and 𝛽 parameters
- Convert to matrix notation 𝑌=𝑋𝐴, where 𝑌 is the matrix of log-incremental data, 𝑋 is the design matrix, and 𝐴 is the matrix of 𝛼 & 𝛽 parameters
- Use a numerical technique to solve for the model parameters in the 𝐴 matrix to minimize the squared difference between the 𝑌 matrix and the solution matrix
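A rough sketch of this setup for a hypothetical 3x3 incremental triangle. It builds the design matrix X (one column per alpha_w, plus beta columns that accumulate through development period d, which is one way to read the SUM in the linear predictor above) and solves for A by least squares on the log data, as described in the last step; a full GLM fit would instead maximize the ODP likelihood.

```python
import numpy as np

# Hypothetical 3x3 incremental triangle, observed cells only: (w, d) -> q(w, d)
q = {(1, 1): 100.0, (1, 2): 50.0, (1, 3): 20.0,
     (2, 1): 110.0, (2, 2): 55.0,
     (3, 1): 120.0}

cells = sorted(q)
Y = np.log([q[c] for c in cells])            # Y matrix: log-incremental data

# Design matrix X: columns 0-2 are alpha_1..alpha_3, columns 3-4 are beta_2, beta_3
# (beta_1 = 0, so development period 1 gets no beta column)
X = np.zeros((len(cells), 5))
for i, (w, d) in enumerate(cells):
    X[i, w - 1] = 1.0                        # alpha_w indicator
    X[i, 3:3 + (d - 1)] = 1.0                # beta_2 .. beta_d indicators

# Solve for A to minimize the squared difference between Y and X @ A
A, *_ = np.linalg.lstsq(X, Y, rcond=None)
alpha, beta = A[:3], A[3:]
fitted = np.exp(X @ A)                       # fitted incremental claims m(w, d)
```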
ODP Bootstrap Model
The fitted incremental claims will exactly equal those implied by the standard CL method. As a result:
- A simple link ratio algorithm can be used in place of the more complicated GLM algorithm, while still maintaining an underlying GLM framework
- The use of the age-to-age factors serves as a bridge to the deterministic framework; this allows the model to be more easily explained
- In general, the log link function does not work for negative incremental claims; using link ratios remedies this problem
Limitations:
- Does not adjust for CY effects
- Since it includes a parameter for each AY and development period beyond the first period, it might over-fit the data
Unscaled Pearson Residuals
r_w,d = (q(w,d) - m_w,d) / SQRT(m_w,d^z)
scale parameter = SUM(r_w,d^2) / (N - p)
N = # of data cells in the triangle
p = # of parameters = 2 * # AYs - 1
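A small sketch of these two formulas in numpy, assuming z = 1 (ODP) and that q and m are arrays of actual and fitted incremental claims over the observed cells (the numbers are illustrative):

```python
import numpy as np

def unscaled_pearson_residuals(q, m, n_ays, z=1.0):
    """Unscaled Pearson residuals and the scale parameter.

    q, m  : actual and fitted incremental claims for the observed cells
    n_ays : number of accident years, so p = 2 * n_ays - 1 parameters
    """
    r = (q - m) / np.sqrt(m ** z)          # r(w,d) = (q - m) / sqrt(m^z)
    N, p = len(q), 2 * n_ays - 1
    scale = np.sum(r ** 2) / (N - p)       # scale parameter (phi)
    return r, scale

q = np.array([100.0, 50.0, 20.0, 110.0, 55.0, 120.0])
m = np.array([102.0, 48.0, 20.0, 108.0, 57.0, 122.0])
r, phi = unscaled_pearson_residuals(q, m, n_ays=3)
```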
Distribution of the Residuals
Although sampling with replacement assumes the residuals are independent and identically distributed (i.i.d.), it does not require the residuals to be normally distributed
- This is an advantage of the ODP bootstrap model since the actual distributional form of the residuals will flow through the simulation process
- This is sometimes referred to as a “semi-parametric” bootstrap model since we are not parameterizing the residuals
Sample Triangles
Sampling with replacement from the residuals can be used to create new sample triangles of incremental claims
q*(w,d) = r * SQRT(m_w,d^z) + m_w,d
where r is a randomly selected residual from the sample
The new sample triangles can then be cumulated, and age-to-age factors can be calculated and applied to calculate reserve point estimates and produce a distribution of point estimates
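A sketch of the resampling step, assuming residuals holds the residual pool to sample from and m is the fitted incremental triangle (z = 1):

```python
import numpy as np

rng = np.random.default_rng()

def sample_incremental_triangle(residuals, m, z=1.0):
    """One sampled triangle: q*(w,d) = r * sqrt(m(w,d)^z) + m(w,d),
    where each r is drawn with replacement from the residual pool."""
    r_star = rng.choice(residuals, size=m.shape, replace=True)
    return r_star * np.sqrt(m ** z) + m

# Each sampled triangle is then cumulated, age-to-age factors are recalculated,
# and a reserve point estimate is produced for that iteration.
# q_star = sample_incremental_triangle(residuals, m)
```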
Process Variance in the Future Incremental Losses
The distribution of point estimates does not fully capture the predictive distribution of future losses:
- incorporates process variance and parameter variance in the simulation of the historical data
- fails to incorporate process variance in the simulation of the future data
Rather than projecting each incremental loss using age-to-age factors, we sample from a Gamma distribution with mean m_w,d and variance scale parameter * m_w,d
- GAMMAINV(RAND(), alpha, beta)
- alpha = m_w,d / scale and beta = scale
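The same draw in numpy terms (Excel's GAMMAINV(RAND(), alpha, beta) corresponds to numpy's shape and scale arguments):

```python
import numpy as np

rng = np.random.default_rng()

def simulate_future_incrementals(m_future, phi):
    """Draw each future incremental from a Gamma with mean m and variance
    phi * m, i.e. shape = m / phi and scale = phi."""
    return rng.gamma(shape=np.asarray(m_future) / phi, scale=phi)

# Illustrative: two projected future cells with means 40 and 15, phi = 2.5
draws = simulate_future_incrementals([40.0, 15.0], phi=2.5)
```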
Degrees of Freedom Adjustment Factor
Multiply the distribution of point estimates by the degrees of freedom adjustment factor and the scale parameter
f_DOF = SQRT(N/(N - p))
Allows for over-dispersion of the residuals in the sampling process and adds process variance to future incremental losses
Scaled Pearson Residuals
Another way to add over-dispersion in the residuals is by multiplying the unscaled Pearson residuals by the degrees of freedom adjustment factor
rs_w,d = r_w,d * f_DOF = r_w,d * SQRT(N/(N - p))
Standardized Residuals
rh_w,d = r_w,d * fh_w,d
fh_w,d = SQRT(1 / (1 - H_i,i))
where H_i,i is the ith point on the diagonal of the hat matrix
Considered as a replacement for and an improvement over the degrees of freedom adjustment factor
Standardized residuals are what should be sampled from when running the ODP bootstrap model
Approximating the Scale Parameter
Calculated using the unscaled Pearson residuals r_w,d
Can approximate using the standardized residuals rh_w,d
scale parameter ≈ SUM(rh_w,d^2) / N
Bootstrapping the Incurred Loss Triangle
- Run a paid data model in conjunction with an incurred model. Use the random payment pattern from each iteration of the paid data model to convert the ultimate values from each corresponding incurred model iteration to develop paid losses by AY.
- allows us to use the case reserves to help predict the ultimate losses, while still focusing on the payment stream for measuring risk
- Apply the ODP bootstrap to the Munich Chain-Ladder (MCL) model. The MCL uses the inherent relationship/correlation between paid and incurred losses to predict ultimate losses. When paid losses are low relative to incurred losses, then future paid loss development tends to be higher than average.
- does not require us to model paid losses twice
- explicitly measures the correlation between paid and incurred losses
Bootstrapping the BF and Cape Cod Models
An issue with using the ODP bootstrap process is that iterations for the latest few accident years tend to be more variable than what we would expect given the simulations for earlier AYs
To address this issue, future incremental values can be extrapolated using the BF or Cape Cod method
GLM Bootstrap Model
Limitations:
- GLM must be solved for each iteration of the bootstrap model, which may slow down the simulation
- The model is no longer directly explainable to others using age-to-age factors
Benefits:
- We can specify fewer parameters, which helps avoid over-fitting
- We can add parameters for CY trends
- We can model shapes other than triangles
- We can match the model parameters to the statistical features found in the data
To calculate future incremental claims:
- Fit the same GLM model underlying the residuals to each sample triangle
- Use the resulting parameters to produce reserve point estimates
Negative Incremental Values–GLM Bootstrap Model
Negative incremental values can be problematic when parameterizing a GLM with a log link function since we cannot take the log of a negative number
- Modified Log-Link Function
- Used if the sum of incremental values in the column is positive
- For q(w,d) > 0, use ln(q(w,d))
- For q(w,d) = 0, use 0
- For q(w,d) < 0, use -ln[ABS(q(w,d))]
- Subtracting the Largest Negative Value
- Used if the sum of incremental values in the column is negative
- Subtract the largest negative value from each incremental in the triangle
- Solve for the GLM parameters based on the logs of the adjusted incremental values
- Adjust the resulting fitted incremental values by adding the largest negative value
The ODP bootstrap (based on link ratios) doesn't have this issue, but both models need to use modified unscaled Pearson residuals to deal with negative fitted values (take the absolute value of the fitted value under the square root); both adjustments are sketched below
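A sketch of the modified log-link and the modified unscaled Pearson residual described above:

```python
import numpy as np

def modified_log_link(q):
    """ln(q) if q > 0, 0 if q == 0, -ln(|q|) if q < 0."""
    q = np.asarray(q, dtype=float)
    out = np.zeros_like(q)
    out[q > 0] = np.log(q[q > 0])
    out[q < 0] = -np.log(np.abs(q[q < 0]))
    return out

def modified_pearson_residual(q, m, z=1.0):
    """Take the absolute value of the fitted value under the square root
    so that negative fitted values do not break the residual calculation."""
    return (q - m) / np.sqrt(np.abs(m) ** z)
```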
Negative Values & Extreme Outcomes
Negative incremental values can cause extreme outcomes due to large age-to-age factors
- Identify the extreme iterations and remove them
- Recalibrate the model
- Limit incremental losses to zero
Missing Values
ODP:
- Estimate the missing value using surrounding values
- Exclude the missing value when calculating the loss development factors
- If the missing value lies on the last diagonal, we can either estimate the value or we can use the value in the second to last diagonal to construct the fitted triangle
GLM:
- The missing data simply reduces the number of observations used in the model
- Could use one of the methods described above to estimate the missing data if desired
Outliers in the Original Dataset
ODP:
- Exclude the outliers completely
- Exclude the outliers when calculating the age-to-age factors and the residuals (similar to missing values), but include the outlier cells during the sample triangle projection process (different from missing values)
GLM:
- Treated like missing data
- If the data is not considered representative of real variability, the outlier should be excluded, and the model should be parameterized without it
Heteroscedasticity
Non-constant variance of the residuals
This is an issue because the model assumes residuals are iid when we sample from the residuals
- Stratified Sampling
- Group development (or accident) periods with homogeneous variances and sample in each group separately
- Straightforward and easy to implement
- Some groups may only have a few residuals in them, which limits the amount of variability in the possible outcomes
- Calculating Variance Parameters (sketched after this list)
- Group development (or accident) periods with homogeneous variances and calculate the standard deviation of each hetero group using standardized residuals
- Hetero adjustment factor for each group h_i = SD(all residuals) / SD(group i residuals)
- Multiply each residual by h_i so we can sample with replacement from among all residuals
- Divide the resampled residuals by h_i when calculating fitted values (i being the placement of the value in the new triangle, not the placement of the sampled residual)
- Calculating Scale Parameters
- Group development (or accident) periods with homogeneous variances and calculate each hetero group’s scale parameter using the unscaled Pearson residuals
- Scale_i = N/(N-p) * SUM(residuals^2) / n_i
- Include (number of hetero groups - 1) additional parameters in the parameter count p
- h_i = SQRT(scale) / SQRT(scale_i)
- Multiply each standardized residual by h_i
- Divide the resampled residuals by h_i when calculating fitted values
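A sketch of the hetero-adjustment factors from the Calculating Variance Parameters option above, assuming residuals is an array of standardized residuals and groups assigns each residual to a hetero group:

```python
import numpy as np

def hetero_adjustment_factors(residuals, groups):
    """h_i = SD(all residuals) / SD(group i residuals) for each hetero group."""
    sd_all = np.std(residuals)
    return {g: sd_all / np.std(residuals[groups == g]) for g in np.unique(groups)}

# Multiply each residual by its group's h_i before pooling and resampling;
# divide each resampled residual by the h_i of the cell it lands in.
residuals = np.array([1.2, -0.8, 0.5, 3.0, -2.5, 2.1])
groups = np.array([1, 1, 1, 2, 2, 2])
h = hetero_adjustment_factors(residuals, groups)
```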
Exposure Adjustment
ODP:
- Divide the claim data by earned exposures for each accident year
- The entire bootstrap process is run on the exposure-adjusted data
GLM:
- Model is fit to the exposure-adjusted losses
- The fit is exposure-weighted, meaning that exposure-adjusted losses with higher exposure are assumed to have a lower variance
- The exposure adjustment could allow fewer AY parameters to be used
Tail Factors
ODP:
- Instead of using a deterministic tail factor, we can assign a distribution (ex. normal or lognormal) to the tail factor parameter
GLM:
- Assume that the final development period will continue to apply incrementally until its effect on the future incremental claims is negligible
Fitting a Distribution to the Residuals
If we believe that extreme observations are not captured well in the loss triangle, we can parameterize a distribution for the residuals (such as normal) and resample using that distribution - known as a parametric bootstrap
Residual Graphs
Can be used to test the assumption that residuals are i.i.d.
Should be able to draw a relatively horizontal line through the residuals
Should see constant spread (homoscedasticity)
Normality Test
Although we do not require the residuals to be normally distributed, it is still helpful to compare them against a normal distribution; this allows us to compare parameter sets and assess the skewness of the residuals. The test should be run both before and after heteroscedasticity adjustments.
Normality Plot:
- Shows the relationship between theoretical quantiles of a standard normal distribution (i.e., the 𝑥-axis) and the empirical quantiles of the observed data (i.e., the 𝑦-axis)
- If the data is normally distributed, then points will lie in a diagonal line
Test Values:
- 𝒑-value from Shapiro's Test for Normality should be greater than 5% (a higher 𝒑-value indicates residuals closer to normal)
- R^2 should be close to 1
- AIC/BIC should be small; the RSS in these formulas is the sum of squared differences between each residual and its normal counterpart from the normality plot
- AIC = 2p + n * [ln((2 * pi * RSS) / n) + 1]
- BIC = n * ln(RSS / n) + p * ln(n)
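A sketch of these test values, assuming the residuals are already standardized and using (i - 0.5)/n plotting positions for the normal counterparts (the exact plotting-position convention is an assumption here):

```python
import numpy as np
from scipy import stats

def normality_test_values(residuals, p):
    """Shapiro p-value, normality-plot R^2, and AIC/BIC based on the RSS
    between each sorted residual and its normal counterpart."""
    res = np.sort(np.asarray(residuals, dtype=float))
    n = len(res)
    _, shapiro_p = stats.shapiro(res)
    # Normal counterparts: theoretical quantiles at plotting positions (i - 0.5)/n
    normal_q = stats.norm.ppf((np.arange(1, n + 1) - 0.5) / n)
    r2 = np.corrcoef(normal_q, res)[0, 1] ** 2
    rss = np.sum((res - normal_q) ** 2)
    aic = 2 * p + n * (np.log(2 * np.pi * rss / n) + 1)
    bic = n * np.log(rss / n) + p * np.log(n)
    return shapiro_p, r2, aic, bic
```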
Outliers
Use box-whisker plots of the standardized residuals
When residuals are not normally distributed, outliers tend to be more common and we don’t necessarily want to remove them because they are representative of the shape of the data
How to find the optimal mix of parameters in the GLM bootstrap model
- Start with a "basic" GLM model which includes one parameter each for accident, development, and calendar periods
- Check the residual plots for the basic model
- If the residuals by accident period, development period, and calendar period are not randomly scattered around zero, then we should consider adding parameters
- If certain parameters are not statistically significant, we should remove them
- The implied development pattern should look like a smoothed version of the CL development pattern
- We keep adding/removing parameters until we see a proper residual plot
Estimated Unpaid Model Results
Standard Error:
- Should increase from older to more recent years because the standard error follows the magnitude of the unpaid losses, which is larger for more recent years
- Total standard error should be larger than the standard error for any individual year
- Total standard error should be less than the sum of the individual standard errors (although this can also be true for unreasonable models)
CoV:
- Should generally decrease from older to more recent years because older years have a small remaining reserve with few claim payments left, so variability is high relative to the mean
- Parameter uncertainty may overpower process uncertainty, leading to an increased CoV in the most recent years
- Total CoV should be less than any individual year because the model assumes independent AYs
Run Models with the Same Random Variables
Each model is run with the exact same random variables (i.e., random residuals in terms of position)
Once all of the models have been run, the incremental values for each model are weighted together for each iteration by AY
Causes correlation in model results since each model is run using the same set of random residuals
Run Models with Independent Random Variables
Each model is run with its own random variables (i.e., different samples of random residuals in terms of position)
Once all of the models have been run, the weights are used to select a model for each iteration by AY
For example, suppose we are estimating unpaid losses using CL and BF bootstrap models. Further suppose the weights are 25% and 75%, respectively. For each iteration by AY, we would draw a uniform random variable on (0, 1). If the drawn uniform random variable is “<0.25”, then the CL unpaid losses would be used for that iteration/AY combination. Otherwise, the BF unpaid losses would be used. The result is a weighted mixture of models, where the CL and BF model results represent approximately 25% and 75% of the iterations by AY, respectively.
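A sketch of that selection step, assuming cl_unpaid and bf_unpaid are arrays of simulated unpaid losses by iteration for a single AY:

```python
import numpy as np

rng = np.random.default_rng()

def mix_models(cl_unpaid, bf_unpaid, w_cl=0.25):
    """For each iteration, keep the CL result with probability w_cl and the
    BF result otherwise, giving a weighted mixture of the two models."""
    u = rng.uniform(size=len(cl_unpaid))
    return np.where(u < w_cl, cl_unpaid, bf_unpaid)

# Illustrative simulated unpaid losses (10,000 iterations from each model)
cl_unpaid = rng.gamma(shape=5.0, scale=100.0, size=10_000)
bf_unpaid = rng.gamma(shape=6.0, scale=90.0, size=10_000)
mixed = mix_models(cl_unpaid, bf_unpaid)
```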
Location Mapping
For each iteration:
1. Sample residuals for segment 1
2. Track the location in the residual triangle where each sampled residual was taken
3. For all other segments, sample the residuals from their residuals triangles using the same locations
Advantages:
- Easily implemented
- Does not require an estimated correlation matrix
Disadvantages:
- Requires all business segments to have the same size data triangles with no missing data
- Since the correlation of the original residuals is used, we cannot test other correlation assumptions for stress testing purposes
Re-sorting
Re-sorting relies on algorithms such as Iman-Conover (rank correlation algorithm) or copulas to induce a desired correlation
For example, we could induce correlation among business segments by re-sorting the residuals until the rank correlation between each business segment matches the desired correlation specified by a correlation matrix
Advantages:
- Data triangles can be different shapes/sizes by segment
- Can use different correlation assumptions
- Different correlation assumptions may have other beneficial impacts on the aggregate distribution
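A minimal sketch of re-sorting using a normal copula as a simple stand-in for Iman-Conover: each segment's simulated results are re-ordered so that their ranks follow correlated normal draws.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng()

def resort_to_correlation(segment_sims, corr):
    """Re-sort independent simulations (iterations x segments) so their rank
    correlation approximately matches the target correlation matrix."""
    n, k = segment_sims.shape
    z = rng.multivariate_normal(np.zeros(k), corr, size=n)   # correlated normals
    ranks = np.argsort(np.argsort(z, axis=0), axis=0)        # target rank ordering
    out = np.empty_like(segment_sims)
    for j in range(k):
        out[:, j] = np.sort(segment_sims[:, j])[ranks[:, j]]
    return out

# Two segments with a target rank correlation of roughly 0.6
sims = np.column_stack([rng.gamma(5.0, 100.0, 10_000), rng.gamma(8.0, 50.0, 10_000)])
corr = np.array([[1.0, 0.6], [0.6, 1.0]])
correlated = resort_to_correlation(sims, corr)
rho, _ = stats.spearmanr(correlated[:, 0], correlated[:, 1])
```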
Why modelling paid losses/claim counts may be preferable to modelling reported losses using an ODP
The variance of the ODP doesn’t allow for negative expected incremental losses (would give negative variance)
Paid losses and claim counts are less likely to show negative incremental development, which is more likely for reported losses