Shapland Flashcards
Property of the over-dispersed Poisson model
the fitted incremental claims will exactly equal the fitted incremental claims derived using the standard chain-ladder factors
Advantage of the ODP bootstrap model
Although sampling with replacement assumes the residuals are independent and identically distributed, it does not require the residuals to be normally distributed.
This allows the distributional form of the residuals to flow through the simulation process. (This is sometimes referred to as a 'semi-parametric' bootstrap model, since we are not parameterizing the residuals.)
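The resampling step above can be sketched in a few lines. This is an illustrative simplification, not Shapland's full algorithm: standardized residuals are sampled with replacement and inverted back to pseudo-incremental claims via q* = m + r * sqrt(m), where m is the fitted incremental value. The residual and fitted values below are made-up numbers.

```python
import math
import random

def resample_incrementals(residuals, fitted, rng):
    """Build one set of pseudo incremental claims from resampled residuals."""
    return [m + rng.choice(residuals) * math.sqrt(m) for m in fitted]

residuals = [-1.2, -0.4, 0.1, 0.5, 1.0]   # hypothetical standardized residuals
fitted = [100.0, 80.0, 60.0, 40.0]        # hypothetical fitted incrementals

rng = random.Random(42)
pseudo = resample_incrementals(residuals, fitted, rng)
print(pseudo)  # one sampled set of pseudo incremental claims
```

Because the residuals themselves are resampled, whatever distributional shape they carry (skewness, heavy tails) is passed through to the pseudo data without assuming normality.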
How to include process variance in the future incremental claims
We assume that each future incremental claim follows a gamma distribution.
This revised model incorporates process variance and parameter variance in the simulation of the historical and future data
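A minimal sketch of the process-variance step: each future incremental claim is drawn from a gamma distribution with mean m (the projected value) and variance phi * m, where phi is the scale parameter. The values of m and phi below are assumptions for illustration.

```python
import random

def simulate_incremental(m, phi, rng):
    """Draw one future incremental claim with mean m and variance phi * m."""
    shape = m / phi   # k = m / phi
    scale = phi       # theta = phi, so mean = k*theta = m, var = k*theta^2 = phi*m
    return rng.gammavariate(shape, scale)

rng = random.Random(1)
phi = 50.0            # hypothetical scale parameter
draws = [simulate_incremental(1000.0, phi, rng) for _ in range(10000)]
mean = sum(draws) / len(draws)
print(round(mean, 1))  # the simulated average should sit near the projected mean
```

Parameterizing the gamma by shape = m/phi and scale = phi is what ties the simulated variance back to the ODP assumption that variance is proportional to the mean.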
Approach 1 for modeling an unpaid loss distribution using incurred data
We run a paid data model in conjunction with the incurred data model.
Then we use the random payment pattern from each iteration of the paid data model to convert the ultimate values from each corresponding incurred model iteration to develop paid losses by AY.
Advantage: it allows us to use the case reserves to help predict the ultimate losses, while still focusing on the payment stream for measuring risk
An improvement to this approach would be the inclusion of correlation between the paid and incurred models
Approach 2 for modeling an unpaid loss distribution using incurred data
Apply the ODP bootstrap to the Munich chain-ladder (MCL) model. The MCL uses the inherent relationship/correlation between the paid and incurred losses to predict ultimate losses.
When paid losses are low relative to incurred losses, then future paid loss development tends to be higher than average. When paid losses are high relative to incurred losses, then future paid loss development tends to be lower than average.
2 advantages:
1. it does not require us to model paid losses twice.
2. it explicitly measures the correlation between paid and incurred losses
Issue with using the ODP bootstrap
Iterations for the latest few accident years tend to be more variable than what we would expect given the simulations for earlier accident years.
This is due to the fact that MORE age-to-age factors are used to extrapolate the sampled values to develop point estimates for each iteration.
How to fix the issue with the ODP bootstrap
Future incremental values can be extrapolated using the BF or Cape Cod method.
Two drawbacks of GLM bootstrap
- The GLM must be solved for each iteration of the bootstrap model, which may slow down the simulation
- The model is no longer directly explainable to others using age-to-age factors
4 benefits of GLM bootstrap
- Fewer parameters helps avoid over-parameterizing the model
- Gives us the ability to add parameters for calendar year trends.
- Gives us the ability to model data shapes other than triangles
- Allows us to match the model parameters to the statistical features found in the data, and to extrapolate those features
How do we produce point estimates using the GLM bootstrap model
Unlike the ODP bootstrap, which replicates the chain-ladder model, we do not apply age-to-age factors to each sample triangle to produce point estimates.
Instead, we fit the same GLM model underlying the residuals to each sample triangle. Then we use the resulting parameters to produce ultimates and reserve point estimates.
Drawback: the additional time required to fit a GLM to each sample triangle
3 options to deal with extreme outcomes
- identify the extreme iterations and remove them.
- Recalibrate the model (identify the source of the negative incremental losses and remove it if necessary)
- Limit incremental losses to zero
Should the residuals be adjusted so that their average is zero?
If the average of the residuals is positive, then re-sampling from the residuals will add variability to the resampled incremental losses. It may also cause the resampled incremental losses to have an average greater than the fitted losses. In this case, the residuals should be adjusted so that their average is zero.
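The zero-mean adjustment itself is a one-line shift; the residual values below are hypothetical.

```python
# Shift the residuals so their average is zero before resampling.
residuals = [-0.8, -0.2, 0.3, 0.9, 1.3]   # average is +0.3
mean = sum(residuals) / len(residuals)
adjusted = [r - mean for r in residuals]

print(sum(adjusted) / len(adjusted))       # zero, up to floating-point rounding
```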
Using an L-year weighted average for the GLM bootstrap
- We use L years of data by excluding the first few diagonals in the triangle (which leaves us with L+1 included diagonals)
- This changes the shape of the triangle to a trapezoid
- The excluded diagonals are given zero weight in the model and fewer calendar year parameters are required.
- When running the bootstrap simulations, we only need to sample residuals for the trapezoid that was used to parameterize the original model, since the GLM models incremental claims directly and can be parameterized using a trapezoid. Each parameter set is then used to project the sampled triangles to ultimate.
Using an L-year weighted average for the ODP bootstrap
- We calculate L year average factors instead of all year factors
- We exclude the first few diagonals when calculating residuals
- We still sample residuals for the entire triangle when running the bootstrap, since the ODP bootstrap requires cumulative values in order to calculate link ratios. Once we have cumulative values for each sample triangle, we use the L-year average factors to project the sample triangles to ultimate
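An L-year volume-weighted average factor only uses the latest L ratios contributing to each development column. A small sketch, with a hypothetical cumulative triangle:

```python
def l_year_factor(col_from, col_to, L):
    """Volume-weighted age-to-age factor using at most the latest L accident years."""
    pairs = list(zip(col_from, col_to))[-L:]
    return sum(b for _, b in pairs) / sum(a for a, _ in pairs)

# cumulative claims at ages 12 and 24 for four accident years (made up)
age_12 = [100.0, 110.0, 120.0, 130.0]
age_24 = [150.0, 160.0, 170.0, 180.0]

all_year = l_year_factor(age_12, age_24, L=4)    # 660 / 460
three_year = l_year_factor(age_12, age_24, L=3)  # 510 / 360
print(round(all_year, 4), round(three_year, 4))
```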
What do missing values affect
- Loss development factors
- fitted triangle (if the missing value lies on the last diagonal)
- Residuals
- Degrees of freedom
Dealing with missing values for ODP bootstrap
- Estimate the missing value using surrounding values
- Exclude the missing value when calculating the loss development factors. No corresponding residual will be calculated for the missing value. Similar to the L-year weighted average, we still sample residuals for the entire triangle. Once the sample triangles are calculated, we exclude the cells corresponding to the missing values from the projection process
- If the missing value lies on the last diagonal, we can either estimate the value OR use the value in the second-to-last diagonal to construct the fitted triangle
Dealing with missing values for GLM bootstrap
The missing data simply reduces the number of observations used in the model.
Similar to the ODP bootstrap, we could use any one of the three methods above to estimate the missing data
Managing outliers for ODP bootstrap
- Exclude the outliers completely (proceed in the same manner as a missing value)
- Exclude the outliers when calculating the age-to-age factors and the residuals (similar to missing values), BUT include the outlier cells during the sample triangle projection process. (remove the extreme impact of the incremental cell by excluding the outlier during the fitting process while still including some non-extreme variability by including the cell in the sample triangle projections)
3 options when excluding outliers to calculate age-to-age factors
- Exclude in the numerator
- Exclude in the denominator
- Exclude in the numerator and denominator
Managing outliers for GLM bootstrap
Outliers are treated similarly to missing data.
If the data is not considered representative of real variability, the outliers should be excluded and the model should be parameterized without them
What do we do if there are a significant number of outliers
- Might indicate that the model is a poor fit to the data
- For GLM, new parameters could be chosen OR the distribution of the error could be changed.
- For ODP, an L-year weighted average could be used to provide a better model fit.
3 options to adjust for heteroscedasticity
- Stratified sampling
- Calculating variance parameters
- Calculating scale parameters
Describe stratified sampling
- Group development periods with homogeneous variances
- Sample with replacement from the residuals in each group separately
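The two steps above can be sketched directly; the groupings and residual values are illustrative, not from any real triangle.

```python
import random

# Residuals grouped by development periods with similar variance (hypothetical).
groups = {
    "dev_1_2": [-1.1, -0.3, 0.4, 1.0],   # early, more variable periods
    "dev_3_plus": [-0.2, 0.0, 0.1],      # later, less variable periods
}

def stratified_sample(groups, rng):
    """Resample with replacement within each variance group separately."""
    return {g: [rng.choice(res) for _ in res] for g, res in groups.items()}

rng = random.Random(7)
sample = stratified_sample(groups, rng)
print(sample)
```

Because each group is resampled only from its own residuals, the high-variance periods can never draw a low-variance residual, and vice versa.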
Advantage of stratified sampling
It’s straightforward and easy to implement
Disadvantage of stratified sampling
Some groups may only have a few residuals in them, which limits the amount of variability in the possible outcomes
What is heteroecthesious data
incomplete or uneven exposures at interim evaluation dates
Describe partial first development period data
Occurs when the first development column has a different exposure period than the rest of the columns.
This is NOT a problem for parameterizing the ODP bootstrap model since the Pearson residuals use the square root of the fitted value to make them all exposure independent
How to adjust for the partial first development period data
In a deterministic analysis (not bootstrapping), the most recent accident year needs to be adjusted to remove exposures beyond the evaluation date. For example, with a 6/30 evaluation we can reduce the projected future payments by half to remove the exposures from 6/30 to 12/31.
During ODP bootstrap simulation process, we do the same thing. Once the projected future values have been reduced by half, we simulate the process variance as usual.
Alternatively, we can reduce the future values by half AFTER simulating the process variance
Describe the partial last calendar period data
Occurs when the latest diagonal only has a six-month development period
How to adjust for the partial last calendar period data
In a deterministic analysis, we can exclude the latest diagonal when calculating age-to-age factors, interpolate those factors for the exposures in the latest diagonal, and use the interpolated factors to project the future values.
When parameterizing the ODP bootstrap model, we annualize the exposures in the last diagonal to make them consistent with the rest of the triangle. The fitted triangle is calculated based on this annualized triangle to obtain residuals
During the ODP bootstrap simulation process, age-to-age factors are calculated from the annualized sample triangles and interpolated. Then, the latest diagonal in the sample triangle is adjusted back to a six month period. The cumulative values are then multiplied by the interpolated age-to-age factors to project future values. We must reduce the future values for the latest accident year by half
Exposure adjustments under the ODP bootstrap model
We divide the claim data by earned exposures for each AY. This normally improves the fit of the model.
The simulation process is then run on the adjusted data.
After the process variance step is completed, we multiply the results by the earned exposures to restate them in terms of total values
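The normalize-then-restate steps are simple to sketch; the exposures and claims below are hypothetical.

```python
exposures = {2021: 1000.0, 2022: 1250.0}       # earned exposures by AY
claims = {2021: 480_000.0, 2022: 615_000.0}    # claim data by AY

# Step 1: normalize — claims per unit of exposure.
per_unit = {ay: claims[ay] / exposures[ay] for ay in claims}

# ... run the bootstrap simulation (including process variance) on per_unit ...

# Step 2: restate simulated results in total-value terms.
restated = {ay: per_unit[ay] * exposures[ay] for ay in per_unit}
print(per_unit, restated)
```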
Exposure adjustments under the GLM bootstrap model
Similar to the ODP, the GLM model is fit to the exposure adjusted losses.
Main difference: exposure adjusted losses with HIGHER exposures are assumed to have LOWER variance when fitting the GLM.
Exposure adjustments could allow fewer AY parameters for the GLM bootstrap model
Selecting tail factors for ODP bootstrap model
The tail factor can be extrapolated from the data.
The standard deviation of the tail factor can be assumed to be 50% or less of (tail factor - 1).
Selecting tail factors for GLM bootstrap model
Assume that the final development period will continue to apply incrementally until its effect on the future incremental claims is negligible
Diagnostic tool 1 - Residual graphs
Testing the assumption that residuals are independent and identically distributed
We can graph the residuals by development period, accident period or calendar period or against the fitted incremental losses
Trends in residual graphs
We should be able to draw a relatively flat line through the residuals.
Residuals should appear random
Adjusting Heteroscedasticity in residual graphs
We should group residuals into hetero groups and adjust them to a common standard deviation.
To help visualize how the residuals should be grouped, we can graph the relative standard deviations and look for natural groupings
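A minimal sketch of the hetero adjustment: each group's residuals are rescaled to a common standard deviation (here, the all-group standard deviation). The groupings and values are hypothetical.

```python
from statistics import stdev

groups = {
    "low_var": [-0.2, -0.1, 0.1, 0.2],
    "high_var": [-1.5, -0.5, 0.5, 1.5],
}

all_res = [r for res in groups.values() for r in res]
target_sd = stdev(all_res)

# Scale each group by (target sd / group sd) so all groups share one sd.
adjusted = {
    g: [r * target_sd / stdev(res) for r in res]
    for g, res in groups.items()
}

print({g: round(stdev(res), 4) for g, res in adjusted.items()})
```

After adjustment all residuals can be sampled from one pool; the adjustment is reversed (divide by the same factors) when residuals are applied back to their cells.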
Diagnostic tool 2 - Normality test
Although the ODP model does not require residuals to be normally distributed, it’s still helpful to compare residuals against a normal distribution
This allows us to compare parameter sets and assess the skewness of the residuals.
This test uses both graphs AND calculated test values
Describe normality plots
If the data points are tightly distributed around the diagonal line, then the residuals can be assumed to be normally distributed
Describe calculated test values for testing normality
- P-value: the p-value should be large (greater than 5%); it is typically based on a Shapiro test for normality
- R^2: R^2 should be close to 1
- AIC & BIC: these adjust for the number of parameters used in the model; they should be small
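The R^2 metric for a normality plot can be sketched with the standard library alone: sort the residuals, pair them with standard normal quantiles at plotting positions (i + 0.5)/n, and compute the squared correlation. (The Shapiro p-value, AIC, and BIC are not reproduced here; the residual values are hypothetical.)

```python
from statistics import NormalDist, mean

def qq_r_squared(residuals):
    """Squared correlation between sorted residuals and normal quantiles."""
    xs = sorted(residuals)
    n = len(xs)
    nd = NormalDist()
    ys = [nd.inv_cdf((i + 0.5) / n) for i in range(n)]  # theoretical quantiles
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy ** 2 / (sxx * syy)

residuals = [-1.4, -0.9, -0.4, -0.1, 0.2, 0.5, 0.8, 1.3]
print(round(qq_r_squared(residuals), 4))  # near 1 for roughly normal residuals
```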
How to identify outliers
Use a box-whisker plot.
The values beyond the whiskers (which extend to the largest values within 3 times the inter-quartile range) are considered outliers
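The box-whisker flagging rule can be sketched as follows. The quartile calculation is a crude positional simplification and the sample values are made up; real implementations interpolate quartiles.

```python
def iqr_outliers(values, k=3.0):
    """Flag values more than k inter-quartile ranges beyond the quartiles."""
    xs = sorted(values)
    n = len(xs)
    q1, q3 = xs[n // 4], xs[(3 * n) // 4]   # crude quartile positions
    iqr = q3 - q1
    lo, hi = q1 - k * iqr, q3 + k * iqr
    return [x for x in xs if x < lo or x > hi]

values = [-0.9, -0.5, -0.2, 0.0, 0.1, 0.3, 0.6, 0.8, 9.5]
print(iqr_outliers(values))  # only the extreme 9.5 is flagged
```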
Describe the principle of parsimony
A model with fewer parameters is preferred as long as the goodness of fit is not markedly different
How to find the optimal mix of parameters in the GLM bootstrap model
- Start with a basic GLM model which includes one parameter for accident, development, and calendar period
- Check the residual plots. If it doesn’t look right, we add more parameters.
The implied development patterns for the GLM should look like a smoothed version of the ODP bootstrap chain-ladder development pattern
When reviewing the estimated unpaid model results
The standard error should increase when moving from the oldest years to the most recent years (because the standard error follows the magnitude of the results)
The total standard error should be larger than any individual error
The coefficient of variation should generally decrease when moving from the oldest years to the most recent years (because the older AYs have fewer payments remaining, which causes all of the variability to be reflected in the coefficient)
The total coefficient of variation should be smaller than any individual year’s coefficient of variation
The standard error (or coefficient of variation) for all years combined will be LESS than the sum of the standard errors (or coefficients of variation) for the individual years, because accident years are assumed to be independent
Why the coefficient of variation may rise in the most recent years
- With an increasing number of parameters in the model, parameter uncertainty increases when moving from the oldest years to the most recent years. This parameter uncertainty may overpower the process uncertainty, causing an increase in variability
- The model may simply be overestimating the variability in the most recent years. In this case, the BF or Cape Cod models may need to be used in place of the CL method.
Two methods for combining the results from multiple models
- Run models with the same random variables. Once all the models have been run, the incremental values for each model are weighted together (for each iteration by AY)
- Run models with independent random variables. once all the models have been run, the weights are used to select a model (for each iteration by AY) by randomly sampling the specified percentage of iterations from each model. The result is a weighted mixture of models
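The second method (independent random variables) can be sketched as follows: for each iteration, one model is selected at random in proportion to its weight, producing a weighted mixture of model results. Model outputs and weights below are hypothetical.

```python
import random

def mixture(model_results, weights, n_iter, rng):
    """Pick one model per iteration in proportion to the weights."""
    names = list(model_results)
    out = []
    for _ in range(n_iter):
        name = rng.choices(names, weights=[weights[n] for n in names])[0]
        out.append(rng.choice(model_results[name]))
    return out

models = {
    "chain_ladder": [100.0, 105.0, 110.0],        # simulated unpaid, model 1
    "bornhuetter_ferguson": [95.0, 98.0, 101.0],  # simulated unpaid, model 2
}
weights = {"chain_ladder": 0.5, "bornhuetter_ferguson": 0.5}

rng = random.Random(3)
combined = mixture(models, weights, n_iter=1000, rng=rng)
print(len(combined))
```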
How can we use a smoothed results after fitting the distribution
- Assess the quality of the fit
- Parameterize a DFA (dynamic financial analysis) model
- Estimate extreme values
- Estimate TVaR
Benefit of using smoothed results
Some of the random noise is prevented from distorting the calculations of specific metrics
Reviewing estimated cash flow results
For AYs, standard errors increase and CoVs decrease as we move from older to more recent years.
For CYs, standard errors decrease and CoVs increase as we move from older to more recent years.
How to simulate correlated variables
Using a multivariate distribution whose parameters and correlations have been specified.
However, we don’t know the distribution of each BU
2 correlation processes for the bootstrap model
- Location mapping
- Re-sorting
Describe location mapping
- Pick a BU
- For each iteration, sample residuals and note the location of each one in the original residual triangle
- Each of the other segments is then sampled using the residuals at the same locations in their respective residual triangles
This preserves the correlation of the original residuals in the sampling process
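A minimal sketch of location mapping, with two hypothetical business segments whose residual triangles share the same shape (as the method requires): the sampled cell positions are drawn once and reused for every segment.

```python
import random

# Residuals keyed by (accident period, development period) cell — made-up values.
seg_a = {(0, 0): -0.5, (0, 1): 0.2, (1, 0): 0.9}
seg_b = {(0, 0): -0.4, (0, 1): 0.3, (1, 0): 1.1}

rng = random.Random(11)
cells = list(seg_a)
positions = [rng.choice(cells) for _ in cells]  # sampled once, shared by all segments

sample_a = [seg_a[p] for p in positions]
sample_b = [seg_b[p] for p in positions]
print(positions, sample_a, sample_b)
```

Because both segments draw from the same positions, whatever correlation exists between their original residual cells is carried into the sampled sets.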
Pros and Cons of location mapping
Benefit: it can be easily implemented in a spreadsheet and it does not require us to estimate a correlation matrix.
Cons: it requires all of the business segments to have residual triangles that are the same size with no missing values or outliers
Describe re-sorting
To induce correlation among BUs in a bootstrap model, the residuals are re-sorted until the rank correlation between each business unit matches the desired correlation.
P-values can be calculated for each correlation coefficient to test its significance
Benefits of re-sorting
Residual triangles may have different shapes/sizes, different correlation assumptions may be employed AND different correlation algorithms may have beneficial impacts on the aggregate distribution
Cons of re-sorting
need to specify a desired correlation matrix
Advantages of the GLM framework
- Can tailor the model to the statistical features of the data
- Can use fewer parameters to avoid over-parameterization
- Can model data that’s not in a loss triangle
Disadvantages of the GLM Framework
- Simulation is slower because the GLM must be solved for in each iteration
- Can’t directly explain the model using LDFs
Advantages of the ODP bootstrap
- Can use the simpler LDF method and the model will still be based on the GLM framework
- Using LDFs makes the model more easily explainable to others
- The GLM uses a log-link and may not work with negative incremental values, but the simplified GLM will still produce a solution
Disadvantages of the ODP bootstrap
- Unable to adjust for calendar-year effects
- Requires many parameters and can over-fit the data