Shapland Flashcards
Property of the over-dispersed Poisson model
the fitted incremental claims will exactly equal the fitted incremental claims derived using the standard chain-ladder factors
Advantage of the ODP bootstrap model
Although sampling with replacement assumes the residuals are independent and identically distributed, it does not require the residuals to be normally distributed.
This allows the distributional form of the residuals flow through the simulation process. (this is sometimes referred to as a ‘semi-parametric’ bootstrap model since we are not parameterizing the residuals
How to include process variance in the future incremental claims
We assume that each future incremental claims follows gamma distribution.
This revised model incorporates process variance and parameter variance in the simulation of the historical and future data
Approach 1 for modeling an unpaid loss distribution using incurred data
We run a paid data model in conjunction with the incurred data model.
Then we use the random payment pattern from each iteration of the paid data model to convert the ultimate values from each corresponding incurred model iteration to develop paid losses by AY.
Advantage: it allows us to use the case reserves to help predict the ultimate losses, while still focusing on the payment stream for measuring risk
An improvement to this approach would be the inclusion of correlation between the paid and incurred models
Approach 2 for modeling an unpaid loss distribution using incurred data
Apply the ODP bootstrap to the Munich chain-ladder (MCL) model. The MCl uses the inherent relationship/correlation between the paid and incurred losses to predict ultimate losses.
When paid losses are low relative to incurred losses, then future paid loss development tends to be higher than average. When paid losses are high relative to incurred losses, then future paid loss development tends to be lower than average.
2 advantages:
1. it does not require us to model paid losses twice.
2. it explicitly measures the correlation between paid and incurred losses
Issue with using the ODP bootstrap
Iterations for the latest few accident years tend to be more variable than what we would expect given the simulations for earlier accident years.
This is due to the fact that MORE age-to-age factors are used to extrapolate the sampled values to develop point estimates for each iteration.
How to fix the issue with the ODP bootstrap
Future incremental values can be extrapolated using the BF or Cape Cod method.
Two drawbacks of GLM bootstrap
- The GLM must be solved for each iteration of the bootstrap model, which may slow down the simulation
- The model is no longer directly explainable to others using age-to-age factors
4 benefits of GLM bootstrap
- Fewer parameters helps avoid over-parametrizing the model
- Gives us the ability to add parameters for calendar year trends.
- Gives us the ability to model data shapes other than triangles
- Allows us to match the model parameters to the statistical features found in the data, and to extrapolate those features
How do we produce point estimates using the GLM bootstrap model
Unlike the ODP bootstrap that replicates the chain-ladder model we do not apply age-to-age factors to each sample triangle to produce point estimates.
Instead, we fit the same GLM model underlying the residuals to each sample triangle. Then we use the resulting parameters to produce ultimates and reserve point estimates.
Drawback: the additional time required to fit a GLM to each sample triangle
3 options to deal with extreme outcomes
- identify the extreme iterations and remove them.
- Recalibrate the model (identify the source of the negative incremental losses and remove it if necessary)
- Limit incremental losses to zero
Should the residuals be adjusted so that their average is zero?
If the average of the residuals is positive, then re-sampling from the residuals will add variability to the resampled incremental losses. I may also cause the resampled incremental losses to have an average greater than the fitted losses. In this case, the residuals should be adjusted.
Using an L-year weighted average for the GLM bootstrap
- We use L years of data by excluding the first few diagonals in the triangle (which leaves us with L+1 included diagonals)
- This changes the shape of the triangle to a trapezoid
- The excluded diagonals are given zero weight in the model and fewer calendar year parameters are required.
- When running the bootstrap simulations, we only need to sample residuals for the trapezoid that was used to parametrize the original model. Because the GLM models incremental claims directly and can be parameterized using a trapezoid. Each parameter set is then used to project the sampled triangles to ultimate.
Using an L-year weighted average for the ODP bootstrap
- We calculate L year average factors instead of all year factors
- We exclude the first few diagonals when calculating residuals
- We still sample residuals for the entire triangle when running bootstrap. Because the ODP bootstrap requires cumulative values in order to calculate link ratios. Once we have cumulative values for each sample triangle, we use L-year average factors to project the sample triangles to ultimate
What does missing values affect
- Loss development factors
- fitted triangle (if the missing value lies on the last diagonal)
- Residuals
- Degree of freedom
Dealing with missing values for ODP bootstrap
- Estimate the missing value using surrounding values
- Exclude the missing value when calculating the loss development factors. No corresponding residual will be calculated for the missing value. Similar to the L-year weighted average, sample for the entire triangle. Once the sample triangles are calculated, we should exclude the cells corresponding to the missing values from the projection process
- If the missing value lies on the last diagonal, we can either estimate the value OR we can use the value in the second to the last diagonal to contract the fitted triangle
Dealing with missing values for GLM bootstrap
The missing data simply reduced the number of observation used in the model.
Similar to ODP, we could use any one of the 3 method to estimate the missing data
Managing outlies for ODP bootstrap
- Exclude the outliers completely (proceed in the same manner as a missing value)
- Exclude the outliers when calculating the age-to-age factors and the residuals (similar to missing values), BUT include the outlier cells during the sample triangle projection process. (remove the extreme impact of the incremental cell by excluding the outlier during the fitting process while still including some non-extreme variability by including the cell in the sample triangle projections)
3 options when excluding outliers to calculate age-to-age factors
- Exclude in the numerator
- Exclude in the denominator
- Exclude in the numerator and denominator
Managing outliers for GLM bootstrap
Outliers are treated similarly to missing data.
If the data is not considered representative of real variability, the outliers should be excluded and the model should be parameterized without it
What do we do if there are a significant number of outliers
- Might indicate that the model is a poor fit to the data
- For GLM, new parameters could be chosen OR the distribution of the error could be changed.
- For ODP, an L-year weighted average could be used to provide a better model fit.
3 options to adjust for heteroscedasticity
- Stratified sampling
- Calculating variance parameters
- Calculating scale parameters
Describe stratified sampling
- Group development periods with homogeneous variances
- Sample with replacement from the residuals in each group separately
Advantage of stratified sampling
It’s straightforward and easy to implement