Shapland & Leong Flashcards
Provide two advantages of bootstrapping.
⇧ They allow us to calculate how likely it is that the ultimate value of the claims will exceed a certain amount
⇧ They are able to reflect the general skewness of insurance losses
Provide one disadvantage of bootstrapping.
They are more complex than other models and more time-consuming to create
Describe how using the over-dispersed Poisson model to model incremental claims relates a GLM to the standard chain-ladder method.
If we start with the latest diagonal and divide backwards successively by each age-to-age factor, we obtain fitted cumulative claims. Using subtraction, the fitted cumulative claims can be used to determine the fitted incremental claims. These fitted incremental claims exactly match those obtained using the over-dispersed Poisson model
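A minimal sketch of this back-out in Python (the 4x4 cumulative triangle is hypothetical illustration data, not from the source):

```python
import numpy as np

# Hypothetical cumulative triangle; np.nan marks unobserved future cells
cum = np.array([
    [100., 150., 165., 170.],
    [110., 168., 185., np.nan],
    [115., 170., np.nan, np.nan],
    [125., np.nan, np.nan, np.nan],
])
n = cum.shape[0]

# Volume-weighted all-year age-to-age factors
ata = [np.nansum(cum[:n - 1 - j, j + 1]) / np.nansum(cum[:n - 1 - j, j])
       for j in range(n - 1)]

# Start at the latest diagonal and divide backwards by each factor
fitted_cum = np.full_like(cum, np.nan)
for i in range(n):
    last = n - 1 - i                    # column of the latest diagonal
    fitted_cum[i, last] = cum[i, last]
    for j in range(last - 1, -1, -1):
        fitted_cum[i, j] = fitted_cum[i, j + 1] / ata[j]

# Fitted incrementals via subtraction; these match the ODP GLM fit
fitted_incr = np.diff(fitted_cum, axis=1, prepend=0.0)
```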
Briefly describe three important outcomes from this relationship. (ODP & CL)
⇧ A simple link ratio algorithm can be used in place of the more complicated GLM algorithm, while still maintaining an underlying GLM framework
⇧ The use of the age-to-age factors serves as a bridge to the deterministic framework. This allows the model to be more easily explained
⇧ In general, the log link function does not work for negative incremental claims. Using link ratios remedies this problem
Fully describe the bootstrapping process. Assume the data does not require any modifications.
⇧ Calculate the fitted incremental claims using the GLM framework or the chain-ladder age-to-age factors
⇧ Calculate the residuals between the fitted incremental claims and the actual incremental claims
⇧ Create a triangle of random residuals by sampling with replacement from the set of non-zero residuals
⇧ Create a sample incremental triangle using the random residual triangle
⇧ Accumulate the sample incremental triangle to create a sample cumulative triangle
⇧ Project the sample cumulative data to ultimate using the chain-ladder method
⇧ Calculate the reserve point estimate for each accident year using the projected data
⇧ Iterate through this process to create a distribution of reserves for each accident year
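A minimal sketch of this loop, using unscaled Pearson residuals and omitting the degrees-of-freedom, hat-matrix, and process-variance refinements discussed elsewhere in these cards; the triangle layout matches the earlier sketch:

```python
import numpy as np

rng = np.random.default_rng(42)

def odp_bootstrap(cum, n_sims=1000):
    """Sketch of the bootstrap loop; `cum` is a cumulative triangle
    with np.nan in the unobserved cells. Returns total reserves only,
    though the same loop can store each accident year separately."""
    n = cum.shape[0]
    mask = ~np.isnan(cum)

    # Fitted incrementals, backed out of the all-year weighted factors
    ata = [np.nansum(cum[:n - 1 - j, j + 1]) / np.nansum(cum[:n - 1 - j, j])
           for j in range(n - 1)]
    fit = np.full_like(cum, np.nan)
    for i in range(n):
        fit[i, n - 1 - i] = cum[i, n - 1 - i]
        for j in range(n - 2 - i, -1, -1):
            fit[i, j] = fit[i, j + 1] / ata[j]
    incr = np.diff(cum, axis=1, prepend=0.0)
    fit_incr = np.diff(fit, axis=1, prepend=0.0)

    # Unscaled Pearson residuals; sample only the non-zero ones
    res = (incr - fit_incr) / np.sqrt(np.abs(fit_incr))
    pool = res[mask]
    pool = pool[pool != 0]

    reserves = np.zeros(n_sims)
    for s in range(n_sims):
        # Random residual triangle -> sample incrementals -> cumulatives
        r = rng.choice(pool, size=cum.shape, replace=True)
        s_incr = fit_incr + r * np.sqrt(np.abs(fit_incr))
        s_cum = np.where(mask,
                         np.cumsum(np.where(mask, s_incr, 0.0), axis=1),
                         np.nan)
        # Chain-ladder projection from the sampled triangle
        f = [np.nansum(s_cum[:n - 1 - j, j + 1]) / np.nansum(s_cum[:n - 1 - j, j])
             for j in range(n - 1)]
        for i in range(n):
            ult = s_cum[i, n - 1 - i]
            for j in range(n - 1 - i, n - 1):
                ult *= f[j]
            reserves[s] += ult - s_cum[i, n - 1 - i]
    return reserves  # distribution of total reserve point estimates
```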
Identify the assumptions underlying the residual sampling process.
The residual sampling process assumes that the residuals are independent and identically distributed. However, it does NOT require the residuals to be normally distributed
Explain why these residual sampling assumptions are advantageous.
This is an advantage since the distributional form of the residuals will flow through the simulation process
Briefly describe two uses of the degrees of freedom adjustment factor.
⇧ The distribution of reserve point estimates from the sample triangles could be multiplied by the degrees of freedom adjustment factor to allow for over-dispersion of the residuals in the sampling process
⇧ The Pearson residuals could be multiplied by the degrees of freedom adjustment factor to correct for a bias in the residuals
Identify a downfall of the degrees of freedom adjustment factor and state how the issue can be remedied.
⇧ The degrees of freedom bias correction does not create standardized residuals. This is important because standardized residuals ensure that each residual has the same variance
⇧ To remedy this, a hat matrix adjustment factor must be applied to the unscaled Pearson residuals to calculate the standardized residuals
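A minimal sketch of both adjustments, assuming N residuals, p fitted parameters, and a hypothetical vector h of hat-matrix diagonal (leverage) values:

```python
import numpy as np

def dof_adjusted(res, N, p):
    # Degrees-of-freedom adjustment factor sqrt(N / (N - p)); corrects
    # the bias in the residuals but does not standardize them
    return res * np.sqrt(N / (N - p))

def hat_standardized(res, h):
    # Hat-matrix adjustment factor sqrt(1 / (1 - h)), where h holds the
    # diagonal elements of the hat matrix for each cell; this gives
    # every residual the same variance
    return res * np.sqrt(1.0 / (1.0 - h))
```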
Discuss the difference between bootstrapping paid data and bootstrapping incurred data.
Bootstrapping paid data provides a distribution of possible outcomes for total unpaid claims. Bootstrapping incurred data provides a distribution of possible outcomes for IBNR
Explain how the results of an incurred data model can be converted to a paid data model.
To convert the results of an incurred data model to a payment stream, we apply payment patterns to the ultimate value of the incurred claims
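A minimal sketch, assuming a hypothetical incremental payment pattern:

```python
import numpy as np

# Hypothetical pattern: % of ultimate paid in each development period
pattern = np.array([0.40, 0.30, 0.20, 0.10])

def payment_stream(ultimate):
    # Spread the simulated ultimate incurred value over the periods
    return ultimate * pattern
```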
Explain the benefit of bootstrapping the incurred data triangle.
Bootstrapping incurred data leverages the case reserves to better predict the ultimate claims. This improves estimates, while still focusing on the payment stream for measuring risk
Identify one deterministic method for reducing the variability in the extrapolation of future incremental values.
Bornhuetter/Ferguson method
Explain how this method can be made stochastic. (Reducing variability in the extrapolation of future incremental values)
In addition to specifying a priori loss ratios for the Bornhuetter/Ferguson method, we can add a vector of standard deviations to go with these means. We can then assume a distribution and simulate a different a priori loss ratio for every iteration of the model
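A minimal sketch, assuming hypothetical a priori loss ratios and standard deviations and a normal distribution (the source leaves the distributional choice open):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical a priori loss ratios and standard deviations by accident year
prior_lr = np.array([0.70, 0.72, 0.75, 0.80])
prior_sd = np.array([0.05, 0.05, 0.07, 0.10])

def simulate_prior_loss_ratios():
    # Draw a different a priori loss ratio for each accident year
    # on every iteration of the bootstrap model
    return rng.normal(prior_lr, prior_sd)
```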
Identify four advantages to generalizing the over-dispersed Poisson model.
⇧ Using fewer parameters helps avoid over-parameterizing the model
⇧ Gives us the ability to add parameters for calendar year trends
⇧ Gives us the ability to model data shapes other than triangles
⇧ Allows us to match the model parameters to the statistical features found in the data, and to extrapolate those features
Identify two disadvantages to generalizing the over-dispersed Poisson model.
⇧ The GLM must be solved for each iteration of the bootstrap model, which may slow down the simulation
⇧ The model is no longer directly explainable to others using age-to-age factors
Provide a disadvantage to including calendar year trends in the over-dispersed Poisson model.
By including calendar year trends, the system of equations underlying the GLM no longer has a unique solution
Explain how this problem can be remedied. (CY trends in the ODP)
To deal with this issue, we start with a model with one alpha parameter, one beta parameter, and one gamma (calendar-year trend) parameter. We then add and remove parameters as needed
Negative incremental values can cause extreme outcomes in early development periods. In particular, they can cause large age-to-age factors. Describe four options for dealing with these extreme outcomes.
⇧ Identify the extreme iterations and remove them
• Only remove unreasonable extreme iterations so that the probability of extreme outcomes is not understated
⇧ Recalibrate the model
• Identify the source of the negative incremental losses and remove it if necessary. For example, if the first row has negative incremental values due to sparse data, remove it and reparameterize the model
⇧ Limit incremental losses to zero
• This involves replacing negative incremental values with zeroes in the original triangles, the sampled triangles OR the projected future incremental losses. We can also replace negative incremental losses with zeroes based on their development column (a minimal sketch of this option follows this list)
⇧ Use more than one model
• For example, if negative values are caused by salvage/subrogation, we can model the gross losses and salvage/subrogation separately. Then, we can combine the iterations assuming 100% correlation
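A minimal sketch of the "limit incremental losses to zero" option, applied here to a hypothetical incremental triangle; the same one-liner could be applied to the sampled triangles or the projected future incrementals instead:

```python
import numpy as np

# Hypothetical incremental triangle with negative cells; np.maximum
# propagates np.nan, so unobserved future cells are left untouched
incr = np.array([[100.0,   40.0,   -5.0],
                 [110.0,   -8.0, np.nan],
                 [120.0, np.nan, np.nan]])
incr_floored = np.maximum(incr, 0.0)
```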
ODP: Explain why the average of the residuals may be less than zero in practice.
If the magnitude of losses is higher for accident years that show higher development than the weighted average, the volume-weighted factors are pulled toward those years, leaving the lower-magnitude years with mostly negative residuals; the average of the residuals will therefore be negative
ODP: Explain why the average of the residuals may be greater than zero in practice.
If the magnitude of losses is lower for accident years that show higher development than the weighted average, the volume-weighted factors are pulled toward the higher-magnitude, slower-developing years, leaving the faster-developing years with mostly positive residuals; the average of the residuals will therefore be positive
Discuss the arguments for and against adjusting the residuals to an overall mean of zero.
⇧ Argument for adjusting the residuals
• If the average of the residuals is positive, then re-sampling from the residuals will add variability to the resampled incremental losses. It may also cause the resampled incremental losses to have an average greater than the fitted loss
⇧ Argument against adjusting the residuals
• The non-zero average of the residuals is a characteristic of the data set
If the decision is made to adjust the residuals to an overall mean of zero, explain the process for doing so.
We can add a single constant to all residuals such that the sum of the shifted residuals is zero
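A minimal sketch of that shift:

```python
import numpy as np

def center_residuals(res):
    # The single additive constant is minus the mean, so the shifted
    # residuals sum to exactly zero
    return res - np.mean(res)
```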
Describe the process for using an N-year weighted average of losses when determining development factors under the following frameworks: a) GLM framework
We use N years of data by excluding the first few diagonals in the triangle (which leaves us with N+1 included diagonals). This changes the shape of the triangle to a trapezoid. The excluded diagonals are given zero weight in the model, and fewer calendar year parameters are required
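A minimal sketch of the zero-weighting, where n is the triangle size and N the number of years in the average:

```python
import numpy as np

def trapezoid_weights(n, N):
    # Zero-weight every cell on the oldest diagonals so only the latest
    # N+1 diagonals (a trapezoid) enter the GLM fit; np.nan marks the
    # unobserved future cells
    w = np.full((n, n), np.nan)
    for i in range(n):
        for j in range(n - i):
            diagonal = i + j            # calendar-year index of cell (i, j)
            w[i, j] = 1.0 if diagonal >= n - 1 - N else 0.0
    return w
```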
Describe the process for using an N-year weighted average of losses when determining development factors under the following frameworks: b) Simplified GLM framework
First, we calculate N-year average factors instead of all-year factors. Then, we exclude the first few diagonals when calculating residuals. However, when running the bootstrap simulations, we must still sample from the entire triangle so that we can calculate cumulative values. We use N-year average factors for projecting the future expected values as well
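A minimal sketch of the N-year volume-weighted factors, assuming a cumulative triangle with np.nan in unobserved cells:

```python
import numpy as np

def n_year_ata(cum, N):
    # Volume-weighted factors using only the latest N accident years
    # available at each age (i.e., the latest N diagonals)
    n = cum.shape[0]
    factors = []
    for j in range(n - 1):
        rows = slice(max(0, n - 1 - j - N), n - 1 - j)
        factors.append(np.nansum(cum[rows, j + 1]) / np.nansum(cum[rows, j]))
    return factors
```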
Briefly describe three approaches to managing missing values in the loss triangle.
⇧ Estimate the missing value using surrounding values
⇧ Exclude the missing value
⇧ If the missing value lies on the last diagonal, we can use the value in the second-to-last diagonal to construct the fitted triangle
Briefly describe three approaches to managing outliers in the loss triangle.
⇧ If these values occur on the first row of the triangle where data may be sparse, we can delete the row and run the model on a smaller triangle
⇧ Exclude the outliers completely
⇧ Exclude the outliers when calculating the age-to-age factors and the residuals, but re-sample the corresponding incremental values when simulating triangles