Robertson - ELFs for HGs Flashcards

Question 1

Q

NCCI publishes Excess Loss Factors (ELFs) to assist with

Answer

A

pricing excess policies and some retrospectively rated policies for WC

Question 2

Q

NCCI calculates ELFs at level

Answer

A

hazard group and state level for each possible limit

ELFs are the same for every class in a given hazard group within a state for a given limit

Question 3

Q

hazard groups are defined on

Answer

A

CW basis since NCCI believes that mix of injuries within a class should not vary much by state

Question 4

Q

in 2007, NCCI implemented new hazard group definitions and moved from

Answer

A

4 to 7 hazard groups

Question 5

Q

end goal of new analysis was to produce

Answer

A

new hazard group definitions at CW level which could later be used to produce new ELFs for every limit, state, and hazard group combo

-also required deciding for which limits to calculate ELFs, deciding on # of hazard groups, and deciding which classes go into each hazard group

Question 6

Q

to group classes with similar vectors into potential hazard groups, can use

Answer

A

L2 or Euclidean distance

-this would have advantage in that it minimizes the relative error in estimating excess premium

Question 7

Q

relative error in estimating excess premium for class c with limit L is

Answer

A

PLR*|R_HG(L)-R_c(L|

Question 8

Q

NCCI performed a cluster analysis called ___ to group classes into hazard groups using premium weights

Answer

A

k-means algorithm

Question 9

Q

NCCI used cluster analysis to group classes with similar

Answer

A

vectors of excess ratios as measured by Euclidean distance into hazard groups

Question 10

Q

used __ clustering

Answer

A

non-hierarchical

which meant new hazard groups did not have to be subsets of existing groups and could instead represent best partition for given number of clusters

Question 11

Q

weighted k-means algorithm they used works as follows

Answer

A

decide on # of clusters (potential hazard groups) k to target
assign classes to k arbitrary groups
calculate centroid of excess ratios of each group (essentially weighted excess ratios)
for each class, find closest centroid using L2 distance
move each class to group with closet centroid
if any classes move go back to step 2 and continue

Question 12

Q

k-means clustering is equivalent to

Answer

A

to max R squared statistic in linear regression

it minimizes variance within and maximizes the variance between

which means hazard groups will be homogeneous and well separated

Question 13

Q

can think of R2 formula as

Answer

A

Trace(B)/Trace(T)

or

1-Trace(W)/Trace(T)

Question 14

Q

2 statistics used to decide on #HGs

Answer

A

Calinski and Harabasz statistic
Cubic Clustering Criterion (CCC) statistic

Question 15

Q

Calinski and Harabasz statistic

Answer

A

-measures between variance of clusters/within variance

[Trace(B)/(k-1)]/[Trace(W)/(n-k)]

n=#classes, k=#HGs

higher values of this statistic indicate better # of clusters
test also known as Pseudo-F test since it resembles F-test of regression analysis

*higher statistic indicates better clusters with higher between-cluster variance and lower within-cluster variance

Question 16

Q

Cubic Clustering Criterion (CCC) statistic

Answer

A

compares amount of variance explained by given set of clusters to that expected when clusters are formed at random based on multi-dimensional uniform distribution

high value indicates better performance
since CCC is less reliable when data is highly correlated, NCCI gave more weight to #1

Question 17

Q

result of 7 HGs was

Answer

A

best choice under #1 and 2nd best under #2

-to make final decision, NCCI re-ran analysis using only classes with 50% and 100% credibility and both statistics showed 7 as optimal #

Question 18

Q

reasons NCCI decided to not give further consideration to CCC test that showed 9 groups as optimal:

Answer

A

found that #1 outperformed #2 statistic
CCC deserves less weigh when correlation is present which was case
selection of #HGs ought to be driven by large classes where most of experience was concentrated; using highly or fully credible classes showed 7 as optimal #
there was crossover in excess ratios between HGs when using 9 groups which is something that is not appealing in practice (somewhat counterintuitive)

Question 19

Q

NCCI sent proposed HGs to underwriters to review and made some changes based on feedback; considered things such as:

Answer

A

similarity between class codes that were in different groups
degree of exposure to automobile accidents in given class
extent heavy machinery is used in given class

Question 20

Q

Robertson sums up by saying remapping of HGs was founded on 3 key ideas:

Answer

A

computing excess ratios by class
sorting classes based on excess ratios
cluster analysis

Question 21

Q

Why did the NCCI move from using 17 limits for ELFs to 5 limits

Answer

A

ELFs at any pair of excess limits are highly correlated across classes.
Limits below $100,000 were heavily represented in the prior 17 limits.
Using 1 limit wouldn’t have captured the full variability in excess ratios.
The NCCI wanted to cover the range of limits commonly used for rating

Question 22

Q

What is the formula the NCCI used for determining the credibility to apply to the class vectors of excess ratios in the analysis described by Robertson?

Give 5 other credibility options the NCCI considered in the analysis

Answer

A

z=min(n/n+k,1); n=#claims in class; k=avg # claims per class

i. Using the median instead of the average for k.
ii. Excluding Medical Only claims from the analysis.
iii. Including only Serious claims in the analysis.
iv. Requiring a minimum # of claims for classes used in the calculation of k.
v. Various square root rules, such as z =sqrt(n/384), which corresponds to a 95% chance of n being within 10% of its expected value.

Question 23

Q

Provide two reasons why the new hazard groups were superior to the prior hazard groups.

Answer

A

• The excess ratios for the new hazard groups were well separated compared to the prior hazard groups, meaning that there was less overlap in the excess ratios between classes in different hazard groups. • Amoresophisticatedmethodwasusedtodeterminetheoptimalnumberofhazardgroups. The NCCI used the Calinski and Harabasz and CCC statistics to decide on using 7 hazard groups. • A more sophisticated method was used to group classes into hazard groups by using the k-means algorithm. • The new hazard groups had a more even concentration of classes and premium, while previously classes were primarily concentrated in 2 of the 4 hazard groups. • As part of the analysis, NCCI updated their excess ratio calculations, which had not been done since 1993. Thus the 2007 excess ratios used more recent/applicable data. • The NCCI used an underwiter review process for ﬁnalizing the new hazard groups, which helped identify issues such as classes in different hazard groups with similar exposures. • The prior analysis used proxy variables to measure excess loss potential, while the new analysis used excess ratios directly. • The new analysis used excess ratios by injury type, while the prior analysis treated all injury types the same

Question 24

Q

Provide two reasons why a class code should be assigned to a different hazard group than the hazard group resulting from the cluster analysis.

Answer

A

• If a class code has very similar exposure to another class code, it may make sense to assign both to the same hazard group. • If a class code has a high degree of exposure to automobile accidents that is not fully reﬂected in the past experience, it may make sense to move it to a higher risk hazard group. • If a class code has a high degree of heavy machinery usage that is not fully reﬂected in the past experience, it may make sense to move it to a higher risk hazard group. • If a class code has a high degree of exposure to dangerous substances that is not reﬂected in the past experience, it may make sense to move it to a higher risk hazard group.