C.3. Multivariate Classification Flashcards
3 reasons GLMs have grown in popularity
- Increased computing power
- Better data availability
- Competitive pressure
Benefits of multivariate methods (particularly GLMs)
- Properly adjust for exposure correlations
- Focus on signal and ignore noise
- Provide statistical diagnostics (CIs)
- Allow for consideration of interactions between rating variables
Adv/disadv of minimum bias procedures
A: Properly adjust for exposure correlations
D: Do not provide statistical diagnostics to test whether variables are significant; also computationally inefficient (require iterative calculation)
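A minimal sketch of Bailey's iterative minimum bias procedure for a hypothetical two-way multiplicative plan (the exposure and loss arrays are made up); the alternating balance-equation updates illustrate why the method is iterative and offers no built-in significance tests.

```python
import numpy as np

# Hypothetical 2-way example: rows = age group, columns = territory.
losses = np.array([[120.0,  80.0, 100.0],
                   [200.0, 150.0, 180.0]])
exposures = np.array([[100.0,  90.0,  95.0],
                      [110.0, 100.0, 105.0]])

loss_cost = losses / exposures          # observed pure premiums r_ij
base = loss_cost.mean()                 # overall base rate

# Bailey's multiplicative minimum bias: alternate between the two sets
# of factors until the balance equations converge.
row_factor = np.ones(loss_cost.shape[0])
col_factor = np.ones(loss_cost.shape[1])

for _ in range(100):
    # Solve for row factors holding column factors fixed:
    #   x_i = sum_j(w_ij * r_ij) / (base * sum_j(w_ij * y_j))
    new_row = (exposures * loss_cost).sum(axis=1) / (base * exposures @ col_factor)
    # Solve for column factors holding the updated row factors fixed.
    new_col = (exposures * loss_cost).sum(axis=0) / (base * exposures.T @ new_row)
    if np.allclose(new_row, row_factor) and np.allclose(new_col, col_factor):
        row_factor, col_factor = new_row, new_col
        break
    row_factor, col_factor = new_row, new_col

print("Row (age) factors:      ", np.round(row_factor, 3))
print("Column (territory) factors:", np.round(col_factor, 3))
```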
Sequential analysis
A series of univariate analyses performed in sequence, with each step adjusting for the variables already analyzed; an improvement over purely univariate methods, but the results depend on the order in which variables are analyzed and the approach still does not fully correct for exposure correlations
Important steps in solving GLMs
Compiling a dataset with enough data for modeling, selecting a link function, specifying the distribution of the underlying random process, and using maximum likelihood to estimate the model parameters
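A minimal sketch of those steps using Python's statsmodels on simulated policy data (all names and values are hypothetical): specify a log link and a Poisson error distribution, then fit the coefficients by maximum likelihood.

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm
import statsmodels.formula.api as smf

# Hypothetical policy-level dataset (in practice, compiled from
# policy, claim, and exposure records).
rng = np.random.default_rng(0)
n = 5000
df = pd.DataFrame({
    "age_group": rng.choice(["youth", "adult", "senior"], n),
    "territory": rng.choice(["A", "B"], n),
    "exposure": rng.uniform(0.5, 1.0, n),
})
df["claim_count"] = rng.poisson(0.1 * df["exposure"])

# Specify the link function (log) and the distribution of the random
# component (Poisson); statsmodels estimates the parameters by
# maximum likelihood.
model = smf.glm(
    "claim_count ~ C(age_group) + C(territory)",
    data=df,
    family=sm.families.Poisson(link=sm.families.links.Log()),
    offset=np.log(df["exposure"]),
)
result = model.fit()
print(result.summary())
```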
Why GLMs are usually run on frequency and severity instead of loss ratios
- No need to on-level premiums at a granular level
- A priori expectations exist for frequency and severity patterns, but not for loss ratio patterns
- No standard distribution for modeling loss ratios
- Loss ratio models become obsolete when rates are changed
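A complementary sketch of the severity side (hypothetical claim-level data): severity is commonly modeled with a Gamma distribution and log link, and the indicated pure premium relativities would be the product of the frequency and severity relativities.

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm
import statsmodels.formula.api as smf

# Hypothetical claim-level dataset for severity modeling
# (one row per closed claim).
rng = np.random.default_rng(1)
m = 2000
claims = pd.DataFrame({
    "age_group": rng.choice(["youth", "adult", "senior"], m),
    "territory": rng.choice(["A", "B"], m),
})
claims["claim_amount"] = rng.gamma(shape=2.0, scale=1500.0, size=m)

# Gamma error distribution with a log link for claim severity.
sev_model = smf.glm(
    "claim_amount ~ C(age_group) + C(territory)",
    data=claims,
    family=sm.families.Gamma(link=sm.families.links.Log()),
).fit()

# With a log link, exponentiated coefficients are multiplicative relativities.
print(np.exp(sev_model.params))
```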
Common GLM diagnostic tests
Examining confidence intervals (CIs) around parameter estimates
Statistical tests (e.g., chi-square tests, F-tests) comparing candidate models
Running model on separate consecutive time periods of data to see if parameters are consistent over time
Building model on a subset of historical data and comparing prediction with actual
Judgment (e.g., whether the parameter estimates and indicated patterns make sense)
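A minimal sketch of some of these diagnostics with statsmodels on a hypothetical frequency dataset: confidence intervals from conf_int(), a likelihood ratio (chi-square) test of a candidate variable, and a holdout comparison of predicted vs. actual.

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm
import statsmodels.formula.api as smf
from scipy import stats

# Hypothetical frequency dataset (same shape as in the earlier sketch).
rng = np.random.default_rng(0)
n = 5000
df = pd.DataFrame({
    "age_group": rng.choice(["youth", "adult", "senior"], n),
    "territory": rng.choice(["A", "B"], n),
    "exposure": rng.uniform(0.5, 1.0, n),
})
df["claim_count"] = rng.poisson(0.1 * df["exposure"])

# Build the model on a subset of the data and hold out the rest.
train = df.sample(frac=0.7, random_state=0)
holdout = df.drop(train.index)

fam = sm.families.Poisson(link=sm.families.links.Log())
full = smf.glm("claim_count ~ C(age_group) + C(territory)", data=train,
               family=fam, offset=np.log(train["exposure"])).fit()
reduced = smf.glm("claim_count ~ C(age_group)", data=train,
                  family=fam, offset=np.log(train["exposure"])).fit()

# 1. Confidence intervals around the parameter estimates.
print(full.conf_int())

# 2. Likelihood ratio (chi-square) test: is territory statistically significant?
lr_stat = 2 * (full.llf - reduced.llf)
p_value = stats.chi2.sf(lr_stat, df=full.df_model - reduced.df_model)
print(f"LR statistic = {lr_stat:.2f}, p-value = {p_value:.4f}")

# 3. Compare predicted vs. actual claim counts on the holdout data.
pred = full.predict(holdout, offset=np.log(holdout["exposure"]))
print("Holdout actual:", int(holdout["claim_count"].sum()),
      "predicted:", round(pred.sum(), 1))
```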
Actuaries’ role in GLMs
Obtaining reliable data (GIGO)
Exploring anomalous results in GLM with additional analysis
Considering model results from statistical and business perspective
Developing appropriate methods to communicate model results based on company’s ratemaking objectives
Common types of external data used in GLMs
Geo-demographic information
Weather data
Property characteristics
Information about insured individuals or businesses (e.g., credit scores)
Data mining techniques
- Factor analysis: reduces the number of variables needed
- Cluster analysis: combines similar risks into groups
- CART (classification and regression trees): produces if-then rules
- MARS (multivariate adaptive regression splines): turns continuous variables into categorical variables
- Neural networks: training algorithms to identify patterns
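An illustrative sketch of one of these techniques (CART) using scikit-learn on hypothetical policy data; a shallow tree prints directly as human-readable if-then rules.

```python
import numpy as np
import pandas as pd
from sklearn.tree import DecisionTreeClassifier, export_text

# Hypothetical policy data: predict whether a policy has a claim.
rng = np.random.default_rng(0)
n = 1000
X = pd.DataFrame({
    "driver_age": rng.integers(18, 80, n),
    "vehicle_age": rng.integers(0, 20, n),
})
# Simulated target: younger drivers and older vehicles claim more often.
p = 1 / (1 + np.exp(0.08 * (X["driver_age"] - 30) - 0.05 * X["vehicle_age"]))
y = rng.binomial(1, p)

# A shallow CART yields a small set of if-then rules.
tree = DecisionTreeClassifier(max_depth=2, random_state=0).fit(X, y)
print(export_text(tree, feature_names=list(X.columns)))
```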