C.3. Multivariate Classification Flashcards

1
Q

3 reasons GLMs have grown in popularity

A
  1. Increased computing power
  2. Better data availability
  3. Competitive pressure
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Benefits of multivariate methods (particularly GLMs)

A
  1. Properly adjust for exposure correlations
  2. Focus on signal and ignore noise
  3. Provide statistical diagnostics (CIs)
  4. Allow for consideration of interactions between rating variables
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Adv/disadv of minimum bias procedures

A

A: properly adjusts for exposure correlation
D: do not provide ways to test for whether variables are statistically significant, also computationally inefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Sequential analysis

A

Meh

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Important steps in solving GLMs

A

Compiling dataset with enough data for modeling, selecting a link function, specifying distribution of underlying random process, and using maximum likelihood to calculate parameters of the model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why GLMs are usually run of frequency and severity instead of loss ratios

A

No need to on-level premiums at granular level, a priori expectations of frequency and severity but not loss ration patterns, no standard distribution for modeling loss ratios, loss ratio models become obsolete when rates are changed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Common GLM diagnostic tests

A

Looking at CIs around estimates
Chi-square, F-tests, other tests
Running model on separate consecutive time periods of data to see if parameters are consistent over time
Building model on a subset of historical data and comparing prediction with actual
Judgmental decision

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Actuaries’ role in GLMs

A

Obtaining reliable data (GIGO)
Exploring anomalous results in GLM with additional analysis
Considering model results from statistical and business perspective
Developing appropriate methods to communicate model results based on company’s ratemaking objectives

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Common types of external data used in GLMs

A

Geo-demographic information
Weather data
Property characteristics
Information about insured individuals or businesses (i.e. credit scores)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Data mining techniques

A
Factor analysis (reduce number or variables needed)
Cluster analysis (combine similar risks into groups)
CART (classification and regression trees): if-then rules
MARS (multivariate adaptive regression spline): turns continuous variables into categorical variables
Neural networks: training algorithms to identify patterns
How well did you know this?
1
Not at all
2
3
4
5
Perfectly