GLMs Flashcards
Advantages of GLMs
Adjust for correlations among rating variables, with less restrictive assumptions than classical linear models
Disadvantages of GLMs
Often difficult to explain results
Components of GLMs
Random component
Systematic component
Random component of GLM
Each yi assumed to be independent and come from exponential family of distributions
Mean = µi
Variance = φV(µi)/wi
Variance within a GLM, formula
Variance = φV(µi)/wi
φ
Dispersion/Scale factor
Same for all observations
V(µ)
Variance function, given for selected distribution type
Describes relationship between variance and mean
Same distribution type must be used for all observations
Systematic component of GLM
g(µi) = ß0 + ß1xi1 +… + ßpxip + offset
The offset term allows you to manually specify estimates for certain variables, which the GLM then takes as given
Link functions, g(x)
Allow for transformations of the linear predictor
Link function often used for binary target variables
Logit link:
g(µ) = ln [µ / (1 - µ)]
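A minimal sketch of fitting a logit-link GLM using statsmodels; the data and coefficients below are invented purely for illustration:

```python
# Minimal sketch: binomial GLM with a logit link via statsmodels.
# All data below is invented for illustration.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
x = rng.normal(size=200)
p = 1 / (1 + np.exp(-(-0.5 + 1.2 * x)))   # true relationship on the logit scale
y = rng.binomial(1, p)                    # binary target

X = sm.add_constant(x)                    # adds the intercept column
fit = sm.GLM(y, X, family=sm.families.Binomial()).fit()  # logit is the default link
print(fit.params)                         # estimates of ß0 and ß1
```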
Link function used in rating plans
g(µ) = ln(µ)
Allows transformation of linear predictor into multiplicative structure
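A quick numeric check, with invented coefficients, that the log link turns the additive linear predictor into a multiplicative rating structure:

```python
# Invented coefficients: intercept (base rate) plus two rating variables.
import numpy as np

beta0, beta1, beta2 = 5.0, 0.3, -0.1
x1, x2 = 1.0, 2.0

additive = np.exp(beta0 + beta1 * x1 + beta2 * x2)
multiplicative = np.exp(beta0) * np.exp(beta1 * x1) * np.exp(beta2 * x2)
print(additive, multiplicative)   # identical: exp turns sums into products of factors
```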
Advantages of multiplicative rating plans
Simple/practical to implement
Guarantees positive premiums
Impact of risk characteristics more intuitive
Variance functions for exponential families
Normal: V(µ) = 1
Poisson: V(µ) = µ
Gamma: V(µ) = µ²
Inverse Gaussian: V(µ) = µ³
Negative binomial: V(µ) = µ(1 + κµ)
Binomial: V(µ) = µ(1 − µ)
Tweedie: V(µ) = µ^p
When to use weights in a GLM
When a single observation contains grouped information
Different observations represent different time periods
(If neither apply, weights are all 1)
Weights are usually the denominator of the modeled quantity
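A sketch of a weighted fit, assuming each record is an average severity over several claims so that claim count serves as the weight wi; the data is invented, and statsmodels' var_weights argument supplies the weights:

```python
# Sketch: Gamma severity GLM where each row is an average severity over
# several claims, so claim count enters as the weight (data invented).
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
n = 300
x = rng.normal(size=n)
claim_counts = rng.integers(1, 10, size=n)   # w_i: claims behind each record
mu = np.exp(7.0 + 0.4 * x)                   # true mean severity
# Average of w gammas has mean mu and variance proportional to 1/w:
avg_severity = rng.gamma(shape=2.0 * claim_counts,
                         scale=mu / (2.0 * claim_counts))

X = sm.add_constant(x)
fit = sm.GLM(avg_severity, X,
             family=sm.families.Gamma(link=sm.families.links.Log()),
             var_weights=claim_counts).fit()
print(fit.params)
```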
Severity model distributions
Tend to be right-skewed
Lower bound at zero
Gamma and inverse Gaussian distributions exhibit these properties
Gamma vs. inverse Gaussian distributions
Gamma most commonly used for severity
Inverse Gaussian has sharper peak, wider tail (more appropriate for more skewed distributions)
Frequency model distributions
Poisson (technically, ODP) - most common; φ can be > 1
Negative binomial - φ = 1, but a κ appears in the variance function: V(µ) = µ(1 + κµ)
Pure premium distributions
Large point mass at zero (most policies have no claims)
Right-skewness due to severity distribution
Tweedie distribution most commonly used
Tweedie distribution
p = “Power parameter”
1 < p < 2: Poisson frequency and Gamma severity
Assumes frequency and severity move in same direction (not realistic)
Mean, Tweedie
Poisson mean x Gamma mean
= λ x αθ
Variance, Tweedie
φµ^p
p, Tweedie
p = (α + 2) / (α + 1)
Dispersion factor, Tweedie
φ = [λ^(1−p) (αθ)^(2−p)] / (2 − p)
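A small worked example (invented λ, α, θ) deriving the Tweedie parameters from the underlying Poisson frequency and Gamma severity, with a cross-check of the variance against the compound Poisson-Gamma form:

```python
# Sketch: Tweedie parameters from assumed Poisson frequency (lam) and
# Gamma severity (shape alpha, scale theta); the values are invented.
lam, alpha, theta = 0.2, 1.5, 4000.0

mu = lam * alpha * theta                    # Tweedie mean = Poisson mean x Gamma mean
p = (alpha + 2) / (alpha + 1)               # power parameter, 1 < p < 2
phi = lam**(1 - p) * (alpha * theta)**(2 - p) / (2 - p)  # dispersion

variance = phi * mu**p
# Cross-check against the compound Poisson-Gamma variance, lam * E[X^2]:
check = lam * (alpha * theta**2 + (alpha * theta)**2)
print(mu, p, phi, variance, check)          # variance == check
```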
Probability model distributions
Binomial distribution used
Typically use logit function
µ / (1 - µ) known as odds
Odds, logit function
µ / (1 - µ)
Each unit increase in a predictor variable with coefficient ß increases the odds by e^ß − 1 in percentage terms
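A one-line numeric illustration with an invented coefficient:

```python
# Invented coefficient: ß = 0.18 means each unit increase in the predictor
# raises the odds by about 19.7%.
import math
print(math.exp(0.18) - 1)   # ≈ 0.197
```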
Offset term
Incorporates pre-determined values into the linear predictor, which the GLM takes as given (coefficient fixed at 1)
Continuous predictor variables
If log link function is used, continuous variables should be logged (flexibility in fitting curve shapes)
Exceptions: year variables for trends, variables containing values of 0
Categorical predictor variables
Have 2 or more levels, converted to binary variables
Level with the highest number of observations usually deemed the base level
Using a sparsely populated level as the base results in wider CIs
Matrix notation of GLM
g(µ) = Xß
µ is the vector of µi values
ß is the vector of ß parameters (coefficients)
X is the design matrix (each row holds an observation's predictor values, i.e., its linear predictor terms without the coefficients)
Degrees of freedom
Number of parameters that need to be estimated for the model
If a variable is not significant in a GLM
Should be removed, grouped with the base level
Options for dealing with very high correlation in GLMs
- Remove all but one of the correlated variables
- Use dimensionality-reduction techniques (e.g., principal components analysis) to combine the correlated variables into a new variable
Multicollinearity
Near-perfect linear dependency among 3 or more predictor variables
Ex: x1 + x2 ~ x3
Detected with the variance inflation factor (VIF) statistic; VIF > 10 is considered high
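A sketch of flagging multicollinearity with VIFs, using statsmodels' variance_inflation_factor on invented data containing a deliberate near-dependency:

```python
# Sketch: VIFs on invented data where x3 is nearly x1 + x2.
import numpy as np
from statsmodels.stats.outliers_influence import variance_inflation_factor

rng = np.random.default_rng(2)
x1 = rng.normal(size=500)
x2 = rng.normal(size=500)
x3 = x1 + x2 + rng.normal(scale=0.05, size=500)   # near-perfect dependency
X = np.column_stack([x1, x2, x3])

for i in range(X.shape[1]):
    print(f"x{i + 1}: VIF = {variance_inflation_factor(X, i):.1f}")  # x3 far above 10
```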
Aliased variables
Perfect linear dependency among predictor variables
GLM will not converge, but most GLM software will detect the dependency and remove one of the variables from the model
GLM limitations
- GLMs give the data full credibility (partially addressed by p-values, SEs)
- GLMs assume the randomness of outcomes is uncorrelated (violated if the dataset has several renewals of the same policy, or by weather events)
Model-building process
- Setting goals and objectives
- Communication (IT, legal, UWs)
- Collecting/processing data
- Exploratory data analysis
- Specifying the form of the model
- Evaluating model output
- Validation
- Translation into a product
- Maintenance and rebuild
Splitting data for testing
Training set and test (holdout) set
Model testing strategies
Train and test
Train, validate, test
Cross-validation
Train and test
Split into single training and single test sets (60/40 or 70/30)
Can split randomly or on time (if not done by time, could lead to over-optimistic validation results)
Train, validate, and test
Validation set can be used to refine model and make tweaks
Test set should be left until model is final
Typically 40/30/30
Cross-validation
Less common in insurance (hand-picked variables)
Most common is k-fold:
1. Pick a k and split the data into k folds
2. For each fold, train the model using the other k − 1 folds, and test using that fold
Superior (more data for training and testing) but more time-consuming (models built completely separately)
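A sketch of k-fold cross-validation using scikit-learn's KFold splitter; the Poisson regression is just a stand-in for whatever GLM is being tested, and the data is invented:

```python
# Sketch: 5-fold cross-validation with an invented frequency dataset.
import numpy as np
from sklearn.model_selection import KFold
from sklearn.linear_model import PoissonRegressor  # stand-in GLM

rng = np.random.default_rng(3)
X = rng.normal(size=(600, 4))
y = rng.poisson(np.exp(0.2 + X @ np.array([0.3, -0.2, 0.1, 0.0])))

kf = KFold(n_splits=5, shuffle=True, random_state=0)
scores = []
for train_idx, test_idx in kf.split(X):
    model = PoissonRegressor().fit(X[train_idx], y[train_idx])
    scores.append(model.score(X[test_idx], y[test_idx]))  # score on the held-out fold
print(np.mean(scores))
```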
When to use test dataset
Only when model is complete
If too many decisions are made based on test set, it is effectively a training set (leads to overfitting)
Advantages of modeling freq/sev over PP
Gain more insight and intuition about each
Each is more stable separately
PP modeling can lead to overfitting if a predictor variable impacts only frequency or only severity, since the randomness of the other component may be mistaken for signal
Tweedie assumes both freq and sev move in same direction
Handling perils in a GLM
- Run each peril model separately
- Aggregate expected losses
- Run model using all-peril LC as target variable and union of all predictor variables as predictors (focus on dataset more reflective of future mix of business)
Criteria for variable inclusion in GLM
p-values
Cost-effectiveness of collecting data
ASOPs/legal requirements
IT constraints
Partial residuals for predictor variables
Plot r against x and see if the points match y = ßx
Transformations if residuals do not match line
Binning (increases DOF; variation within bins is ignored)
Adding polynomial terms (loses interpretability without a graph)
Adding piecewise linear functions: hinge function max(0, x − c) at each break point c
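A sketch of the hinge transformation, with an invented break point at 40:

```python
# Sketch: hinge transformation max(0, x - c), letting the fitted
# relationship change slope at the break point (c = 40 is invented).
import numpy as np

def hinge(x, c):
    """Piecewise-linear basis: 0 below the break point, linear above it."""
    return np.maximum(0.0, x - c)

age = np.array([18, 25, 40, 55, 70], dtype=float)
# Design columns: the original variable plus a hinge term at age 40.
design = np.column_stack([age, hinge(age, 40.0)])
print(design)
```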
Interaction term combinations
Two categorical: product of the binary indicators (1/0)
Continuous and categorical: continuous term per level (f(x) or 0)
Two continuous: product of the two
Log-likelihood
Log of the product of the likelihoods of all observations under the model (equivalently, the sum of the individual log-likelihoods)
Deviance
2 × (LL of the saturated model − LL of the fitted model)
Measures how far the model falls short of a perfect fit; lower deviance = closer fit to training data
Comparing models using LL and deviance
Only valid if datasets used are identical
Comparisons of deviance only valid if assumed distribution and dispersion are the same
Nested models (F-test)
When one model's predictors are a subset of another's, the F-test assesses whether the added variables are significant:
F = (D_small − D_big) / (number of added parameters × φ̂_big)
Akaike Information Criterion (AIC)
AIC = -2LL + 2p
Bayesian Information Criterion (BIC)
BIC = -2LL + p ln (n)
Less reasonable for insurance data (with large n, the ln(n) penalty per parameter is large, so BIC tends to reject too many variables)
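A small worked example (invented LL, p, n) showing why BIC's penalty dominates for large insurance datasets:

```python
# Invented values: log-likelihood, parameter count, and observation count.
import math

ll = -10432.7   # log-likelihood of the fitted model
p = 12          # number of estimated parameters
n = 250_000     # number of observations

aic = -2 * ll + 2 * p
bic = -2 * ll + p * math.log(n)
print(aic, bic)  # BIC's per-parameter penalty (ln n ≈ 12.4) dwarfs AIC's (2)
```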
Q-Q plot
Sort deviance residuals in ascending order (y-axis)
Φ⁻¹[(i − 0.5) / n] for x-coordinates
If model is well-fit, points will appear on a straight line
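A sketch of the Q-Q plot coordinates, using stand-in residuals; scipy's norm.ppf supplies Φ⁻¹:

```python
# Sketch: Q-Q plot coordinates for deviance residuals (residuals invented).
import numpy as np
from scipy.stats import norm

dev_resid = np.sort(np.random.default_rng(4).normal(size=100))  # stand-in residuals
n = len(dev_resid)
theoretical = norm.ppf((np.arange(1, n + 1) - 0.5) / n)         # Φ⁻¹[(i − 0.5)/n]

# A well-fit model puts the points (theoretical, dev_resid) near a straight line.
print(np.corrcoef(theoretical, dev_resid)[0, 1])                # close to 1 here
```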
Model stability measures
Cook’s distance
Cross-validation
Bootstrapping
Model selection methods
Plotting actual vs. predicted
Simple quantile plots
Double lift charts
Loss ratio charts
Gini Index
Lift
Economic value of the model (ability to prevent adverse selection)
Creating simple quantile plot
Sort holdout dataset based on predicted LC
Bucket into quantiles by exposure
Calculate average predicted LC and average actual LC for each bucket and plot (divide both values by the overall average predicted LC)
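A sketch of the quantile-plot calculation; the column names (predicted_lc, actual_lc, exposure) are invented for illustration:

```python
# Sketch: exposure-weighted quantile plot data from a holdout dataset.
import numpy as np
import pandas as pd

def quantile_plot_data(df, n_buckets=10):
    """One row per bucket: normalized average predicted and actual LC."""
    df = df.sort_values("predicted_lc").copy()
    cum_expo = df["exposure"].cumsum() / df["exposure"].sum()
    df["bucket"] = np.minimum((cum_expo * n_buckets).astype(int), n_buckets - 1)

    rows = []
    for _, g in df.groupby("bucket"):
        rows.append({
            "avg_predicted": np.average(g["predicted_lc"], weights=g["exposure"]),
            "avg_actual": np.average(g["actual_lc"], weights=g["exposure"]),
        })
    overall = np.average(df["predicted_lc"], weights=df["exposure"])
    return pd.DataFrame(rows) / overall   # divide both by overall avg predicted LC
```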
Winning model, simple quantile plots
- Predictive accuracy
- Monotonicity
- Vertical distance of actual LC between first and last quantiles
Double lift chart
- Calculate sort ratio (Model 1 predicted LC ÷ Model 2 predicted LC)
- Sort by sort ratio
- Bucket into quantiles by exposure
- Calculate average predicted LC for each model and average actual LC for each bucket, divide by overall average LC
Loss ratio chart
- Sort holdout dataset based on predicted LC
- Bucket into quantiles by exposure
- Calculate actual loss ratio (using current rating plan)
The greater the distance between the lowest and highest buckets' loss ratios, the better the model is at identifying further segmentation opportunities
Gini Index
- Sort holdout dataset based on predicted LC
- Plot the Lorenz curve: x-axis is cumulative % of exposures, y-axis is cumulative % of actual losses
- Gini index = 2 × the area between the line of equality and the Lorenz curve
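A sketch of the Gini index computed from the Lorenz curve; the function name and arguments are illustrative, not from the source:

```python
# Sketch: Gini index as twice the area between the line of equality
# and the Lorenz curve, built from a holdout dataset.
import numpy as np

def gini_index(predicted_lc, exposure, actual_loss):
    order = np.argsort(predicted_lc)    # sort by the model's predictions
    cx = np.insert(np.cumsum(exposure[order]) / exposure.sum(), 0, 0.0)
    cy = np.insert(np.cumsum(actual_loss[order]) / actual_loss.sum(), 0, 0.0)
    # Area under the Lorenz curve via the trapezoid rule:
    area_under_lorenz = np.sum(np.diff(cx) * (cy[1:] + cy[:-1]) / 2)
    return 1.0 - 2.0 * area_under_lorenz   # = 2 x area between equality line and curve
```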
True positive
Correct prediction that the event occurs
False positive
Prediction that the event occurs, but it does not
False negative
Prediction that the event does not occur, but it does
True negative
Correct prediction that the event does not occur
Sensitivity of a model
Ratio of true positives to total event occurrences
Sometimes called the true positive rate or hit rate
Specificity
Ratio of true negatives to total event non-occurrences
Receiver Operating Characteristic (ROC) Curve
Plots sensitivity as a function of 1 - specificity
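A sketch of computing ROC-curve points from predicted probabilities, using scikit-learn on invented data:

```python
# Sketch: ROC curve points and AUC for a probability model (data invented).
import numpy as np
from sklearn.metrics import roc_curve, roc_auc_score

rng = np.random.default_rng(5)
y_true = rng.binomial(1, 0.3, size=1000)
y_prob = np.clip(y_true * 0.3 + rng.uniform(size=1000) * 0.7, 0, 1)  # noisy scores

fpr, tpr, thresholds = roc_curve(y_true, y_prob)   # fpr = 1 - specificity
print(roc_auc_score(y_true, y_prob))               # area under the ROC curve
```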
Ensemble models
If two or more models perform roughly equally well, they can be combined; this only works well when the models' errors are as uncorrelated as possible