Graphs Flashcards
GLM - Steps to create a simple quantile plot
- Sort the holdout dataset based on that model’s predicted loss cost.
- Bucket the data into quantiles with each quantile having equal exposure
- Calculated the average predicted loss cost and average actual loss cost for each bucket and plot them on a graph.
GLM - How to pick winning model from quantile plot?
The winning model should be based on 3 criteria:
1. Predictive Accuracy - Difference between actual and predicted in each quantile.
2. Monotonicity - The actual pure premium should consistently increase across quantiles.
3. Vertical distance of actual loss cost between first and last quantile. Indicates how well model distinguishes best and worst risks.
GLM - Graph of Quantile and Axis
GLM - How to create double lift chart?
- For each observation, calculate sort ratio = model 1 predicted loss cost / model 2 predicted loss cost.
- Sort the data by sort ratio in ascending order.
- Bucket the data into quantiles with equal exposures.
- Calculate the average predicted loss cost for each model and average actual loss cost for each bucket, divide each by the overall average loss cost from that source, and plot the quantities on a graph.
GLM - Explain the graph of a double lift chart
GLM - Explain how to create a Loss Ratio Chart
- Sort dataset based on that model’s predicted loss costs.
- Bucket the data into quantiles with each have equal exposure.
- Calculate the actual loss ratio (based on the current rating plan, not the model) for each bucket and plot them on a graph.
GLM - How to determine which model performs better in a Loss Ratio Chart
The greater the vertical distance between the lowest and highest loss ratios, the greater the model does at identifying further segmentation opportunities not present in the current rating plan.
GLM - Describe the graph of a loss ratio chart
GLM - How to create Gini Index/Lorenz Curve?
- Sort the holdout dataset based on that model’s predicted loss costs.
- Plot a graph with the x-axis being the cumulative percent of exposures and the y-axis being the cumulative percent of actual losses.
GLM - Describe Lorenze Curve and identify Gini?
GLM - Describe ROC Curve.
Robertson - Explain which graph is better?
The proposed method should be used since it results in well separated groupings, as there is
little overlap between the indicated deductible factors within each coverage A grouping. The
current method groupings have significant overlap, particularly for the 2 middle groups.
Bahnemann - Excess Severity Curve Shapes - Pareto, Exponential, Weibull, Lognormal, Gamma
Bahnemann - Answer the following:
a) Express G+H in Integral form with layer method
b) Express B+C+D in integral form with size method
c) In letters, what is ILF for S with base of R
Fisher - Describe how to perform Quintile’s Test
- Sort risks by mod from lowest to highest
- Group into 5 buckets
- Calculate manual and standard loss ratios using premium weighted average