Use of Multivariate Models in Pricing Flashcards
Information required to quote: Motor
- Information on the policy and coverage.
- type of cover
- payment frequency
- voluntary excess - Details of proposer
- age
- gender
- marital
- occupation - Details of driver
- experience
- age
- relationship - Details of vehicles
- make/model
- parking location
- value
- safety features
Information required to quote: Household insurance
- Policy details
- type of cover
- excess
- special items
- number of ppl in household- bachelor or family home
- claims Experience - Proposer
- age
- gender
- marital
- smoker
- employment status - House details
- year purchase
- type of Property- flat or bungalow
- age of Property
- construction type
- number of bedrooms
- location
- ownership e.g lease
- property value
- security features e.g alarm
- trees
Vehicle Classification Techniques: ABI classifies vehicles into 50 groups based on characteristics. Factors used to establish groupings:
- Damage and part costs.
- repair times
- new car values
- body shells (aluminium or steel)
- perfomance (acceleration speed)
- car security.
- safety features
- engine size
Many insurer’s use ABI as a starting basis for categorising vehicles, then use adjust based on experience.
Risk Groupings : Motor
Claim type Size of claim Past claims Experience NCD Status Age Gender Vehicle group Vehicle Age
Risk Groupings: Household
Claim type e.g fire, theft Size of claim Number of bedrooms Location Age Sum insured Past claims Experience
Pricing with limited Data
- Use other data e.g external data, historical data adjusted, own data for similar biz
- Include loadings or conservative assumptions
- Use ILFs/first loss curves to estimate higher layer premium
- Use qualitative methods e.g where risk perception is important element in pricing
Risk factors that actually affect actual claims cost: Motor
Driver
-driving style, Experience, level of skill, power of observation
Vehicle
-value of vehicle/repair cost, safety features e.g airbags, ABS, security features, performance, speed, size, weight
Environment
- the road, the time of day, natural hazards eng ice
Exposure
-amount of driving ( no. of miles)
Third parties
Many of these factors can’t be quantified, change over time and can’t be defined by customer.
Point of sale questions act as proxies for genuine risk factors
Effectiveness of these factors depends on:
- How directly it measures a genuine risk factor. E.g vehicle value
- If proxy is a factual quantity known to Proposer e.g postcode can give more info on area.
- Whether the fact has an obvious direction, which proposer may misstate to obtain cheaper quote e.g annual mileage.
- The extent to which the proxy overlap to other e.g age of licence correlates to age of driver
Risk factors that actually affect actual claims cost: Household
- smoker is correlated to fire. Many house fires caused by cigarettes.
- Some construction types are vulnerable to fire.
- Presence of alarm will impact theft.
- measures of exposure- sum insured, number of bedrooms, number of children.
- Many customers do not realise they could claim for particular events. Employment status, no. of prior claims, postcode act as proxies for this behaviour.
External data that may be used to predict claims experience.
Proposer
- previous insurance company.
- other insurance products
- customer lifetime value models- asses price elasicity, cross sell, strategy to lapsing insureds, how long they remain in co. tc.
- Customer behaviour models
- credit score/insurance score
Location e.g postcode
- average wealth
- subsidence/ geological soil
- flood, theft data
- census data.
Insured Asset ( motor)
- data from insurer’s trade body ABI e.g car group.
- data from motor registration/ licencing authority
- Additional vehicle data
- data from inter industry agreement to share claims info.
- Aggregrator sites may provide competitor rates.
- Indices for inflation from government consultancies.
- Tax rates
How external data actually affect actual claims cost:
- Questions relating to perils e.g flood data related to flood peril, ABI data can give info on performance and repair costs.
- Questions that relate to risk exposure e.g licensing bodies may collect actual mileage data.
Questions that relate to customer discretionary behaviour e.g wealth, prior claims, cross product
Types of Multivariate
- GLM ( mainly for personal lines)
- Link function is a log function that allows multiplicative relationship between factors.
Classification and approaches
- Factors such as postcode have large number of levels
- Classification may be used to produce smaller grouping such that they can be included in the GLM.
- This improves predicted values taking into account credibility.
Approaches
- Spatial smoothing
- Vehicle classification techniques based on ABI.
Spatial Smoothing and type
- physically close areas, same experience.
Types
- Distance based smoothing
- Adjacency based smoothing.
- Too low spatial Smoothing means near or neighbouring location codes have little influence causing random noise. Hence reducing predictiveness of model.
- Too much spatial smoothing can result in the blurring of experience so that some of the true underlying residual variation is lost, again causing distortions.
- Both under-smoothing and over-smoothing can result in poor pricing and anti selection.
Distance based smoothing
- The further away a location code, the less weight is given to experience
- Regardless of rural/urban or natural/artificial boundaries.
- used for weather related perils
- easy to understand and implement (no distribution assumptions)
- Can be enhanced to include other dimensions e.g density