Taylor and McGuire Flashcards by Jordan Lastnik

Mean and variance of exponential dispersion family (EDF) of distributions

mean = mu

variance = dispersion parameter * variance function

How well did you know this?

Not at all

Perfectly

Variance function for the Tweedie sub-family of EDFs

variance = mu ^ p
And p between 0 and 1 (inclusive)

in words: variance is proportional to a power of the mean

How well did you know this?

Not at all

Perfectly

Relationship between p for the Tweedie distribution and tail heaviness

tail heaviness increases as p increases

How well did you know this?

Not at all

Perfectly

Mean and variance of a Tweedie distribution

mean = mu = [ ( 1 - p ) * theta ] ^ ( 1 / ( 1 - p ) )

where theta = location parameter

variance = dispersion parameter * mu ^ p

How well did you know this?

Not at all

Perfectly

General GLM format (in matrix notation)

Taylor and McGuire

link function = transposed covariate matrix * beta matrix

where betas are the linear response variables, and the link function transforms the mean of each observation into a linear function of the parameters (betas)

How well did you know this?

Not at all

Perfectly

Conditions for the structure of a GLM (3)

Taylor and McGuire

each observation is a member of the EDF
h(mu_i) = x^T * B
observations are stochastically independent

How well did you know this?

Not at all

Perfectly

Underlying assumptions of a standard linear regression (3)

errors are normally distributed
errors have constant variance
linear relationship between X and Y

How well did you know this?

Not at all

Perfectly

Difference b/w weighted linear regression and standard linear regression

weighted linear regression recognizes errors might have unequal variances

How well did you know this?

Not at all

Perfectly

Main differences between a GLM and a linear regression (2)

non-linear relationship between X and Y
non-normal error term

How well did you know this?

Not at all

Perfectly

Common estimation method for GLM parameters

MLE

How well did you know this?

Not at all

Perfectly

Requirements for selection of a GLM and purpose of each (4)

selection of:

cumulant function (controls the shape of the distribution)
index, p (controls relationship b/w mean and variance in an EDF)
covariates (x’s = explanatory variables)
link function (controls relationship b/w mean and covariates)

How well did you know this?

Not at all

Perfectly

Measure of model goodness of fit

Taylor and McGuire

deviance

> > smaller = better

How well did you know this?

Not at all

Perfectly

Deviance formula (unscaled)

deviance = 2 * sum ( log-likelihood (perfect model ) - log-likelihood ( actual model ) )

How well did you know this?

Not at all

Perfectly

Scale parameter calculated from deviance and corresponding distribution
(Taylor and McGuire)

scale parameter = deviance / ( n - p )

> > Chi-square distribution w/ ( n - p ) df

How well did you know this?

Not at all

Perfectly

Standardized Pearson Residuals (Taylor and McGuire)

= raw residual / std. dev. ( observation )

unbiased and homoscedastic

How well did you know this?

Not at all

Perfectly

Problem with standardized Pearson residuals

Taylor and McGuire

does not remove skewness from the data

we prefer residuals that are approximately normally distributed

Best residual to use for model assessment and why

Taylor and McGuire

deviance residuals

why: corrects any non-normality in the data

Deviance residual

Rd_i = sqrt(d_i / phi^hat) * sign(Y_i - Y_i^hat)

Types of stochastic models (4)

Taylor and McGuire

non-parametric Mack model
parametric Mack models
cross-classified models
GLM representations of CL models

Results of non-parametric Mack model (2)

estimators of CL age-to-age factors are MVUE’s among estimators that are unbiased linear combinations
unbiased CL reserve estimates

Special cases of parametric Mack models (2)

ODP Mack model
Tweedie Mack model

these remove Variance assumption from Mack, variance confined to the variance of an EDF distribution

Assumption required to turn a non-parametric Mack model into a parametric one

require that incremental observations (given claims to date) come from the EDF distributions

Theorem 3.1 from Taylor and McGuire (3 MVUE results for parametric models)

under EDF and general Mack assumptions:

MLEs are the unbiased CL estimators
for ODP Mack and column dispersion parameters, CL estimators are MVUEs
cumulative loss and reserve estimates are also MVUEs

Reason Taylor and McGuire’s parametric MVUE results are stronger than regular Mack results

minimum variance of all unbiased estimators, not just linear combinations

EDF cross-classified model assumptions (2)

1. stochastic independence of response variable 2. explicit row and column parameters, where column parameters sum to 1

Theorem 3.2 from Taylor and McGuire (EDF and ODP cross-classified results)

under the EDF cross-classified assumptions, restricted to an ODP distribution with constant dispersion parameter, the MLE fitted values and forecasts are the same as the usual CL method

Theorem 3.3 from Taylor and McGuire (MVUE for cross-classified models)

if theorem 3.2 applies and the fitted and forecasted values are corrected for bias, then they are MVUEs

Alpha and beta parameter calculations for non-GLM version of ODP cross-classified model, order of calculations, and forecasted incremental losses

order: alpha increasing, beta decreasing alpha ( 1 ) = latest cumulative loss all other alphas = latest cumulative loss / ( 1 - sum of already calculated betas ) beta = sum ( incremental losses in column ) / sum (already calculated alphas ) forecasted incremental losses = alpha * beta for given row and column

Difference b/w ODP Mack model and ODP cross-classified model under GLM representations of CL models

ODP Mack models link ratios ODP cross-classified models incremental losses

Matrix notation for GLM representation of ODP Mack model

Y = X * beta where Y = individual predicted age-to-age factors (all) X = identity matrix beta = volume weighted age-to-age factors

Matrix notation for GLM representation of ODP cross-classified model

Y = X * beta where Y = estimated incremental losses (all) X = identity matrix beta = ln of all alpha and beta parameters (single column in order)

Problem with GLM representation of ODP cross-classified model and consequence

can lead to parameter redundancy, need to alias a parameter (GLM software will do this automatically) >> if parameters are aliased, estimates will not match the non-GLM version (rescale alpha and beta parameters)

Forecast design matrix (GLM ODP cross-classified model)

same as regular GLM representation but only shows estimates of future incremental losses