01.06.2015 Views

Actuarial Modelling of Claim Counts Risk Classification, Credibility ...

Actuarial Modelling of Claim Counts Risk Classification, Credibility ...

Actuarial Modelling of Claim Counts Risk Classification, Credibility ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

112 <strong>Actuarial</strong> <strong>Modelling</strong> <strong>of</strong> <strong>Claim</strong> <strong>Counts</strong><br />

<strong>Risk</strong> classification techniques for claim counts have been the topic <strong>of</strong> many papers<br />

appearing in the actuarial literature. Early references include ter Berg (1980b) and<br />

Albrecht (1983a,b,c). Dionne & Vanasse (1989, 1992) used a Negative Binomial<br />

regression model, while Dean, Lawless & Willmot (1989) used a Poisson-Inverse<br />

Gaussian distribution to fit the number <strong>of</strong> claims. Cummins, Dionne, McDonnald &<br />

Pritchett (1990) applied the GB2 family <strong>of</strong> distributions in modelling claim counts.<br />

Ter Berg (1996) considered the Generalized Poisson distribution <strong>of</strong> Consul (1990) and<br />

incorporated explanatory variables with the help <strong>of</strong> a loglinear model. There are now several<br />

textbooks devoted to the statistical analysis <strong>of</strong> count data. Let us mention Cameron &<br />

Trivedi (1998) and Winkelmann (2003). Before the Poisson regression became popular<br />

among actuaries, claims data were <strong>of</strong>ten analysed using logistic regression; see, e.g.,<br />

Beirlant, Derveaux, De Meyer, Goovaerts, Labies & Maenhoudt (1991).<br />

Separate analyses are usually conducted for claim frequencies and costs, including<br />

expenses, to arrive at a pure premium. With the noticeable exception <strong>of</strong> Jorgensen<br />

& Paes de Souza (1994), all the actuarial analyses <strong>of</strong> the pure premium so far have<br />

examined frequencies and severities separately. This approach is particularly relevant in<br />

motor insurance, where the risk factors influencing the two components <strong>of</strong> the pure premium<br />

are usually different.<br />

2.10.2 Nonlinear Effects<br />

GLMs however only deal with categorical risk factors in an efficient way. The main drawback<br />

<strong>of</strong> GLMs is that covariate effects are modelled in the form <strong>of</strong> a linear predictor. GLMs are<br />

too restrictive if nonlinear effects <strong>of</strong> continuous covariates are present. Continuous covariates<br />

can efficiently enter GLMs only if they are suitably transformed to reflect their true effect<br />

on the score scale. However, it is not always clear how the variables should be transformed.<br />

It has been common practice in insurance companies to model possibly nonlinear effects<br />

<strong>of</strong> a covariate by polynomials. However, it is well known to statisticians that polynomials<br />

are <strong>of</strong>ten not flexible enough to capture the variability <strong>of</strong> the data particularly when the<br />

polynomial degree is small (see, e.g., Fahrmeir & Tutz (2001), Chapter 5). For larger<br />

degrees the flexibility <strong>of</strong> polynomials increases but at the cost <strong>of</strong> possibly high variability<br />

<strong>of</strong> resulting estimates particularly at the left and right extreme values <strong>of</strong> the covariate.<br />

A more flexible approach for modelling nonlinear effects can be based on piecewise<br />

polynomials. More specifically, the unknown functions are approximated by polynomial<br />

splines, which may be regarded as piecewise polynomials with additional regularity<br />

conditions (see, e.g., de Boor, 1978). We refer the interested reader to Denuit & Lang<br />

(2004) for an overview <strong>of</strong> the existing approaches.<br />

2.10.3 Zero-Inflated Models<br />

Insurance data usually include a relatively large number <strong>of</strong> zeros (no claim). Deductibles<br />

and no claim discounts increase the proportion <strong>of</strong> zeros, since small claims are not reported<br />

by insured drivers. Zero-inflated models, including the Zero-Inflated Poisson (ZIP) model,<br />

account for this phenomenon.<br />

ZIP models can be considered as a mixture <strong>of</strong> a zero point mass and a Poisson distribution<br />

and were first used to study soldering defects on print wiring boards (Lambert, 1992). To

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!