01.06.2015 Views

Actuarial Modelling of Claim Counts Risk Classification, Credibility ...

Actuarial Modelling of Claim Counts Risk Classification, Credibility ...

Actuarial Modelling of Claim Counts Risk Classification, Credibility ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Efficiency and Bonus Hunger 235<br />

Table 5.4 Results <strong>of</strong> the Inverse Gaussian regression on the claim costs recorded in Portfolio C.<br />

Variable Level Coeff Std error Wald 95 % conf limit Chi-sq Pr>Chi-sq<br />

Intercept 71169 00282 70616 71722 636343 < .0001<br />

Ageph 18–24 02293 01104 00129 04457 431 0.0378<br />

Ageph 25–30 01699 00697 00334 03065 595 0.0147<br />

Ageph > 30 0 0 0 0 . .<br />

Agev 0–2 01609 00756 00127 03091 453 0.0334<br />

Agev > 2 0 0 0 0 . .<br />

LogNormal Distribution<br />

Before the generalized linear models gained popularity in the actuarial pr<strong>of</strong>ession, claim sizes<br />

were <strong>of</strong>ten analysed using a Normal linear regression model after having been transformed<br />

to the log-scale. Although the results are usually quite similar for this method and Gamma<br />

regression, the latter approach is easier to interpret since it does not require any logarithmic<br />

transformation <strong>of</strong> the claim costs.<br />

Assume that the moderate claim sizes for policyholder i are independent and LogNormally<br />

distributed, with parameters 0 + ∑ p<br />

j=1 jx ij and 2 . Specifically, the C ik s are independent,<br />

and identically distributed for fixed i, with C ik ∼ Nor 0 + ∑ p<br />

j=1 jx ij 2 . Let n i be the<br />

number <strong>of</strong> claims reported by policyholder i, and let c i1 c i2 c ini be the corresponding<br />

claim costs. The likelihood associated with the observations is<br />

= ∏<br />

n<br />

∏ i<br />

in i >0 k=1<br />

⎛ (<br />

1<br />

√<br />

2cik exp ⎝− 1<br />

2 2<br />

ln c ik − 0 −<br />

p∑<br />

j=1<br />

j x ij<br />

) 2<br />

⎞<br />

⎠ <br />

The maximum likelihood estimators are obtained with the help <strong>of</strong> Newton-Raphson<br />

techniques. The average cost <strong>of</strong> a standard claim for policyholder i is then obtained from<br />

the formula<br />

(<br />

)<br />

p∑<br />

EC ik x i = exp 0 + j x ij + 2<br />

<br />

2<br />

Note that, in contrast to the Gamma and Inverse Gaussian cases, we cannot easily deal with<br />

the situation where only the total amount <strong>of</strong> moderate claims is available. This is due to the<br />

fact that the LogNormal family <strong>of</strong> distributions is not closed under convolution. Therefore,<br />

we fit the model to the observations made on policyholders having filed a single standard<br />

claim.<br />

Example 5.3 (LogNormal Regression for the Moderate <strong>Claim</strong> Costs in Portfolio C) The<br />

LogNormal regression cannot be performed with the help <strong>of</strong> SAS R /STAT procedure<br />

GENMOD (which does not support the LogNormal distribution). Often in practice, the<br />

data are first transformed on the log-scale, and a standard linear model is then fitted to<br />

the logarithms <strong>of</strong> the claim amounts. This ad-hoc procedure will be avoided here, and a<br />

maximum likelihood estimation procedure is performed on the original claim costs.<br />

j=1

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!