01.06.2015 Views

Actuarial Modelling of Claim Counts Risk Classification, Credibility ...

Actuarial Modelling of Claim Counts Risk Classification, Credibility ...

Actuarial Modelling of Claim Counts Risk Classification, Credibility ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

<strong>Risk</strong> <strong>Classification</strong> 63<br />

Example 2.3 Figure 2.7 suggests that the variables Age and Gender interact in Portfolio<br />

A. It is thus not possible to represent accurately the effect <strong>of</strong> being a male policyholder<br />

(compared with being a female policyholder) in terms <strong>of</strong> a single multiplier, nor can the<br />

effect <strong>of</strong> Age be represented by a single multiplier. The relevant explanatory variables are not<br />

x i1 , x i2 , x i3 , and x i4 but rather x i1 x i4 , x i2 x i4 , x i3 x i4 , 1 − x i1 x i2 x i3 x i4 , x i1 1 − x i4 , x i2 1 − x i4 ,<br />

and x i3 1 − x i4 (denoted henceforth as x<br />

ij ′ ).<br />

To reflect the situation accurately, it is thus necessary to consider multipliers dependent<br />

on the combined levels <strong>of</strong> Age and Gender. To this end, a variable Gender ∗ Age is created,<br />

with levels ‘Female 18–24’, ‘Female 25–30’, ‘Female 31–60’, ‘Female over 60’, ‘Male<br />

18–24’, ‘Male 25–30’, ‘Male 31–60’ and ‘Male over 60’. This new variable possesses 8<br />

levels, and is coded by means <strong>of</strong> 7 dummies, being all 0 for the reference level (taken as<br />

‘Male 31–60’). Specifically, the explanatory variables x i1 to x i4 in Example 2.2 are replaced<br />

with<br />

{ 1 if policyholder i is a female less than 24<br />

x ′ i1 = 0 otherwise<br />

{ 1 if policyholder i is a female aged 25–30<br />

x ′ i2 = 0 otherwise<br />

{ 1 if policyholder i is a female aged 31–60<br />

x ′ i3 = 0 otherwise<br />

{ 1 if policyholder i is a female over 60<br />

x ′ i4 = 0 otherwise<br />

{ 1 if policyholder i is a male less than 24<br />

x ′ i5 = 0 otherwise<br />

{ 1 if policyholder i is a male aged 25–30<br />

x ′ i6 = 0 otherwise<br />

{ 1 if policyholder i is a male over 60<br />

x ′ i7 = 0 otherwise<br />

Rather than declaring Age and Gender as two explanatory variables coded with the help <strong>of</strong><br />

x i1 to x i4 , a combined Age ∗ Gender variable is declared and is coded with the help <strong>of</strong> the<br />

covariates x<br />

i1 ′ to x′ i7 .<br />

The part <strong>of</strong> the linear predictor involving Age and Gender is 1 x i1 +···+ 4 x i4 without<br />

interaction, and becomes ′ 1 x′ i1 +···+′ 7 x′ i7<br />

if allowance is made for interaction. If Age and<br />

Gender indeed interact then the values <strong>of</strong> these two linear combinations differ from each<br />

other, whereas they collapse in the absence <strong>of</strong> interaction.<br />

Allowing for interaction thus dramatically increases the number <strong>of</strong> explanatory variables.<br />

Introducing Age and Gender separately requires 4 binary variables whereas allowing<br />

for the Age–Gender interaction requires 7 dummies. As a consequence, the number<br />

<strong>of</strong> parameters to be estimated increases accordingly. We will see below that grouping<br />

some levels <strong>of</strong> the combined variable accounting for interaction is nevertheless <strong>of</strong>ten<br />

possible.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!