01.06.2015 Views

Actuarial Modelling of Claim Counts Risk Classification, Credibility ...

Actuarial Modelling of Claim Counts Risk Classification, Credibility ...

Actuarial Modelling of Claim Counts Risk Classification, Credibility ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>Risk</strong> <strong>Classification</strong> 81<br />

The unobserved heterogeneity (when correlated to observable characteristics) thus modifies<br />

the regression coefficients: the true effect <strong>of</strong> X i on N i becomes an apparent effect ˜.<br />

Hence, the estimated jth regression coefficient does not only represent the effect <strong>of</strong> the<br />

jth covariate on the number <strong>of</strong> claims, but also accounts for the effect <strong>of</strong> all the hidden<br />

characteristics Z i correlated with the jth observable one. This is why the ̂ j s may strongly<br />

depend on which covariates are included in the model. Moreover, there remains an error<br />

term ˜ i representing the influence <strong>of</strong> the hidden variables on N i , corrected for the effect <strong>of</strong><br />

the observed risk factors X i .<br />

For these reasons, we now consider a mixed Poisson model<br />

( (<br />

))<br />

p∑<br />

N i ∼ oi exp 0 + j x ij + i i= 1 2n (2.10)<br />

j=1<br />

where the random variable i represents the residual effect <strong>of</strong> the hidden characteristics.<br />

Therefore, the heterogeneity is taken into account by assuming that the number <strong>of</strong> accidents<br />

is Poisson distributed with mean varying from one policyholder to another.<br />

Note that some hidden characteristics are correlated to those in X i (e.g. the hidden annual<br />

mileage and the observable use <strong>of</strong> the vehicle). The random variable i in (2.10) models the<br />

effect <strong>of</strong> hidden characteristics that is not already explained by X i . Since i accounts for a<br />

residual effect, we will consider in the remainder <strong>of</strong> this book that i is independent from<br />

X i . The price to pay is that the estimated regression coefficient j does not only express the<br />

effect <strong>of</strong> the jth regressor, but also the effect <strong>of</strong> all the hidden characteristics correlated with<br />

the jth regressor. This is important when the actuary tries to interpret the resulting price list.<br />

The policyholders have different accident proneness because <strong>of</strong> observable characteristics<br />

taken into account in the price list and hidden characteristics to be corrected a posteriori.<br />

The heterogeneity is taken into account by assuming that the number <strong>of</strong> accidents is Poisson<br />

distributed with mean varying from one policyholder to another. The annual claim frequency<br />

becomes a random variable i i where i = exp i models the oscillations around the grand<br />

mean i (with E i = 1). We can now write N i ∼ Poi i i where i = d i exp˜x T i .<br />

As in (1.29), we have<br />

VN i = i + 2 i V i> i = EN i (2.11)<br />

so that any mixed Poisson regression model induces overdispersion.<br />

2.4.5 Detecting Overdispersion<br />

Residual heterogeneity remains considerable in the risk classes despite the use <strong>of</strong> many a<br />

priori variables. Indeed, many explanatory variables are unknown to the insurance company<br />

and cannot be incorporated in the price list. Let us denote as ̂m k the empirical mean claim<br />

number <strong>of</strong> risk class k, and ̂<br />

k<br />

2 the associated variance. In order to graphically test the<br />

Poisson mixture assumption, we have plotted the points ̂m k ̂<br />

k 2 and also the bisecting line.<br />

The plot is displayed in Figure 2.9. We observe that ̂m k < ̂<br />

k<br />

2 in numerous risk classes,<br />

indicating that the homogeneous Poisson model is inappropriate. We also see that most <strong>of</strong><br />

the observed pairs ̂m k ̂<br />

k 2 lie above the bisecting line, thus supporting overdispersion. The<br />

three points below the 45-degree line correspond to classes with just a few policies (36, 13<br />

and 12, respectively).

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!