01.06.2015 Views

Actuarial Modelling of Claim Counts Risk Classification, Credibility ...

Actuarial Modelling of Claim Counts Risk Classification, Credibility ...

Actuarial Modelling of Claim Counts Risk Classification, Credibility ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

64 <strong>Actuarial</strong> <strong>Modelling</strong> <strong>of</strong> <strong>Claim</strong> <strong>Counts</strong><br />

2.3.2 Loglinear Poisson Regression Model<br />

In Poisson regression, we have a collection <strong>of</strong> independent Poisson counts whose means are<br />

modelled as nonnegative functions <strong>of</strong> covariates. Specifically, let N i , i = 1 2n,bethe<br />

number <strong>of</strong> claims reported by policyholder i and d i be the corresponding risk exposure (the<br />

N i s are assumed to be independent). All the observable characteristics (the a priori variables<br />

presented in Section 2.2 for Portfolio A, say) related to this policyholder are summarized<br />

into the vector xi<br />

T = x i1 x ip . Poisson regression is a technique analogous to linear<br />

regression except that errors no longer follow a Normal distribution but the randomness<br />

in the model is described by a Poisson distribution. We first assume that the conditional<br />

expectation <strong>of</strong> N i given x i is <strong>of</strong> the form<br />

(<br />

)<br />

p∑<br />

EN i x i = d i exp 0 + j x ij i= 1 2n (2.1)<br />

j=1<br />

where T = 0 1 p is the vector <strong>of</strong> unknown regression coefficients. The<br />

explanatory variables enter the model in the linear combination 0 + ∑ p<br />

j=1 jx ij , where 0<br />

acts as an intercept and j is the coefficient indicating the weight given to the jth covariate.<br />

The Poisson regression model consists in stating that N i is Poisson distributed with mean<br />

given by Expression (2.1), that is<br />

( (<br />

))<br />

p∑<br />

N i ∼ oi d i exp 0 + j x ij i= 1 2n<br />

2.3.3 Score<br />

The quantity<br />

j=1<br />

score i = 0 +<br />

p∑<br />

j x ij<br />

is called the score (or linear predictor in statistics) because it allows the actuary to rank the<br />

policyholders from the least to the most dangerous. The claim frequency for policyholder i is<br />

d i expscore i ; its annual claim frequency being expscore i . Increasing the score thus means<br />

that the associated average annual claim frequency increases. The use <strong>of</strong> the exponential link<br />

function ensures the claim frequency is positive even if the score is negative.<br />

Let us denote as ̂ 0 ̂ 1 ̂ p the estimators <strong>of</strong> the regression coefficients<br />

0 1 p . In a statistical sense,<br />

̂i = d i exp ( (<br />

)<br />

) p∑<br />

ŝcore i = di exp ̂0 + ̂j x ij<br />

is the predicted expected number <strong>of</strong> claims for policyholder i. Prediction in this sense does<br />

not refer to ‘predicting the future’ (called forecasting by statisticians) but rather to guessing<br />

the expected number <strong>of</strong> claims (i.e., the response) from the values <strong>of</strong> the regressors in an<br />

observation taken under the same circumstances as the sample from which the regression<br />

equation was estimated.<br />

j=1<br />

j=1

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!