01.06.2015 Views

Actuarial Modelling of Claim Counts Risk Classification, Credibility ...

Actuarial Modelling of Claim Counts Risk Classification, Credibility ...

Actuarial Modelling of Claim Counts Risk Classification, Credibility ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>Risk</strong> <strong>Classification</strong> 103<br />

<strong>Modelling</strong> Dependence with the ‘Working Correlation Matrix’<br />

The ‘working’ correlation matrix R i takes into account the dependence between the<br />

observations corresponding to the same policyholder. The form <strong>of</strong> this matrix must be<br />

specified and depends on the parameter vector .<br />

If R i = I, (2.18) gives exactely the likelihood equations (2.16) under the assumption<br />

<strong>of</strong> independence. The SAS R /STAT procedure GENMOD supports the following structures<br />

<strong>of</strong> the ‘working’ correlation matrix: fixed (user-specific correlation matrix, not estimated<br />

from the data but specified by the actuary), independent (R i = I, giving the estimates<br />

<strong>of</strong> obtained under serial dependence), m-dependent (correlation equal to t for lags<br />

t = 1m, and 0 for higher lags), exchangeable (constant correlation , whatever the<br />

lag), unstructured (each correlation coefficient jk , between observations made at times j<br />

and k, is estimated from the data), and AR1 (autoregressive <strong>of</strong> order 1, with a correlation<br />

coefficient equal to t at lag t).<br />

Numerical Example<br />

The GEE approach can be performed thanks to the procedure GENMOD <strong>of</strong> SAS R . The<br />

‘Repeated’ statement <strong>of</strong> GENMOD invokes the GEE method, specifies the correlation structure,<br />

and controls the displayed output from the GEE model. Initial parameter estimates for iterative<br />

fitting <strong>of</strong> the GEE model are computed as in a Poisson regression model, as described<br />

previously. Results <strong>of</strong> the initial model fit are displayed as part <strong>of</strong> the generated output<br />

<strong>of</strong> SAS R . Statistics for the initial model fit such as parameter estimates, standard errors,<br />

deviances, and Pearson Chi-squares do not apply to the GEE model, and are only valid<br />

for the initial model fit. The SAS R parameter estimates table contains parameter estimates,<br />

standard errors, confidence intervals, Z-scores, and p-values for the parameter estimates.<br />

The ‘Repeated’ statement specifies the covariance structure <strong>of</strong> multivariate responses for<br />

GEE model fitting in the GENMOD procedure. In addition, the ‘Repeated’ statement controls<br />

the iterative fitting algorithm used in GEE and specifies optional output. Other GENMOD<br />

procedure statements are used in the same way as they are for ordinary Poisson regression<br />

models to specify the regression model for the mean <strong>of</strong> the responses.<br />

The statement ‘SUBJECT = subject-effect’ identifies subjects in the input data set. The<br />

subject-effect can be a single variable, an interaction effect, a nested effect, or a combination.<br />

Each distinct value, or level, <strong>of</strong> the effect identifies a different subject, or cluster. Responses<br />

from different subjects are assumed to be independent, and responses within subjects are<br />

assumed to be correlated. A subject-effect must be specified, and variables used in defining<br />

the subject-effect must be listed in the CLASS statement. In actuarial applications, the policy<br />

number is typically used as subject-effect.<br />

The same variables as for the model where the serial independence was assumed are kept<br />

in the final model. The results are given in Table 2.12 that is similar to the SAS R outputs<br />

for GEEs. The estimation <strong>of</strong> the ‘working correlation matrix’ gives<br />

⎛<br />

⎝<br />

1 00493 00460<br />

00493 1 00493<br />

00460 00493 1<br />

and ̂ = 13419.<br />

Since the GEE apprach is not based on a likelihood function, we cannot use the large<br />

sample approximations for the estimated variance-covariance matrix ̂̂ <strong>of</strong> the estimated<br />

⎞<br />

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!