0559 0587
0559 0587
[Presented at the Seminar, ‘Applications of Mathematics in Insurance, Finance and Actuarial Work’,
sponsored by the Institute of Mathematics and Its Applications, the Institute of Actuaries, and the
Faculty of Actuaries, held at the Institute of Actuaries, 6–7 July 1989.]
1. MOTIVATION
(i) To develop more fully the statistical analogue of the original actuarial
chain-ladder technique.
(ii) To investigate the magnitude and nature of predictor instability asso-
ciated with the technique.
559
560 Chain Ladder and Interactive Modelling
The GLIM software package, because of its user defined macro facility, is an
invaluable tool in achieving these objectives. Indeed we note with interest that
Taylor (1983)(8) and Taylor & Ashe (1983)(9) used the GLIM package to fit Taylor’s
so called ‘invariant see-saw’ model to run-off data.
We identify our philosophical approach to estimating claims whole-heartedly
with the sentiments expressed by Taylor & Ashe (1983)(9) from which we quote
the following passage:
Our view is that claims analysis is a special case of data analysis; that therefore there are few
preconceptions as to what should be done with the data; indeed, anything goes, if it leads to a model
which exhibits acceptable adherence to the data and is plausible in the light of any collateral
information. To us, faced with a problem of multivariate data analysis, regression analysis represents
a most useful exploratory tool.
We would view this application of GLIM to run-off data as the natural extension
of other applications of generalized linear models in actuarial work reported by
Haberman and Renshaw (1988).(4)
2. CLAIMS DATA
Claims run-off data are generated when delay is incurred in settling insurance
claims. Typically the format for such data is that of a triangle (Figure 1.1) in
which the rows (i) denote accident years and the columns (5) delay or
development years. The settlement or payment year is k=i+j–1. The entries in
the body of the triangle are the adjusted (non-cumulative) amounts
Figure 1.1.
(Claims Reserving and GLIM) 561
The triangle is augmented each year with the addition of a new diagonal. Two
noteworthy variations of the triangular format are induced by either truncation
after a fixed period of delay or by the removal of data for the early settlement
years.
Additional information in the guise of numbers of claims settled per cell is
required to implement Taylor’s (1983)(8) ‘invariant see-saw’ method.
An obvious first step in any analysis is to plot the adjusted claims against
accident year, against development year and against payment year. One might
even be tempted to use a three-dimensional plot. Such displays can be very
informative about the type of model structure that the data might support.
The remit is essentially to predict likely claim amounts in the incomplete south-
east region bounded by broken lines in Figure 1.1. A two stage modelling/
predicting process is envisaged.
3. LOG-NORMAL MODELS
Let
Yij= log(Cij )
with
and
Here we have assumed that the normal responses Yij decompose (additively)
into deterministic non-random components (means) mij and independent
homoscedastic normally distributed random error components about a zero
mean. It will be necessary to monitor these assumptions by displaying various
residual plots on fitting specific model structures to the logarithms of the adjusted
claims data.
A number of specific model structures are of interest. These include:
Case (II)
562 Chain Ladder and Interactive Modelling
so that yi<0 ensures claims amounts ultimately decay. Referring to the data
matrix sketched in Figure 3.1(c), prediction beyond the observed limit of d as well
as in the south east triangular region is feasible.
Case (IV) M:
Here we have written d for j when j exceeds some fixed integer q. The model is
clearly a mixture of Case I and Case III applied to separate parts of the data
matrix.
Each of the models discussed above has obvious submodels. We concentrate
on Case I.
4. MODEL FITTING
denote the total number of observations, the number of observations in row iand
the number of observations in column j respectively.
We choose µ, ai, µ, âi,ßj(i,j#
1) SO as to minimize
Partial differentiation with respect to µ, âi, for each i ( # 1) and ßj for each j ( # 1)
leads to the system of linear equations
where
(Claims Reserving and GLIM) 565
denote the grand total, row totals and column total of the transformed adjusted
claims. The solution of this set of non-singular linear equations yield the required
estimates.
By way of illustration, the artificial data set
j→ 1 2 3 Totals
i1 2 4 6 12
↓2 2 3 4 9
3 4 2 5
4 2
Totals 9 9 10 28
(l = 3, ω = 1, r = 4, g= 0)
gives rise to the system of linear equations
28 3 3
9 2 2 1
9 1 1
3 3 0 0
10 2 0 2 1 0 0
=
9 3 1 1 3 0 0
5 2 1 0 0 2 0
1 0 0 0 0 0
2
are
2·917 3·583 5·500
1·917 2·583 4·500
2·167 2·833 4·50
2·000 2·666 4·583
Scrutiny of these fitted and predicted values reveals the true nature of the
assumed non-interactive model structure which manifests itself in the constant
differences between columns and between rows.
566 Chain Ladder and Interactive Modelling
A noteworthy submodel is that involving development year effects only. The
one-way ANOVA sub-structure is
H: mij=µ+ßj
where, again we define ß1=0 because of overparameterization. This time the
incomplete nature of the data matrix (Figure 4.1) is irrelevant. The parameter
estimates are determined by
The solution is
so that the fitted and predicted values are the column averages. Justification for
using this simplified model is sought by examining the t-statistics associated with
the parameters ai, examination of further residual plots and through a formal
ANOVA F-test based on the statistic
in which RM and RH denote the residual sums of squares or deviance under the
full mode1 M and the submodel H respectively.
Whereas it has been established by Kremer (1982)(6) that the model structure in
use here is identical to that utilized in the standard actuarial chain-ladder
technique as described, for example, in Hossack et al. (1983)(s), the current
treatment of the model differs in two important respects-namely the ways in
which the model parameters are estimated and the predicted values are
constructed.
5. PREDICtED VALUES
(5.1)
provides a point predictor for the empty (i,j)th cell in the south east triangular
region. Since the mij are linear in the Yijs, they are distributed normally with
(5.2)
and
(5.3)
(Claims Reserving and GLIM) 567
If, in keeping with common practice, the predictor is augmented by an
independent additive error term, distributed as N(0, 2), then 2 has to be added
to the RHS of (5.3).
Reverting to the original (anti-log) scale, predictors ij are needed where
Since the ijSare normally distributed, the ijSare log-normally distributed with
(5.4)
and
(5.5)
One method of computing predicted values and their standard errors, apparently
favoured by Zehnwirth (1985)(14), is based on (5.4) and (5.5) in which E( ij) and
V(mij) are replaced by their estimated values as dictated, in this instance by (5.2)
and (5.3). It should be stressed, however, that Zehnwirth is working within a
Bayesian framework and would presumably seek to justify the method of
prediction within this framework.
(6.1)
from which (5.5) is retrieved on settingj= k. Further, (5.1) implies that forj#k
(6.2)
This time (5.3) is retrieved on settingj=k. Also note the useful identity
and
catering for between row dependencies are needed to compute the variances of
the predicted diagonal totals and the overall predicted total. Notice that (6.1) and
(6.2) are retrieved on setting il = i2 = i (together with ji =j, j2 = k).
7. PREDICTOR INSTABII.ITY
First the comment that the adjusted claim amounts are generally characterized
by significant differences between development years but only small differences
across accident years.
The extent of any instability exhibited by each predicted value depends directly
on the number of parameters used to make the prediction, in this case just three
which is not excessive, and more importantly on the extent to which the estimates
of these parameters are sensitive to fluctuations in the data. Not surprisingly in
view of the nature of the model structure and data format, simulation exercises
confirm that predictions are sufficiently robust to data fluctuations in the heart of
and in the north-west corner of the run-off triangle; and that stability deteriorates
as data points further into the other two corners of the run-off triangle are varied.
However, the instability in the north-east corner is generally not a serious
problem since claims amounts in this region are relatively low in comparison with
the remainder of the data triangle. The position is further improved if truncation
has occurred.
(Claims Reserving and GLIM) 569
The objective is to partition the set of ais by varying the limit h. This would seem
to work well, is objective, intuitively appealing, and induces the required degree
of stability provided no new parameters are allocated to the more recent accident
year.
8. IMPLEMENTATION
This is by user defined macros within GLIM. Essentially four primary macros
are required:
(i) to create related vectors, scalars and to output data plots;
(ii) to do the model fitting and output graphical checks;
(iii) to conduct the multiple comparison t-tests;
(iv) to output further graphical checks; to compute and output the predicted
claims amounts, their totals and standard errors.
It is suggested that these macros could form the basis of a more extensive suite of
macros to be offered to practitioners. It is noted with interest that one such
practitioner, Taylor ( 1988),(11)strongly recommends the use of such regression
methods.
9. AN APPLICATION
development 1
2 3 4 5 6 7 8 9 10
year j
accident 1 357848 766940 610542 482940 527326 574398 146342 139950 227229 61948
year 2 352118 884021 933894 1183289 445745 320996 527804 266172 425046
9 376686 986608
10 344014
EXPOSURES
610 721 697 621 600 552 543 503 525 420
Figure 9. I.
(Claims Reserving and GLIM) 571
Residual plots for the two-way ANOVAModel defined by (3.1) (Figures 9.2(a)–
(e)) are reasonably supportive of the model although the histogram is slightly
skewed. Estimates for the model parameters and their standard errors are given
in standard GLIM format (Table 9.2). Here the model parameters of (3.1) have
been recoded according to 1 for µ, the general mean; DY _(j) for βj, the
development year parameters and AY _(i) for αi, the accident year parameters.
The system automatically sets 1α = β 1 = 0, a feature utilized in the development of
Section 4.
Figure 9.2(a).
Figure 9.2(b).
572 Chain Ladder and Interactive Modelling
Figure 9.2(c).
Figure 9.2(d).
(Claims Reserving and GLIM) 573
Figure 9.2(e)
Table 9.2
[o]
[o]
[o] estimate s.e. parameter
[o] 1 6.106 0.1646 1
[o] 2 0.9112 0.1607 DY_(2)
[o] 3 0.9387 0.1681 DY_(3)
[o] 4 0.9650 0.1761 DY_(4)
[o] 5 0.3832 0.1857 DY_(5)
[o] 6 -0.0004909 0.1978 DY_(6)
[o] 7 -0.1181 0.2142 DY_(7)
[o] 8 -0.4393 0.2387 DY_(8)
[o] 9 -0.05351 0.2806 DY_(9)
[o] 10 -1.393 0.3786 DY_(10)
[o] 11 0.1938 0.1607 AY_(2)
[o] 12 0.1489 0.1681 AY_(3)
[o] 13 0.1533 0.1761 AY_(4)
[o] 14 0.2988 0.1857 AY_(5)
[o] 15 0.4117 0.1978 AY_(6)
[o] 16 0.5084 0.2142 AY_(7)
[o] 17 0.6731 0.2387 AY_(8)
[o] 18 0.4952 0.2806 AY_(9)
[o] 19 0.6018 0.3786 AY_(10)
[o] scale parameter taken as 0.1162
[o]
574 Chain Ladder and Interactive Modelling
Attempted model simplification by excluding accident year effects leads to an
F-statistic value of 1.481 on 9,36 degrees of freedom with an observed
significance level of approximately 20%. Whereas this is supportive of the
simplification, two of the residual plots (Figures 9.3(a) and (b)) under the
simplified one-way development year effects model become unacceptably
distorted. The explanation for this is possibly to be found in the values of the
parameter estimates (Table 9.2) under the full two-way ANOVAmodel. The t-
statistics (obtained by dividing the estimates by their standard errors) indicate
that the accident year parameters from year six onwards are all in fact significant;
a feature which would appear to synchronize with the residual plots (Figures
9.3a–b). Consequently, we retain the two-way ANOVAmodel for the time being.
We also have a vested interest in investigating the extent of predictor instability
for this model. The run-off claims data, their expected (fitted) values under this
model, the predicted claims values and their standard errors are presented in
Table 9.3 together with the predicted totals and their standard errors.
We are involved in a two stage process in which the data are first utilized to
calibrate/validate the proposed model before moving to the predictive second
stage. Model validation is done through scrutiny of response and residual plots
coupled with attempted model simplifications where appropriate. Given a
satisfactory model, both the magnitude of the standard errors of the predicted
values and the degree of stability exhibited by predicted values to fluctuations in
the data are important aspects of performance with which to assess the
effectiveness of this process. Clearly, if relatively minor fluctuations in the data
induce excessive changes in the predicted values there is cause for concern, a
phenomenon which is well known in the context of predictive regression
modelling.
The extent of any instability exhibited by each predicted value depends directly
in the number of parameters used to make each prediction, in this case just three
(and not directly on the total number of model parameters), together with the
extent to which the estimates of these parameters are sensitive to fluctuations in
the data. We concentrate on the latter source of possible instability since the
number of parameters involved in making each prediction is low. Indeed an
identical number of parameters (three) is involved in each prediction based on
the model defined by (3.2) in which a much more rigid structure is imputed to
development year effects.
Suppose first that g=0, w=0 so that the data are triangular in shape. Not
surprisingly in view of the nature of the model structure, simulation exercise
reveals that predictor stability deteriorates as data points further into the apices
of the run-off triangle are varied. This is illustrated by Figure 9.4(a) in which the
arrows indicate the directions of decreasing predictor stability. However, the
magnitude of predictor instability induced by changes in the data would not
appear to be excessive in our experience except for changes in the last few data
rows and columns. This is hardly surprising as so little data are yet available to
stabilize the estimates of the corresponding row and column parameters.
(Claims Reserving and GLIM) 575
Figure 9.3(a).
Figure 9.3(b).
576 Chain Ladder and Interactive Modelling
(Claims Reserving and GLIM) 577
Comparison of Tables 9.4a–b with Table 9.2 and Tables 9.5a–b with Table 9.3
give an indication of the degree of instability involved. In the construction of
Tables 9.4(a) and 9.5(a) the original claims amount C’32 is changed approxima-
tely 10% from 1001799 to 901799 while Tables 9.4(b) and 9.5(b) are based on a
substantial adjustment to the original claims amount C'28 from 266172 to
166172. We leave the reader to assess for his or herself the magnitude and pattern
of changes induced in the predicted values by these two representative changes in
the claims data by comparing Tables 9.5a–b with Table 9.3. As a further guide
changes to the penultimate row or column of the run-off triangle induce some
changes up to the same order of magnitude in the corresponding row or column
Table 9.4(a)
[o] The parameter estimates are
[o]
[o] estimate s.e. parameter
[o] 1 6.106 0.1644 1
[o] 2 0.8995 0.1604 DY_(2)
[o] 3 0.9395 0.1678 DY_(3)
[o] 4 0.9663 0.1758 DY_(4)
[o] 5 0.3852 0.1854 DY_(5)
[o] 6 -0.002226 0.1975 DY_(6)
[o] 7 -0.1145 0.2139 DY_(7)
[o] 8 -0.4345 0.2383 DY_(8)
[o] 9 -0.05308 0.2802 DY_(9)
[o] 10 -1.393 0.3780 DY_(10)
[o] 11 0.1938 0.1604 AY_(2)
[o] 12 0.1358 0.1678 AY_(3)
[o] 13 0.1539 0.1758 AY_(4)
[o] 14 0.3000 0.1854 AY_(5)
[o] 15 0.4136 0.1975 AY_(6)
[o] 16 0.5112 0.2139 AY_(7)
[o] 17 0.6772 0.2383 AY_(8)
[o] 18 0.5015 0.2802 AY_(9)
[o] 19 0.6022 0.3780 AY_(10)
[o] scale parameter taken as 0.1158
578 Chain Ladder and Interactive Modelling
Table 9.4(b)
[o] The parameter estimates are
[o]
[o] estimate s.e. parameter
[o] 1 6.123 0.1663 1
[o] 2 0.9112 0.1623 DY_(2)
[o] 3 0.9387 0.1697 DY_(3)
[o] 4 0.9650 0.1779 DY_(4)
[o] 5 0.3832 0.1875 DY_(5)
[o] 6 -0.004909 0.1998 DY_(6)
[o] 7 -0.1181 0.2164 DY_(7)
[o] 8 -0.5963 0.2411 DY_(8)
[o] 9 -0.04369 0.2834 DY_(9)
[o] 10 -1.410 0.3823 DY_(10)
[o] 11 0.1415 0.1623 AY_(2)
[o] 12 0.1522 0.1697 AY_(3)
[o] 13 0.1370 0.1779 AY_(4)
[o] 14 0.2824 0.1875 AY_(5)
[o] 15 0.3953 0.1998 AY_(6)
[o] 16 0.4920 0.2164 AY_(7)
[o] 17 0.6568 0.2411 AY_(8)
[o] 18 0.4789 0.2834 AY_(9)
[o] 19 0.5854 0.3823 AY_(10)
[o] scale parameter taken as 0.1185
Table 9.6
[o]
[o] estimate s.e. parameter
[o] 1 6.119 0.1520 1
[o] 2 0.9024 0.1476 DY_(2)
[o] 3 0.9324 01528 DY_(3)
[o] 4 0.9363 0.1598 DY_(4)
[o] 5 0.3522 0.1696 DY_(5)
[o] 6 -0.01988 0.1838 DY_(6)
[o] 7 -0.1330 0.1995 DY_(7)
[o] 8 -0.4500 0.2202 DY_(8)
[o] 9 -0.05353 0.2580 DY_(9)
[o] 10 -1.406 0.3551 DY_(10)
[o] 11 0.1682 0.1267 MAY_(2)
[o] 12 0.3009 0.1746 MAY_(3)
[o] 13 0.5102 0.1467 MAY_(4)
[o] scale parameter taken as 0.1030
[o]
[o]
Figure 9.6(a).
582 Chain Ladder and Interactive Modelling
Figure 9.6(b).
Figure 9.6(c).
(Claims Reserving and GLIM) 583
Figure 9.6(d).
Figure 9.6(e).
584 Chain Ladder and Interactive Modelling
(Claims Reserving and GLIM) 585
Figure 9.7.
a i parameters to more than one accident year where appropriate. Indeed, this is
vital if acceptable levels of stability are to be induced for the most recent accident
years for which relatively little data are, as yet, available. We stress that this
defect is also present in the traditional actuarial deterministic chain-ladder
technique, giving rise to much concern about the apparant continuing esteem
afforded to the technique.
A way forward is to examine all contrasts
between row parameters, Such contrasts are invariant of the somewhat arbitrary
choice of the two parameter constraints α 1( = β 1 = 0) needed to estimate the ais.
Application of the multicomparison t-criterion
for h=·5, induces the partition in row parameters displayed in Figure 9.5 in
which accident years are represented by numbered nodes; two nodes being linked
if and only if the inequality is satisfied.
This allocates separate row parameters to years 1 and 5 while linking years, 2,3
586 Chain Ladder and Interactive Modelling
and 4 together as well as linking years 6 to 10 inclusive; making a total of just four
row parameters. For sufficiently large h, all nodes are interlinked, while linkages
are shed as h is reduced.
The residual plots (Figures 9.6(a)–(e)), the parameter estimates (Table 9.6) and
predicted values (Table 9.7) are presented for scrutiny.
Verrall (1989)(13) has conducted a comparative study of estimates for the αis
based on a variety of estimation methods for these data. A graphical comparison
of least squares, empirical Bayes, Kalman filter and multi comparison estimators
is presented in Figure 9.7.
10. POSTSCRIPT
Possible future developments for incorporating within GLIM include:
(i) alternative methods of mapping back from the logarithmic modelling space;
(ii) use of the other model structures discussed in Section 3 (partially
developed);
(iii) use of methods other than the multicomparison tests to induce predictor
stability.
REFERENCES
(1) AITCHISON, J. & BROWN, J. A. C. (1969). The Log-Normal Distribution. Cambridge University
Press.
(2) DEJONG, P. & ZEHNWIRTH, B. (1983). Claims Reserving, State-Space Models and the Kalman
Filter. J.I.A. 110, 157.
(3) GREEN. P. J. (1988). Non-Diagonal Weight Matrices. GLIM Newsletter No. 16.
(4) HABERMAN, S. & RENSHAW, A. E. (1988). Generalised Linear Models in Actuarial Work.
Presented to a joint meeting of the Staple Inn Actuarial Society and the Royal Statistical
Society, General Applications Section. February 1988.
(5) HOSSACK, I. B., POLLARD, J. H. & ZEHNWIRTH, B. (1983). Introductory Statistics with
Applications in General Insurance. Cambridge University Press.
(6) KREMER, E. (1982). IBNR Claims and the Two-Way Model of ANOVA.Scandinavian Actuarial
Journal.
(7) TAYLOR, G. C. (1979). Statistical Testing of a Non-Life Insurance Model. Proceedings Actuarial
Sciences Institute, Act. Wetemschappen, Katholieke Univ. Leuven, Belgium.
(8) TAYLOR, G. C. (1983). An Invariance Principle for the Analysis of Non-Life Insurance Claims.
J.I.A. 110, 205-242.
(9) TAYLOR, G. C. & ASHE, F. R. (1983). Second Moments of Estimates of Outstanding Claims.
Journal of Econometrics 23, 37–61.
(10) TAYLOR, G. C. (1986). Claims Reserving in Non-Life Insurance. North-Holland.
(Claims Reserving and GLIM) 587
(11) TAYLOR, G. C. (1988). Regression Models in Claims Analysis (II), William M. Mercer,
Campbell, Cook, Knight, Sydney, Australia.
(12) VERRALL, R. (1988). Bayes Linear Models and the Claims Run-Off Triangle. Actuarial Research
Report No. 7. The City University, London.
(13) VERRALL, R. (1989). Private communication.
(14) ZEHNWIRTH, B. (1985). Interactive Claims Reserving Forecasting System. Benhar Nominees Pty
Ltd, Turramurra, NSW, Australia.