reserving methods using individual claim-level data
TRANSCRIPT
Reserving Methods Using Individual Claim-LevelData
Katrien Antonio
CAS Loss Reserve SeminarBoston
September 17 2013
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 1 / 38
Motivation: dynamics
◮ Individual, micro or granular level → claim–by–claim.
•t1 t2 t3 t4 t5 t6 t7 t8 t9
Occurrence
Notification
Loss payments
Closure
Re–opening
Payment
Closure
IBNR
RBNS
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 2 / 38
Motivation: dynamics - types of claims
Closed claim = ‘closure’ (at t5) ≤ present (say, τ).
t1 t2 t3 t4 t5 time
Occurrence
Notification
Loss Payments
Closure
Present
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 3 / 38
Motivation: dynamics - types of claims
RBNS claim = reported, but ‘closure’ > present (say, τ).
t1 t2 t3 t4 time
Occurrence
Notification
Loss Payments
Present
Uncertainty
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 4 / 38
Motivation: dynamics - types of claims
IBNR claim = incurred, but reporting and ‘closure’ > present (say, τ).
t1 time
OccurrencePresent
Uncertainty
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 5 / 38
Motivation: from macro to micro–level models
◮ Traditional approach:
- aggregate data by (arrival, development) year combination;
- apply reserving method designed for run–off triangle
(cfr. many of the talks presented at this seminar).
◮ Our viewpoint:
design a reserving method at individual claim level.
◮ Inspiration?
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 6 / 38
Motivation: from macro to micro–level models
“The problem is more with the data than the methods, since, clearly, it is the
estimation of aggregate case reserves which is at fault. [. . . ] In this respect,
models based on individual claims, rather than data aggregated into triangles, are
likely to be of benefit. [. . . ] Aggregate triangles are useful for management
information, and have the advantage that simple deterministic methods can be
used to analyze them.”(England & Verrall, 2002, p.507)
“However, it has to be borne in mind that traditional techniques were developed
before the advent of desktop computers, using methods which could be
evaluated using pencil and paper.” (England & Verrall, 2002, p.507)
“The triangle is a summary, whose origins are very much driven by the
computational restrictions of a bygone era.” (Taylor & Campbell, 2002, p.21)
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 7 / 38
Motivation: from macro to micro–level models
“Recall that condition XXX in Mack’s model had only one purpose: it was chosen
in order to explain the form of the chain ladder estimators XXX used by
practitioners for predicting future claim numbers. Hence condition XXX was
chosen for pragmatic reasons.” (Mikosch, 2009, p.374)
“Conditions such as XXX and XXX (i.e. Mack’s conditions) do not explain the
dynamics of the underlying claim arrival and payment process in a satisfactory way.
This is not surprising since only the first and second conditional moments of the
annual dynamics are specified.” (Mikosch, 2009, p.381)
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 8 / 38
Micro–level reserving: strictly Scandinavian?(Time line contains a selection of papers from international actuarialjournals.)
time1989 1993 1994 1996 1999 2007 2010 2011 2012 2013
Arjas (ASTIN)
Norberg (ASTIN)
Hesselager (ASTIN)
Haastrup & Arjas (ASTIN)
Norberg (ASTIN)
Larsen (ASTIN)
Verrall, Nielsen & Jessen (ASTIN)
Rosenlund (ASTIN)
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 9 / 38
My research on micro–level loss reserving
◮ Antonio & Plat [AP] (2013, SAJ, in press): development of individualclaims in continuous time, parametric;
inspired by Norberg (1993,1999), Haastrup & Arjas (1996), Cook & Lawless
(2007);
◮ Pigeon, Antonio & Denuit [PAD] (2013, ASTIN Bulletin, in press):development of individual claims in discrete time, parametric;
inspired by chain–ladder method;
◮ Antonio & Godecharle: development of individual claims in discretetime, historical simulation from empirical data;
inspired by Drieskens, Henry, Walhin & Wielandts (Scandinavian Actuarial
Journal, 2012) and Rosenlund (ASTIN Bulletin, 2012).
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 10 / 38
[AP]: continuous time, claim-by-claim
◮ A claim i is a combination of
- an accident date Ti ;
- a reporting delay Ui ;
- a set of covariates Ci ;
- a development process Xi : Xi = ({Ei (v),Pi (v)})v∈[0,ViNi];
◮ In the development process we use:
- Ei (vij) := Eij the type of the jth event in development of claim i ;
- occurs at time vij , in months after notification date;
- corresponding payment vector Pi (vij) := Pij .
◮ Event types?
- payment, settlement with payment, settlement without payment, . . .
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 11 / 38
[AP]: continuous time, claim-by-claim
Run–off process of a non–life claim: micro–level.
Ti Ti + Ui
0
Ti + Ui + Vi1
Vi1 Vi2. . .
ViNi
•
Occurrence
Notification
Loss payments
Closure
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 12 / 38
[AP]: statistical model
◮ Observed data
development up to time τ of claims reported before τ .
(T oi ,U
oi ,X
oi )i≥1.
◮ Development of claim i is censored τ − T oi − Uo
i time units afternotification.
◮ Likelihood of the observed claim development process:
∝
∏
i≥1
λ(T oi )PU|t(τ − T o
i )
exp
(
−
∫ τ
0w(t)λ(t)PU|t(τ − t)dt
)
·
∏
i≥1
PU|t(dUoi )
PU|t(τ − T oi )
·∏
i≥1
Pτ−T o
i−Uo
i
X |t,u (dX oi ).
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 13 / 38
[AP]: statistical model - building blocks
◮ The reporting delay:∏
i≥1PU|t(dU
oi)
PU|t(τ−T oi) ;
◮ The occurrence times (given the reporting delay distribution):
∏
i≥1
λ(T oi )PU|t(τ − T o
i )
exp
(
−
∫ τ
0w(t)λ(t)PU|t(τ − t)dt
)
;
◮ The development process – event part:
∏
i≥1
∏Nij=1
{
hδij1se (Vij )·h
δij2sep (Vij )·h
δij3p (Vij )
}
exp (−∫ τi0 (hse(u)+hsep(u)+hp(u))du);
◮ The development process – severity part:
∏
i≥1
∏
j
Pp(dVij).
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 14 / 38
[AP]: building block - reporting delay
◮ Reporting delay: Weibull with degenerate components at 0 daysdelay, 1 day delay, . . . , 8 days delay:
8∑
k=0
pk IU=k + (1−∑
k
pk)fU|U>8(u).
Fit Reporting Delay − ’Injury’
In months since occurrence
Density
0.0 0.5 1.0 1.5 2.0 2.5 3.0
01
23
4
Weibull / Degenerate
Observed
Fit Reporting Delay − ’Material’
In months since occurrence
Density
0.0 0.5 1.0 1.5 2.0 2.5 3.0
01
23
45 Weibull / Degenerate
Observed
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 15 / 38
[AP]: building block - occurrence of claims◮ Poisson process driving the occurrence of Material Damage (MD)
claims.
0.0
40.0
50.0
60.0
70.0
8
Occurrence process: Material
t
lam
bda(t
)
Jan’00 Jan’01 Jan’02 Jan’03 Jan’04 Jan’05 Jan’06 Jan’07 Jan’08 Jan’09
0.0
40.0
50.0
60.0
70.0
8
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 16 / 38
[AP]: building block - occurrence, type of events
0 40 80 120
0.00
0.04
0.08
0.12
Type 1 − Injury
t
h(t)
first
later
Weibull
0 40 80 120
0.00
00.
010
0.02
00.
030
Type 2 − Injury
th(
t)
0 40 80 120
0.05
0.10
0.15
0.20
0.25
Type 3 − Injury
t
h(t)
0 5 15 25
0.1
0.2
0.3
0.4
0.5
Type 1 − Mat
t
h(t)
0 5 15 25
0.0
0.1
0.2
0.3
0.4
Type 2 − Mat
t
h(t)
0 5 15 25
0.05
0.10
0.15
0.20
Type 3 − Mat
t
h(t)
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 17 / 38
[AP]: building block - severity model
◮ Severity distribution.
◮ Lognormal distributions with µ and σ depending on:
- the development period: 0-12 months after notification, 12-24 months. . . (for injury) and 0-4 months, 4-8 months . . . (for material);
- the initial reserve (set by company experts): categorized.
◮ Policy limit of 2,500,000 euro is implemented.
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 18 / 38
[AP]: simulating the predictive distribution of
reserves
◮ Using these building blocks we can easily:
• simulate the time to a next event, the corresponding type and severityfor an RBNS claim;
- use survival function/cdf determined by estimated hazard rates;
• simulate the number of IBNR claims that will show up, their occurrencetime and their development;
- use filtered Poisson process, in combination with reporting delaydistribution;
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 19 / 38
[AP]: example with data from practice
◮ Data from Antonio & Plat (2013, SAJ):
- portfolio of general liability insurance policies for private individuals;
- use data from 1997 to 2004 as training set, 2005-2009 as validation set;
- exposure measure available;
- all payments discounted to 1/1/1997;
- Bodily Injury (BI) and Material Damage (MD) payments;
- 279,094 reported claims: 273,977 are MD, 5,117 are BI;
- closed?: 268,484 MD and 4,098 BI claims.
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 20 / 38
[AP]: example with data from practice
Perform a back–test for BI and MD claims.
Arrival Development yearyear 1 2 3 4 5 6 7 8
1997 308 635 366 530 549 137 132 3391998 257 482 312 336 269 56 179 781999 292 590 410 273 254 286 132 972000 317 601 439 498 407 371 247 2752001 466 846 566 567 446 375 147 2402002 314 615 540 449 133 131 332 1, 0822003 304 802 617 268 223 216 1732004 333 864 412 245 273 100
Total outstanding BI = 7,923,000 and total outstanding MD = 1,861,000.
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 21 / 38
[AP]: output for BI claims, CY 2006IBNR claims
Number
Fre
quency
5 10 15 20 25 30 35
0500
1000
1500
IBNR + RBNS Reserve
Reserve
Fre
quency
1000 2000 3000 4000 5000 6000
0200
400
600
800
1000
Total events
Number
Fre
quency
700 800 900 1000 1100
0200
400
600
800
1000
Type 1 events
Number
Fre
quency
100 120 140 160 180 200 220
0500
1000
1500
Type 2 events
Number of events
Fre
quency
0 20 40 60 80
0200
400
600
800
1000
1200
Type 3 events
Number
Fre
quency
500 550 600 650 700 750 800 850
0200
400
600
800
1000
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 22 / 38
[AP]: MD claims, collective vs. micro–model
Micro−level: Total
Reserve
Fre
quency
2000 3000 4000 5000 6000 7000
0200
400
600
800
1000
1200
1400
Aggregate Model − ODP: Total
Reserve
Fre
quency
2000 3000 4000 5000 6000 7000
0500
1000
1500
2000
Aggregate Model − Lognormal: Total
Fre
quency
0 5000 10000 15000 20000 25000 30000
0500
1000
1500
2000
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 23 / 38
[AP]: BI claims, collective vs. micro–model
Micro−level: Total
Reserve
Fre
quency
4000 6000 8000 10000 12000 14000 16000
0500
1000
1500
Aggregate Model − ODP: Total
Reserve
Fre
quency
4000 6000 8000 10000 12000 14000 16000
0500
1000
1500
Aggregate Model − Lognormal: Total
Fre
quency
4000 6000 8000 10000 12000 14000 16000
0200
400
600
800
1000
1200
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 24 / 38
From continuous time to discrete time,
claim–by–claim
◮ Viewpoint in Antonio & Plat (2013, SAJ): continuous time, inspired bysurvival analysis.
◮ Pigeon, Antonio & Denuit (2013, ASTIN) switch to discrete time,inspired by chain–ladder.
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 25 / 38
[PAD]: discrete–time, an example
Date Our notation
Accident 06/17/1997 i = ‘1997′
Reporting 07/22/1997 t(ik) = 0
Number of periods with payment > 0 u(ik) = 4
after first one
Payment 09/24/1997 q(ik) = 0
10/21/1997
11/07/1997 Y(ik),1
05/08/1998 n(ik),1 = 1
12/11/1998 Y(ik),2
03/23/1999 n(ik),2 = 1 Y(ik),3
02/23/2000 n(ik),3 = 1 Y(ik),4
01/03/2001 n(ik),4 = 1
02/24/2001 Y(ik),5
Closure 08/13/2001 n(ik),5 = 0
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 26 / 38
[PAD]: model set–up
We identify (in discrete time):
(ik) claim k from occurrence period i ;
Tik reporting delay, i.e. number of periods between occurrence andnotification period;
Qik first payment delay, i.e. number of periods between notification andfirst payment period;
Uik number of periods with partial payment > 0 after first payment;
Yikj the jth incremental partial amount for claim (ik) (> 0);
Nikj delay between 2 periods with payment, i.e. number of periods betweenpayments j and j + 1;
Nik,Uik+1 number of periods between last payment and settlement.
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 27 / 38
[PAD]: development pattern
◮ We use:
- initial amount Yik1;
- payment–to–payment development factors (note: chain–ladder isperiod–to–period)
λ(ik)j =
∑j+1r=1 Yikr
∑j
r=1 Yikr
, j = 1, . . . , uik .
◮ Thus, development pattern Λ(ik)uik+1 is
Λ(ik)uik+1 =
[
Yik1 λ(ik)1 . . . λ
(ik)uik
]′.
◮ Note: multivariate distribution of Λ(ik)uik+1 should account for
dependence and asymmetry ⇒ use Multivariate (Skew) Normal.
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 28 / 38
[PAD]: likelihood expressions◮ As in [AP] we can write down likelihood contributions of:
- occurrence of claims: use Poisson process, thinned;
- closed claims;
- RBNS claims: uik is not observed, development is censored;
- RBNP claims (new!): Reported But Not Paid
• first period with payment not observed (yet);
• first payment delay is censored, etc.
- IBNR claims: use Poisson process, appropriately thinned;
◮ We use:
- (not MSN) maximum likelihood;
- (in MSN) maximum product of spacings for shape, combined with MLfor location and scale parameters.
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 29 / 38
[PAD]: what we get – analytical results
◮ An IBNR or RBNP claim: C = Y1 · λ1 · λ2 · . . . · λU
with E [IBNR|I] versus E [RBNP|I]
= (x) · EU
[
2U+1 exp(t′1µU+1 + 0.5t′1Σ1/2U+1
(
Σ1/2U+1
)′t1) ·
∏U+1j=1 Φ
(
∆j ·((Σ1/2U+1)
′t1)j√
1+∆2j
)]
,
where (x) is E [KIBNR] versus kRBNP.
◮ An RBNS claim: [C |ΛA = ℓA] = y1 · ℓ1 · . . . · ℓuA−1 · λuA . . . · λU
with E [RBNS|I]
=∑
(ik)RBNS
y1 · ℓ1 · . . . · ℓu1−1
· EUB
2UB exp(h′1µ∗U+1 + 0.5h′1
(
Σ∗U+1
)1/2(
(
Σ∗U+1
)1/2)′
h1) ·∏UB
j=1Φ
∆∗j·
(
(
(Σ∗U+1)
1/2)′h1
)
j√
1+(∆∗j )
2
.
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 30 / 38
[PAD]: what we get – predictive distributions
◮ Simulation–based, using all building blocks;
◮ Parameter uncertainty?
- simulate each parameter from asymptotic normal distribution;
- (at least for discrete time components and location parameter inM(S)N);
- more work to be done.
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 31 / 38
[PAD]: example with (same) data from practice
◮ Distribution for number of periods: {Tik ,Qik ,Uik ,Nik}
- mixtures of discrete distribution with degenerate components;
- model selection using AIC and BIC;
- comparison of observed data and fits.
◮ Development pattern:
- Multivariate Normal (MN) and Multivariate Skew Normal (MSN) with
UN, TOEP, CS and DIA structure for Σ1/2c ;
- model selection with AIC and BIC;
- comparison of observed data and fits.
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 32 / 38
[PAD]: example with (same) data from practice
◮ Claims closing with ‘single’ payment: explore U(S)N.
Single payment severity
Bodily Injury
2 4 6 8 10
0.0
00.0
50.1
00.1
50.2
00.2
5
Normal density
Single payment severity
Material Damage
−2 0 2 4 6 8 10 12
0.0
00.0
50.1
00.1
50.2
00.2
50.3
00.3
5
Normal density
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 33 / 38
[PAD]: example with (same) data from practice
◮ Claims with multiple payments: explore M(S)N.
2 4 6 8 10
02
46
Initial pmt
First
lin
k r
atio
2 4 6 8 10
02
46
8
Initial pmt
Se
co
nd
lin
k r
atio
2 4 6 8 10
02
46
8
Initial pmt
Th
ird
lin
k r
atio
2 4 6 8 10
02
46
8
Initial pmt
Fo
urt
h lin
k r
atio
-2 0 2 4 6
02
46
8
First link ratio
Se
co
nd
lin
k r
atio
-2 0 2 4 6
02
46
8
First link ratio
Th
ird
lin
k r
atio
-2 0 2 4 6
02
46
8
First link ratio
Fo
urt
h lin
k r
atio
-2 -1 0 1 2 3 4
-10
12
34
Second link ratio
Th
ird
lin
k r
atio
-2 -1 0 1 2 3 4
-10
12
34
Second link ratio
Fo
urt
h lin
k r
atio
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 34 / 38
[PAD]: example – predictive results
Model or Item Expected S.E. VaR0.95 VaR0.995
Scenario Value
Individual MSN IBNR+ 2, 970, 645Analytical RBNS 5, 433, 548
(until settlement) Total 8, 404, 192
Individual MSN IBNR+ 3, 035, 519 494, 771 3, 912, 159 4, 673, 340Simulated RBNS 5, 439, 318 704, 701 6, 650, 958 7, 738, 003
(until settlement) Total 8, 474, 837 853, 812 9, 927, 439 11, 105, 174
Chain-Ladder Total 9, 126, 639 1, 284, 793 11, 380, 743 13, 061, 937(Bootstrap, ODP)
Observed Total 7, 684, 000
([PAD] uses slightly adjusted distinction MD vs BI, compared to [AP].)
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 35 / 38
[PAD]: example – predictive results
Model or Item Expected S.E. VaR0.95 VaR0.995
Scenario Value
Individual MSN Total 8, 568, 506 922, 657 10, 134, 198 11, 406, 905Sim. + Unc.
(until settlement)
Individual MSN Total 8, 568, 355 902, 601 10, 141, 226 11, 320, 931Sim. + Unc. + Pol. Limit
(until settlement)
Individual MSN Total 7, 251, 103 817, 878 8, 679, 618 9, 717, 771Sim. + Unc. + Pol. Limit
(until triangle bound)
Chain-Ladder Total 9, 126, 639 1, 284, 793 11, 380, 743 13, 061, 937(Bootstrap, ODP)
Observed Total 7, 684, 000
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 36 / 38
[PAD]: example – predictive results
Prediction for lower triangle: MSN vs. Chain−Ladder
Bodily Injury
4.0e+06 6.0e+06 8.0e+06 1.0e+07 1.2e+07 1.4e+07
0e+
00
1e−
07
2e−
07
3e−
07
4e−
07
5e−
07
●
●
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 37 / 38
Highlights
◮ Novel setting for claims reserving at individual level.
◮ Continuous time framework, inspired by survival analysis, on the onehand.
◮ Discrete time framework, inspired by chain–ladder, on the other hand.
◮ Analytical expressions for moments of (RBNS, RBNP, IBNR) reserve +simulation approach.
◮ More work ongoing.
◮ Comments?: [email protected].
September 17 2013, K. Antonio, KU Leuven and University of Amsterdam 38 / 38