MIXED STRUCTURE

(1)

TIIE STRUCTURE DISTRIBUTION IN A MIXED POISSON PROCESS

JOZEF L. TEUGELS and PETRA VYNCKIER JLT, Department of Mathematics,

Katholieke Universiteit

Leuven,

Celestijnenlaan

200B, B-3001,

Heverlee

(Leuven),

^Belgium.

(Received

^July,

1996;

^Revised

October, 1996)

ABSTRACT

We

use a variety of real inversion formulas to derive the structure distribution in a mixed Poisson process. These approaches should prove to be useful in applications, e.g., in insurance where such processes are very popular.

This article is dedicated to the memory ofRoland L. Dobrushin.

Key

words: Mixed Poisson

Process,

Inversion Formulas.

AMS (MOS)

subject classifications: 60G55.

1. Introduction

One

of the most classical examples of a counting process

{N(t);t >_ O}

^is ^the homogeneous Poisson process. Theprobability distribution isgiven ^on

N

in the form:

e-t(t)

ⁿ

pn(t) P(N(t) n) _n!

The risk-parameter

A

givesthe average number of events per unit time.

In

many applications,

however,

the Poisson process is too simpleto be applicable.

Example: If

N(t)

^is the number of claims up to time t in a specific insurance portfolio, then it has been known to actuaries that the variability in the portfolio-expressed by

Var{N(t)}-

^is

much larger than

At,

the value corresponding to the strict Poisson case.

One

reason is

that,

^even when the number of claims for each individual policy follows a Poisson

distribution,

the averages vary over the portfolio. This means that the value

A

for an individual policy is one of the possible values ofa random variable

A.

This then leads to the notion ofa mixed Poisson process, which is defined as follows

(see ^Lundberg, [11]).

Definition:

A

mixed Poisson process

(MPP) {N(t);t >_ 0}

^is ^a pure birth process with state space

N

and counting distribution

Pn(t)

^{of the} ^form:

/

Pn(t) P(N(t) n) e- At(At)

.dH(A),

o

n

where

H

is the structure distribution given by

H(A)= P{A A}

^with

H(0)=

^0.

Here

are afew popular choicesfor the structure function.

Printedinthe U.S.A. ()1996by North Atlantic SciencePublishing Company 489

(2)

Homogeneous

Poisson process. The random variable

A

is

degenerate

at

,( > 0).

^This ^is

the only

MPP

that is simultaneously a renewal process. The interclaim times are independent and exponentially distributed.

Double Poisson process. The structure distribution has two different jump points, and

’2,

^with corresponding heights,

Pl (0,1)

^and

_P2 1-p,

respectively. The counting distribution is then

A₁

)n A2t

ⁿ

P() _P n ⁺ ^p n

^t

^>

^O.

This kind of process might be used if the population were to be subdivided into females and males.

P61ya

or Pcalpress.

For

this example, the structure distribution is

Gamma(a, 1/b),

where

,

^b

> 0,

so that

b

-A-I

dU(A) e

^dA.

Theresulting counting distribution is then

p (t)

^t

n

t+b] tb] ^t>O,

which is a Pascal ornegative binomial distribution. The two parameters, a and

b,

allow

great

flexibility whileone fits actual data to this theoretical distribution.

Sichel proc.

In [12],

Sichel introduced a distribution as a mixed Poisson by mixing it witha general inverse Gaussian distribution of the form:

h()dH()_-OO-i

^dA

_--2K0()exp { 2+} 2fl

where the three parameters,

fl,0

and

,

^are nonnegative. The function

K

₀ is the modified Bessel function of the third kind.

An

explicit form of the probabilities can be obtained so that

(t)n _}(o ⁺ ^n)Uo + .(41 ⁺ ^2fit)

Pn(t) n ¹ + 2t)

KO(

The case where 0

-1/2

^is particularly interesting since the

general

inverse Gaussian distribution simplifies to the classical inverse Gaussian distribution.

The introduction ofa

general MPP

is probably due to Thyrion

[15]

^for ^the

^general

^case ^and

to

Ammeter [4]

^for ^the ^special ^case ^of ^the Pascal process. The first detailed and fundamental study of

MPP’s

is due to Lundberg

[11],

who derived the deeper connection between

MPP’s

and continuous-time Markov chains.

In

particular,

Lundberg

derived the binomial criterion as a characterization of

MPP’s

amongMarkov processes.

Other contributions are due to Albrecht

[1-3],

^who ^has ^been ^the first to discuss statistical problems connected with

MPP’s. For

more information on

MPP’s,

see Johnson and

Kotz [9], Bfihlmann[5],

^Gerber

[6, 7]

^and ^Grandell

[8]. ^Moment

estimators and maximum likelihood estimators for the structure distribution have been derived by Tucker

[16]

^{and Simar}

[13];

^they

result in discrete estimates for

H.

Again, Albrecht

[3]

studied estimatorsfor the case ofa mixture ofa known finite number of Poisson components.

In

all these estimators, ^{one uses} the number of claims in successive repetitions of the process.

An

alternative approach is due to

Karr [10],

who estimated

H

by inverting the Laplace transform.

In

this case, only the time epoch ofthe first claim in each of the realizations of the

MPP

is used.

Our

approach ismore in the spirit of

Karr’s.

(3)

2. Estimation of H by Real Inversion Formulas

Before we start deriving inversion

formulas,

let us introduce an abbreviation.

For

0

_< _Yl -<

Y2 <

c, ^we shall write

H{Yl;Y2}" 1/2H({yl} ⁺ ^{H(yl, y2)+} 1/2H({Y2} ^). ⁽¹⁾

In

the above

H({y})

is the point mass of

H

at the point y, while

H(yl, y2)

denotes the mass in the open interval

(Yl, Y2)"

2.1 Inversionusingthe Laplacetransform of

H

This methodis based on the equality"

where

H

is the Laplace transform of

H. We

know that

N(e) ¹

At

E

1- e

E n! dH(A)

0 n’-O

0 0

(2)

2.1.1 Inversion via Poisson variates

The following limiting relation can be derived by looking at a sequence of independent, identically distributed Poisson variables.

See,

for example,

Teugels [14].

t):

n, 0

Define,

for 0

< _Yl < _Y2 < ,

the expression"

0 als t

<

^u

als t u 1 als t

>

^u.

[nY₂

In(Yl’Y2)" E

m 1

+ [nYl]

Using the definition of

H,

^{we can} ^writethat

(- _m! n)mi(m)(n)

[nY2]

In(Yl Y2)-- E _{+ [u]} ^(-- ^m! ^n)m ^(-1 ^)m /

^e

o

Xn,mdH(,)

/ ^ell(a)

0 ^m 1

+ [nYl] m---’e

j {dn(’Y2)-dn(1’Yl)}dH(1)"

0

On

the other

hand,

equality

(2)

^canbe invoked to write

{ N(t)!(N(t)- . N(t)-rn}

(m)(O)--(--t)- ^’E

m)! (

^t

(4)

Interchanging summation and expectation, wesee that

Iu(y!,y2)- ^E E (N(mt)) () (1-)

^N(t)-m

m 1

+ [nYl]

We

combine this with the first formula and apply

Lebesgue’s

dominated convergence theorem to arrive ata first inversion formula:

H{Yl,y} lim

_n|_c

^E

¹

⁽⁴⁾

m 1

+

^[ny₁

For

the mass at apoint, we can use a slightlydifferent approachthan that for 1 als v- 1

exp{ n(v

¹

^-log v)}

_{0 als} _v _1.

Following the same

argument

as

above,

we

get

-limE

[ ^N(t)!

^N(t)-n ^e

n

H({y})

,qoo

[,(N(t)-n)’(1-yY) (Y)

2.1.2 Inversion via

Gamma

variates

An analogous

derivation can be made starting from a sequence of^i.i.d, exponential variables.

For

thelatter wehave the limiting relation"

u

f

⁰ ^als ^u

^<

¹

r(n) 1/2

¹

0 1 als u

>

^1.

Define

Y2

"n(Yl,Y2):- i (-}n(n)()snd ^p(

^1"

Yl

We

easily findfrom the definition of

H

that

"n(YI’Y2)- J

₀

From (2)

^{we see}^that

Jn(Yi’ Y2)

^g

_g(rt,

N(, I

^r

- ¹⁾

^ty²^tyⁿ¹

zn-l(1--z)N(t)-ndz

Combining these elements with Lebesgue’s

theorem,

^we^obtain ^a second inversion formula:

n ty₁

H{Yl;Y2} nliTrnE ^B(n,N(t I

ⁿ

⁺ ^1)/ ^zn- ¹⁽¹ ^z)N(t) ^-ndz

ty₂

(6)

One advantage

^{of this} latter formula is

that,

if thedensity h of

H

exists, then

h(y)-limE { ^{B(n, N(t)}

¹ ⁿ

⁺ ¹⁾

¹

(n)n(n)N(t)-n} ^-7

¹

^-7

(5)

2.1.3 Normal approximation

For

both of the inversion

formulas, (4)

^and

(6),

ône ^can âpply â normal approximation.

usillustrate the procedure on

(6).

Let In (6),

^let ^z-

_anU + bn,

^where ^an and b_n are functions of t that are to be determined in the sequel. Rewrite the incomplete beta-integral in theform:

t ⁽⁾

] ^zn-l(

¹

_z)N(t)-ndz--ant i

tY2

with

bn-

^1-bⁿ ^and

(anu + bn)

^n-

1(

_n

i-

1,2.

The expression inside the expectation sign in

(6)

^{then reads} ^as

1

f ^z.-1

^(t ⁿ^d

B(n, N(t)

ⁿ

+ 1) (1 z)

^N ^z

^I

₁

(n)

ty₂

where --N(t)-

anb b.

I1() (n- 1)!(N(t)- n)!

and

anu)N(t)-ndu,

exp(I2(n,u))du,

I2(n, u)- (n- 1)log ( an)

^l

^+--u ^b ⁺ ^(N(t) ⁿ⁾ ^log (

¹

^a )

Now

choose a_n and b_n such that

exp(I2(n u))

converges to the key factor in the normal density.

We

then need both

N(t)

^and ⁿ ^{to be}

large,

but also

N(t)-

ⁿ ^needs ^to ^be

^large. ^A

series expan- sion of the logarithms yields for

12

^that

I2(n u)--uan{

^n-1

N(t)-n} ^u2_2 {n-1 ⁺ N(t)-n} ^+o ⁽ⁿⁿ ⁾³

The obvious choice for b_n should annihilate the first term on the right. The subsequent choice of a_nis madeto reduce the coefficient of

-u2/2

^{to 1. This} ^{yields the} choices"

(n- 1)(N(t)- n) N(t)-n

bn Nn(t) ^-1’

¹ ^b

^n- N(t)- ^1’

^and

an-

²

(N(t)- 1)

³

With the help of these expressions and Stirling’s

formula,

^one easily shows that

Ii(n),,o (27r)

Introducing

the above values for a_nand

b,

ⁱⁿ _{ai yields}

3

(N(t)-l)

²

{n

^n-1

_{} /} n{N(t)

¹

(i(n)

v/{n 1)(N(t) n) tYi N(t) "-

¹

l-tt ^tYi

Combining all of the above

results,

oneobtains anormal approximation to

(6)"

{i

ⁿ

^N(t) 1)}-{i ^n,n ^N(t) 1)}

H {YI Y2} 1--ff(t)( ^YI 1--(t)( ^tY

²

2.2 Inversionbased onthe time epochs

As

an

alternative,

^{we can} start from the explicit expression for the distribution of the epoch of the nth

event, Tn,

and combine this with the limiting relation

(5).

^According ^to ^a ^result ^by

(6)

Lundberg

[11], (see

alternatively, Albrecht

[3]),

fT ^(x)

ⁿ

/

For

the distribution we

get

w

fTn(U)du--

0

Henceforth,

)x)n(n 1)!dH(A)

(n-1 en

0

P(Yl nY2 ^)-

Now,

^we apply

(5)

^directly ^{to obtain}^a ^fourth ^inversion^formula:

H{Yl;Y2}

ⁿ

^limPly

^o,:) \ I

< -n

ⁿ

^< ^Y2 )

3. Simulations

3.1 Simulation ofan

MPP

Another important property of an

MPP

is that the waiting times between epochs of

events,

with each

W

_n

T

_n _1-

Tn,

âre ^dependent, ûnless ^{we are} ^dealing ^with â strict Poisson process.

Albrecht

[3]

has shown that the

Wn’s

^are identically distributed,

exchangeable,

but positively correlated.

More

specifically, onehas

Cov(Wn_

1,

Wn)- ,ar(-).

There are at least two distinct procedures to simulate the time epochsofan

MPP.

(i) ^A

^first ^method to simulate the time epochs of an

MPP

uses the above interdependence between the waiting times. Recall from Albrecht

[3]

^that

n-1 Pn- l(t ^nt- tn-1) P(W

n

<_

^t

Tn_

1

tn_l)

^1- _t_/

tn_

1

Pn_l(n_l)

(ii) ^A

second method is based on the uniformity

property

of the

MPP.

Given that

N(t)-

n, the first n epochs,

T1,...,Tn,

^{have the} ^same joint distribution as the order statistics of a sample from a uniform distribution on

(0, t) (Albrecht [3]). ^One

^starts

with the simulation of n as a value of

N(t)

^with ^a given value of t and in accordance with the distribution

pn(t). ^One

then simulates n random numbers in

(0, t).

^After

rearrangement, these values give ^asample

(T1,... Tn).

3.2 Illustrations

We

perform simulations for the empirical versions of the inversion formulas given in

(4), (7)

and

(8)

^on homogeneous, double Poisson processes, Pascal processes, and mixtures of Pascal processes with Poisson point masses.

In

all cases, we considered the distribution function by taking Yl to be zero.

From

the simulations, we can conclude that the estimators for the inversions

(4)

^and

(8)

^perform ^quite

well,

^even^for a relatively small number of realizations.

On

the other

hand,

the normal approximation in

(7)

^seems ^to ^{be too} ^rough. ^Generally, ^continuous structure functions are better estimated by the inversion formulas than discrete distributions are.

By

means of example, the results of two simulations are shown in Figures

(a)

^and

(b).

(7)

0

(a)

/i :"1

,/,"/

0 1 2 3 4 5 6

Figure

(a)

0

(b)

./:

....!

//

/l

1 2 3 4

Figure

(b)

(8)

In

^both parts, the estimators are based on 15 simulated realizations with fixed t 20. The solid lines represent the theoretical structure function and the dotted lines represent the estimator for

(8),

while the points are the estimators for inversion formula

(4). ^For

Figure

(a),

^we ^simulated

double Poisson processes with parameters

A1 ^2, 2

^{4 and}

^Pl

^.5. The value ofn in

(4)

^was

taken to be 20.

In

the case ofthe estimator for

(8),

^we ^considered ^the averages for n-values from n 18 to n

N(20).

Figure

(b)

^concerns simulations ofPascal processes with parameters a 21 and b- 10. The value for n in

(4)

^was taken to be

18,

whereas for

(8)

the averages for n- 15 to n-

N(20)

^were considered.

Acknowledgements

The authors take pleasure in acknowledging

D. Wasters

for helping with simulations. They also enjoyed discussions with

B.

Sundt and

H. Rootzn

on the topicon mixed Poisson processes.

References [1]

[2]

[3]

[4]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

Albrecht, P., 0ber

einige Eigenschaften des gemischten Poisson prozesses, Bull.

of

^the

Assoc. of

Swiss Actuaries

(1981),

^241-249.

Albrecht, P., Zur

statistischen analyse des gemischten Poisson-prozesses, gestiitzt auf Schadeneintrittzeitpunkte, ^Blgtter der Deutschen

Gesellschaft fiir

Versicherungsrnathematik 15

(1982),

^249-257.

Albrecht, P., On

some statistical methods connected with the mixed Poisson process,

Scan&

Actuarial Journal

(1982),

^1-14.

Ammeter, H., A

generalization of the collective theory of risk in

regard

to fluctuatingbasic probabilities, Skand. Akt. 31

(1948),

171-198.

Biihlmann, H.,

MathematicalMethods in Risk Theory, Springer-Verlag, Heidelberg 1970.

Gerber, H.U., An

Introduction to Mathematical Risk Theory, Hiibner Foundation, Univer- sity ofPennsylvania, Philadelphia 1979.

Gerber, H., ^On

the asymptotic behavior of the mixed Poisson process,

Scan&

Actuarial

J. (1983),

^256.

Grandell, J., Aspects of

^Risk Theory, Springer-Verlag,

New

York 1991.

Johnson, N.I.

and

Kotz, S.,

Mixed Poisson process, Encyclopedia

of

Statistical Sciences 5

(1985),

^556-559.

Karr, A.F.,

Combined nonparametric inference and state estimation for mixed Poisson processes,

Zeitschrift fiir

Wahrscheinlichkeitstheorie und verwandte Gebiete 66

(1984),

^81-96.

Lundberg, O., On

Random

Processes

and theirApplication to Sickness and Accident

Stat-

istics, Almquist and

Wicksells,

Uppsala 1964.

Sichel,

H., On

a family of discrete distributions particularly suited to represent

long

^tailed frequency

data, Proc.

3rd

Syrup.

Math. Statistics

(1971),

^Pretoria,

^CSIR.

Simar,

L.,

Maximum likelihood estimation of a compound Poisson process, The Annals

of

Statistics4

(1976),

^1200-1209.

Teugels,

J.L.,

Probabilistic proofs ofsome real inversion

formulas,

Math. Nachrichten 146

(1990),

^149-157.

Thyrion,

P., Sur

une proprit de8 processu8 de Poi88on

Gnralis8,

^Bull.

Assoc.

Royale Actuaire Belges59

(1959),

^35-46.

Tucker, H.G., An

estimate ofthe compounding distribution ofa compound Poisson distribution, Th. Prob. Appl. 8

(1963),

^195-200.

MIXED STRUCTURE

TIIE STRUCTURE DISTRIBUTION IN A MIXED POISSON PROCESS

JOZEF L. TEUGELS and PETRA VYNCKIER JLT, Department of Mathematics,

Leuven,

200B, B-3001,

(Leuven),

(Received

1996;

October, 1996)

ABSTRACT

We

Key

Process,

AMS (MOS)

1. Introduction

One

{N(t);t >_ O}

N

e-t(t)

pn(t) P(N(t) n) n!

A

In

however,

N(t)

Var{N(t)}-

At,

One

that,

distribution,

A

A.

(see Lundberg, [11]).

A

(MPP) {N(t);t >_ 0}

N

Pn(t)

/

Pn(t) P(N(t) n) e- At(At)

.dH(A),

n

H

H(A)= P{A A}

H(0)=

Here

Homogeneous

A

degenerate

,( > 0).

MPP

’2,

Pl (0,1)

P2 1-p,

)n A2t

P() P n + p n

>

P61ya

For

Gamma(a, 1/b),

,

> 0,

-A-I

dU(A) e

p (t)

t+b] tb] t>O,

b,

great

In [12],

h()dH()_-OO-i

--2K0()exp { 2+} 2fl

fl,0

,

K

An

(t)n _}(o + n)Uo + .(41 + 2fit)

Pn(t) n 1 + 2t)

KO(

-1/2

general

general MPP

[15]

pn(t) P(N(t) n) _n!

(see ^Lundberg, [11]).

_P2 1-p,

P() _P n ⁺ ^p n

^>

t+b] tb] ^t>O,

_--2K0()exp { 2+} 2fl

(t)n _}(o ⁺ ^n)Uo + .(41 ⁺ ^2fit)

Pn(t) n ¹ + 2t)

^general

[8]. ^Moment

_< _Yl -<

H{Yl;Y2}" 1/2H({yl} ⁺ ^{H(yl, y2)+} 1/2H({Y2} ^). ⁽¹⁾

< _Yl < _Y2 < ,

(- _m! n)mi(m)(n)

In(Yl Y2)-- E _{+ [u]} ^(-- ^m! ^n)m ^(-1 ^)m /

/ ^ell(a)

(m)(O)--(--t)- ^’E

Iu(y!,y2)- ^E E (N(mt)) () (1-)