• 検索結果がありません。

1Introductionandthemainresult DmitryB.Rokhlin StochasticPerron’smethodforoptimalcontrolproblemswithstateconstraints

N/A
N/A
Protected

Academic year: 2022

シェア "1Introductionandthemainresult DmitryB.Rokhlin StochasticPerron’smethodforoptimalcontrolproblemswithstateconstraints"

Copied!
16
0
0

読み込み中.... (全文を見る)

全文

(1)

ISSN:1083-589X in PROBABILITY

Stochastic Perron’s method for optimal control problems with state constraints

Dmitry B. Rokhlin

*

Abstract

We apply the stochastic Perron method of Bayraktar and Sîrbu to a general infinite horizon optimal control problem, where the stateXis a controlled diffusion process, and the state constraint is described by a closed set. We prove that the value function vis bounded from below (resp., from above) by a viscosity supersolution (resp., sub- solution) of the related state constrained problem for the Hamilton-Jacobi-Bellman equation. In the case of a smooth domain, under some additional assumptions, these estimates allow to identifyvwith a unique continuous constrained viscosity solution of this equation.

Keywords: Stochastic Perron’s method; State constraints; Viscosity solution; Comparison re- sult.

AMS MSC 2010:93E20; 49L25; 60H30.

Submitted to ECP on June 19, 2014, final version accepted on October 20, 2014.

1 Introduction and the main result

The aim of the paper is to extend the scope of applications of the stochastic Per- ron method, developed by Bayraktar and Sîrbu. This method allows to characterise the value function of a controlled diffusion problem as a viscosity solution of the cor- responding Hamilton-Jacobi-Bellman (HJB) equation, bypassing the dynamic program- ming principle. Instead it requires a comparison result, implying the uniqueness of a viscosity solution of the HJB equation. Previously this method was applied to linear parabolic equations [5], stochastic differential games [7, 25, 26], regular [6, 24] and singular control problems [8].

The method involves the construction of two familiesV,V+of functions, bounding the value function from below and above

u≤v≤w, u∈ V, w∈ V+.

Elements ofV, V+ are called stochastic sub- and supersolutions. By the superposi- tion with the state process,uandwgenerate sub- and supermartingale-like processes.

Similarly to the classical Perron method [14, Sections 2.8, 6.3], the setV (resp.,V+) is directed upward (resp., downward) with respect to the pointwise maximum (resp., minimum) operation. The essence of the method is to prove that the functions

u(x) = sup

u∈V

u(x), w+(x) = inf

w∈V+w(x)

*Institute of Mathematics, Mechanics and Computer Sciences, Southern Federal University, Rostov-on- Don, Russia. E-mail:[email protected]

(2)

are respectively viscosity super- and subsolutions of the related HJB equation. If a comparison result, providing the inequalityu ≥w+, holds true, it follows thatu=v= w+is a unique (continuous) viscosity solution. This construction differs from Perron’s method of [17], which is not linked to the value function.

In the present paper we consider the stochastic control problem with state con- straints in the form of [21]. In contrast to [23], where the drift is not assumed to be bounded, and the value function is singular near the boundary, in [21] the problem is

"regular". To achieve the regularity it is assumed that the diffusion coefficient depends on the control and degenerates at the boundary. The same problem was considered in [18, 11]. It was proved that under appropriate assumptions the value function v is a unique continuous constrained viscosity solution of the HJB equation. (The term "con- strained" means, in particular, thatvsatisfies special boundary conditions, which in the deterministic situation were introduced in [27].) Roughly speaking, it is enough to as- sume that for each boundary point there exists a control, which kills the diffusion and directs the drift strictly inside the domain.

An application of the stochastic Perron method to state constrained problems seems rather interesting, since, as it is mentioned in [21], a direct proof of the dynamic pro- gramming principle is not available due to a complicated structure of admissible control processes, retaining a phase trajectory in a predetermined domain. Different penaliza- tion and approximation procedures were used instead in [21, 18, 11, 10].

We turn to the precise statement of our main result (Theorem 1.2). Let Ω be the space C([0,∞),Rm) of continuous Rm-valued functions, endowed with the σ-algebra F of cylindrical sets, and let P be the Wiener measure on F. So, the canonical processWs(ω) =ω(s)is the standardm-dimensional Brownian motion underP. Denote byF = (Ft)t≥0 the natural filtration ofW, and letF= (Ft)t≥0 be the correspondent minimal augmented filtration. The extension of the Wiener measure to the completion F ofFis still denoted byP.

Letαbe anF-progressively measurable stochastic process with values in a compact setA⊂Rk,0∈A. Consider the system of stochastic differential equations

dXt=b(Xt, αt)dt+σ(Xt, αt)dWt, X0=x. (1.1) We assume that the drift vectorb:Rd×A7→Rd and the diffusion matrixσ:Rd×A7→

Rd×Rmare continuous and satisfy the Lipschitz condition

|b(x, a)−b(y, a)|+|σ(x, a)−σ(y, a)| ≤K|x−y|

with some constantKindependent ofx,y,a. Note, that the linear growth condition

|b(x, a)|+|σ(x, a)| ≤K0(1 +|x|)

follows from the continuity ofb, σand compactness of A. Thus, there exist a unique F-adapted strong solutionXx,αof (1.1) on[0,∞): see [22, Chapter 2, Sect. 5].

LetG⊂Rdbe a closed set with the boundary∂Gand nonempty interiorG. It will be convenient to assume that0∈G. Denote byA(x),x∈Gthe set ofF-progressively measurable control processes αwith values in A and such thatXtx,α ∈ G, t ≥ 0 a.s.

Elements ofA(x) are calledadmissible controls for the initial condition x. The cost functionalJ and the value functionvare defined as follows

J(x, α) =E Z

0

e−βsf(Xsx,α, αs)ds, v(x) = inf

α∈A(x)

J(x, α), (1.2) wheref :G×A7→Ris a bounded continuous function.

(3)

We assume that for any initial condition x ∈ Gthere exists an admissible control:

A(x)6=∅. In this case the setGis calledviable. A necessary condition for the validity of this property is given in [2] (Theorem 1). Let

NG2(x) =

(p, Y)∈Rd×Sd: lim inf

G3y→x

p·(y−x)

|y−x|2 +1 2

Y(y−x)·(y−x)

|y−x|2

≥0

be the second order normal cone. HereSd is the set of symmetricd×dmatrices. If the setGis viable then for allx∈∂G,(p, Y)∈NG2(x)there exista∈Asuch that

p·b(x, a) +1

2Tr (σ(x, a)σT(x, a)Y)≥0. (1.3) See [2, Section 3] for more concrete forms of this condition.

We impose a slightly stronger requirement. For any functionψ:Rd7→Aput

bψ(x) =b(x, ψ(x)), σψ(x) =σ(x, ψ(x)). (1.4) Assumption 1.1.There exist a Borel measurable functionψ:Rd7→Asuch thatbψψ

are globally Lipschitz continuous and p·bψ(x) +1

2Tr (σψ(x)σTψ(x)Y)≥0, x∈∂G, (p, Y)∈NG2(x).

Under this assumption there exist a unique strong solution of the equation

dXt=bψ(Xt)dt+σψ(Xt)dWt, X0=x (1.5) and Xt ∈ G, t ≥ 0 a.s.: see [1, Theorem 3.1]. The correspondent control process αt=ψ(Xt)is admissible forx. Hence,A(x)6=∅,x∈G.

Consider the Bellman operator F(x, r, p, Y) = sup

a∈A

βr−f(x, a)−b(x, a)·p−1

2Tr (σ(x, a)σT(x, a)Y)

,

defined onR×R×Rd×Sd. Recall that a bounded upper semicontinuous (usc) function uis called aviscosity subsolutionof the equation

F(x, u, Du, D2u) = 0 (1.6)

on a setE⊂Rdif for anyϕ∈C2(Rd)and for any local maximum pointx0ofu−ϕonE the inequality

F(x0, u(x0), Dϕ(x0), D2ϕ(x0))≤0

holds true. In the same way, a bounded lower semicontinuous (lsc) functionwis called aviscosity supersolution of (1.6) onE if for anyϕ∈C2(Rd)and for any local minimum pointx0ofw−ϕonE we have the inequality

F(x0, w(x0), Dϕ(x0), D2ϕ(x0))≥0.

In these definitions one can assume that the maximum (resp., minimum) pointx0is strict andϕ(x0) =u(x0)(resp.,ϕ(x0) =w(x0)).

It is convenient to introduce thestate constrainedproblem F(x, u, Du, D2u)≤0 on G,

F(x, u, Du, D2u)≥0 on G. (1.7)

(4)

We say that a bounded usc (resp., lsc) functionu, defined onG, is viscosity subsolution (resp., supersolution) of the state constrained problem (1.7) ifF(x, u, Du, D2u)≤0on G (resp., F(x, u, Du, D2u) ≥ 0 on G) in the viscosity sense. A bounded function u is called a viscosity solution of (1.7) (or aconstrained viscosity solution), if its upper semicontinuous envelope u is a viscosity subsolution, and its lower semicontinuous envelopeuis a viscosity supersolution of (1.7).

Denote byΓthe set of pointsx∈∂Gsuch that for someα∈A(x)the solutionXx,α of (1.1) immediately entersGwith probability1:

P(inf{t >0 :Xtx,α∈G}= 0) = 1.

Theorem 1.2.There exist a viscosity subsolutionw+ and a viscosity supersolutionu of the state constrained problem (1.7) such that

u≤v on G; v≤w+ on G, andv(x)≤lim supG3y→xw+(y),x∈Γ.

The nature ofw+ andu is not explicitly indicated here. Their construction, which is presented in Sections 2 and 3 respectively, is based on the technique of stochastic semisolutions, developed in [5, 6, 7]. The details are quite similar to [6, 24]. One only should take care of admissibility of controls.

Theorem 1.2 is useful if a sort of comparison result is available, and one can conclude thatw+≤u. In Section 4 we consider the case of a smooth domain and, under some additional assumptions, mention that such inequality follows from the known result, concerning the boundary behavior of viscosity subsolutions of linear equations [3], and the comparison result of [21]. In combination with Theorem 1.2 this allows to identifyv with a unique continuous viscosity solution of (1.7). The related result (Theorem 4.1) is not new and is presented only to demonstrate the capabilities of the stochastic Perron method.

2 Stochastic supersolutions

ForF-stopping timesτ,σand a setD∈ Fτ denote by

Jτ, σK={(t, ω)∈[0,∞)×Ω :τ(ω)≤t≤σ(ω)}

the stochastic interval, and by

τD=τ ID+ (+∞)IDc, Dc= Ω\D

the restriction ofτ onD. Put Bε(x) ={y ∈Rd : |y−x|< ε} and denote byBε(x)the closure of this ball.

Let τ : Ω 7→[0,∞]be a stopping time and take anFτ-measurable random vectorξ such thatξI{τ <∞}is bounded andξ∈Gon{τ <∞}. For anF-progressively measurable processαwith values inAconsider the stochastic differential equation (1.1) with the randomized initial condition(τ, ξ):

Xt=ξI{t≥τ}+ Z t

τ

b(Xs, αs)ds+ Z t

τ

σ(Xs, αs)dWs, t≥0. (2.1) ByRt

τ(·)we meanRt

0I{s≥τ}(·). As is known, see [22, Chapter 2, Sect. 5], there exists a pathwise unique strong solutionXτ,ξ,α of (2.1). The trajectories of the processXτ,ξ,α are continuous on the stochastic intervalJτ,∞K. Moreover,Xτ,ξ,α= 0onJ0, τJand

Xττ,ξ,α= lim

t&τXtτ,ξ,α=ξ on {τ <∞}.

(5)

Denote byA(τ, ξ)the set of progressively measurable control processesαsuch that αt ∈Aand Xtτ,ξ,α ∈G, t∈ [τ,∞)a.s. That is,A(τ, ξ)is the set of admissible controls for a randomized initial condition(τ, ξ). We omit indexτ ifτ= 0. For instance,Xx,α= X0,x,α,A(x) =A(0, x).

Lemma 2.1.Under Assumption 1.1 the set A(τ, ξ) is non-empty for any randomized initial condition(τ, ξ).

Proof. For anF-stopping timeτ0theσ-algebraFτ0is countably generated ([29, Lemma 1.3.3]), and there exists a regular conditional probability distributionPτ0 = (Pτ0)ω∈Ω of Pwith respect to Fτ0: see [29, Theorem 1.3.4] or [28, Theorem 9.2.1]. For each B ∈ F the function ω 7→ Pτ0(B) is Fτ0-measurable, for each ω ∈ Ω the function B7→Pτ0(B)is a probability measure onF such that

Pτ0(B) =E(IB|Fτ0)(ω) P-a.s., B∈F. Moreover, there exists aP-null setN ∈Fτ0 with the property that

Pτ0(C) =IC(ω) for all ω6∈N, C ∈Fτ0. (2.2) Consider the SDE

Xt=ξI{t≥τ}+ Z t

τ

bψ(Xs)ds+ Z t

τ

σψ(Xs)dWs, t≥0, (2.3)

whereψsatisfies Assumption 1.1. To work withPτ0, related to the raw filtrationF, we pass fromξI{t≥τ} to an indistinguishableF-adapted process of the same form. Recall that anyF-stopping time is predictable (see [4, Proposition 16.22]) and the filtrationFis quasi-left continuous (see [15, Theorem 3.40]), that is,Fτ− =Fτ for any (predictable) F-stopping timeτ. By Theorem IV.78 of [13] there exists anFstopping timeτ0such that P(τ0 6=τ) = 0, and for anyB ∈Fτ−=Fτthere existsB0∈Fτsuch thatP(IB0 6=IB) = 0. It easily follows that the process ξI{t≥τ} is indistinguishable from an F-adapted processξ0I{t≥τ0}with someFτ0-measurableξ0.

PutZt0=t−t∧τ,Zt=Wt−Wt∧τ. The processZ is a continuous martingale under P, and we can rewrite equation (2.3) in the form

Xt=Ht+ Z t

0

bψ(Xs)dZs0+ Z t

0

σψ(Xs)dZs, t≥0, (2.4) whereHt0I{t≥τ0}.

Recall the pathwise construction of a strong solution, presented in [20] (see also [9, 19]). Denote byD=D([0,∞),Rd)the set of functions from[0,∞)toRd, which are right continuous and have left limits. There exist a mappingS :D×C([0,∞),Rm)7→D such that ifZis a continuous semimartingale on a filtered probability space(Ω,F,Q,F), whereFsatisfies the usual conditions, and ifHis anF-adapted process with trajectories inD, then

Xt(ω) =S(H·(ω), Z·(ω))t is a strong solution of (2.4).

Takeω∈Ω\N withτ0(ω)<∞. Note thatZ is aPτ0-martingale, andZ is the stan- dardd-dimensionalPτ0-Brownian motion on[τ0(ω),∞). It follows thatX is a strong solution of (2.4) underPτ0 with respect to thePτ0-augmentation ofF. Moreover, by (2.2) we get

Pτ0({ω:τ0(ω) =τ0(ω), ξ0(ω) =ξ0(ω)}) = 1.

(6)

Hence, underPτ0, the processH is indistinguishable from ξ0(ω)I{t≥τ0(ω)}, andX is a strong solution of the SDE with a non-random initial condition:

Xt0(ω) + Z t

τ0(ω)

bψ(Xs)ds+ Z t

τ0(ω)

σψ(Xs)dWs, t≥τ0(ω).

In addition,Xt = 0,t∈[0, τ0(ω))Pτ0-a.s. sinceZ0, Z,H are indistinguishable from 0 on[0, τ0(ω)).

By Assumption 1.1 the diffusion coefficientsbψψsatisfy conditions of Theorem 3.1 of [1]. Since0 ∈Gandξ0(ω)∈G, we conclude thatXt∈G, t≥0Pτ0-a.s. It follows thatGis invariant underP:

P(Xt∈G, t≥0) =E

I0(ω)<∞}Pτ0(Xt∈G, t≥0)

= 1.

The desired control processα∈A(τ, ξ)is given by the formulaα=ψ(X).

Letwbe a uniformly bounded continuous function:w∈Cb(G). Consider the stochas- tic process

Ztτ,ξ,α(w) = Z t

τ

e−βsf(Xsτ,ξ,α, αs)ds+I{t≥τ}e−βtw(Xtτ,ξ,α).

Definition 2.2.We say that a control processα∈ A(τ, ξ)isw-suitablefor(τ, ξ)if E(Zρτ,ξ,α(w)|Fτ)≤Zττ,ξ,α(w) =e−βτw(ξ)

for any stopping timeρ≥τ. A functionw∈Cb(G)is called a stochastic supersolution of (1.7) if for any randomized initial condition(τ, ξ)withξ∈Gthere exists aw-suitable controlα.

The set of stochastic supersolutions is denoted byV+. Note that in the above defi- nition the valuesXare irrelevant, sinceZ=R

0 e−βsf(Xsτ,ξ,α, αs)ds. We emphasize also that the conditionA(τ, ξ)6=∅for all randomized initial conditions (τ, ξ), ξ∈G is necessary for the existence of stochastic supersolutions.

A stochastic supersolutionw is an upper bound for the value function (1.2) onG. To see this putτ = 0, ξ =x∈G,ρ =∞and take aw-suitable controlα ∈A(x). By Definition 2.2, with the conventionZx,α=Z0,x,α, we get

v(x)≤J(x, α) =EZx,α(w)≤EZ0x,α(w) =w(x).

The setV+is non-empty and contains sufficiently large constantsc: it is easy to see that

E(Zρτ,ξ,α(c)|Fτ)≤ce−βτ =Zττ,ξ,α(c) for c≥f /β, wheref = sup(x,a)∈G×Af(x, a).

Lemma 2.3.Ifw1, w2 are stochastic supersolutions thenw = w1∧w2 is a stochastic supersolution.

Proof. Letαi∈A(τ, ξ),i= 1,2bewi-suitable controls for a randomized initial condition (τ, ξ). PutA1={w1(ξ)< w2(ξ)} ∈Fτ,A2=Ac1:= Ω\A1. We claim that

α=IA1I{τ≤t}α1+IA2I{τ≤t}α2

belongs toA(τ, ξ)and that it isw-suitable.

(7)

The process Y = P2

i=1Xtτ,ξ,αiIAi satisfy the same equation as Xτ,ξ,α. From the pathwise uniqueness property it follows that Y = Xτ,ξ,α. We have Xτ,ξ,α ∈ G, t ≥ τ P-a.s., andαisw-suitable for(τ, ξ):

E(Zρτ,ξ,α(w)|Fτ) =

2

X

i=1

E(IAiZρτ,ξ,αi(w)|Fτ)≤

2

X

i=1

IAiE(Zρτ,ξ,αi(wi)|Fτ)

2

X

i=1

IAie−βτwi(ξ) =e−βτw(ξ).

The following result was used in [5, 6, 24] (see, e.g., Lemmas 2 and 4 of [24]). Its proof use only the fact thatV+ is directed downward, that is, the statement of Lemma 2.3 holds true.

Lemma 2.4.There exists a sequencewn∈ V+,wn(x)≥wn+1(x),x∈Gsuch that

n→∞lim wn(x) =w+(x) := inf

u∈V+w(x).

The next assertion is the most important part of the stochastic Perron method.

Lemma 2.5.The function

w+(x) = inf

w∈V+w(x) is a viscosity subsolution of (1.7).

Proof. If w+ is not a viscosity subsolution then there existx0 ∈ G, ϕ∈ C2and ε > 0 such thatw+(x0) =ϕ(x0),w+< ϕon the setBε(x0)\{0} ⊂Gand

F(x0, ϕ(x0), Dϕ(x0), D2ϕ(x0))>0.

Hence, there exists somea∈Asuch thatβϕ(x0)−(Laϕ)(x0)−f(x0, a)>0, where (Laϕ)(x) =b(x, a)Dϕ(x) +1

2Tr σ(x, a)σT(x, a)D2ϕ(x) .

By the continuity ofb,σ,f we may assume that

βϕ(x)−(Laϕ)(x)−f(x, a)>0, x∈Bε(x0)⊂G (2.5) for someε >0.

Sincew+is upper semicontinuous, we have

w+(x)−ϕ(x)≤ −δ <0, x∈Sε:=Bε(x0)\Bε/2(x0).

By Lemma 2.4 there exists a decreasing sequencewn∈ V+,wn&w+. The sets An={x∈Sε:wn(x)−ϕ(x)≥ −δ0}, δ0∈(0, δ)

are compact,An ⊃An+1 and∩n=1An =∅. Thus,∩Nn=1An =∅for someN. This means that there exists a functionw=wN ∈ V+such thatw−ϕ <−δ0 onSε.

Define the function ϕη = ϕ−η, whereη ∈ (0, δ0)is such that the inequality (2.5) holds true forϕη instead ofϕ. Note that

w−ϕη=w−ϕ+η <−δ0+η <0 on Sε. We claim that

wη =

ϕη∧w on Bε(x0),

w otherwise

(8)

is a stochastic subsolution. This gives a contradiction with the definition of w+ since wη(x0) =ϕη(x0) =w+(x0)−η < w+(x0).

It is clear thatwη ∈Cb(G). We only need to construct awη-suitable controlαfor a randomized initial condition(τ, ξ),ξ∈G. Put

U ={x∈Bε/2(x0) :w(x)> ϕη(x)}, H={ξ∈U} ∈Fτ

and define a progressively measurable process

αt= (aIH0tIHc)I{t≥τ}∈A, whereα0is aw-suitable control for(τ, ξ). Furthermore, put

τ1= inf{t≥τ :Xtτ,ξ,α6∈Bε/2(x0)}, αttI{t≤τ1}1tI{t>τ1},

whereα1is aw-suitable control for(τ1, ξ1),ξ1=Xττ,ξ,α

1 I1<∞}. We haveXτ,ξ,α=Xτ,ξ,α on the stochastic intervalJτ, τ1K andXτ,ξ,α =Xτ111 on Jτ1,∞K. Thus, α∈ A(τ, ξ). Note also that forE ={ξ∈Bε/2(x0)}we get

Xτ,ξ,α∈Bε/2(x0) on JτE,(τ1)EK; Xτ,ξ,α=ξ on JτEc,(τ1)EcK.

It remains to show thatαis awη-suitable control for(τ, ξ). For a stopping timeρ≥τ putD={ρ > τ1}. We have

Zρτ,ξ,α(wη)ID=ID Z τ1

τ

e−βsf(Xsτ,ξ,α, αs)ds

+ID

Z ρ

τ1

e−βsf(Xsτ111, α1s)ds+e−βρwη(Xρτ111)

≤ID

Z τ1

τ

e−βsf(Xsτ,ξ,α, αs)ds+IDZρτ111(w). (2.6) By Definition 2.2 we get

E(Zρτ111(w)ID|Fτ1) =E(Zρτ111

D (w)ID|Fτ1)≤IDe−βτ1w(ξ1)

=IDe−βτ1wη1). (2.7)

The last equality follows from the fact thatξ16∈Bε/2(x0)on the set{ρ > τ1}andw=wη onG\Bε/2(x0). From (2.6), (2.7) it follows that

E(Zρτ,ξ,α(wη)ID|Fτ1)≤ID

Z τ1

τ

e−βsf(Xsτ,ξ,α, αs)ds+e−βτ1wη1)

=IDZττ,ξ,α

1 (wη), and we obtain the estimate

E(Zρτ,ξ,α(wη)|Fτ) =E(I{ρ≤τ1}Zρτ,ξ,α(wη)|Fτ) +E(I{ρ>τ1}E(Zρτ,ξ,α(wη)|Fτ1)|Fτ)

≤E(I{ρ≤τ1}Zρτ,ξ,α(wη)|Fτ) +E(I{ρ>τ1}Zττ,ξ,α1 (wη)|Fτ)

=E(Zρ∧ττ,ξ,α1(wη)|Fτ). (2.8)

On the stochastic intervalJτH,(τ1)HKthe trajectories ofXτ,ξ,α do not leave the ball Bε/2(x0). Hence, the estimate wη(Xρ∧ττ,ξ,α1)≤ϕη(Xρ∧ττ,ξ,α1)holds true onH and we get the inequality

Zρ∧ττ,ξ,α1(wη) =Zρ∧ττ,ξ,a1(wη)IH+Zρ∧ττ,ξ,α10(wη)IHc≤Zρ∧ττ,ξ,a1η)IH+Zρ∧ττ,ξ,α10(w)IHc. (2.9)

(9)

Applying Ito’s formula Ztτ,ξ,aη) =

Z t

τ

e−βsf(Xsτ,ξ,a, a)ds+e−βtϕη(Xtτ,ξ,a)

=e−βτϕη(ξ) + Z t

τ

e−βs

f(Xsτ,ξ,a, a) + (Laϕη−βϕη)(Xsτ,ξ,a) ds

+ Z t

τ

e−βsϕηx(Xsτ,ξ,a)·σ(Xsτ,ξ,a, a)dWs. (2.10) on the intervalJτ, ρ∧τ1K, taking the conditional expectation, and using (2.5), we get

E(Zρ∧ττ,ξ,a1η)IH|Fτ)≤e−βτϕη(ξ)IH=e−βτwη(ξ)IH=Zττ,ξ,α(wη)IH. (2.11) Furthermore,

E(Zρ∧ττ,ξ,α10(w)|Fτ)IHc≤Zττ,ξ,α0(w)IHc=Zττ,ξ,α(wη)IHc (2.12) by the definition ofα0. The combination of (2.11), (2.12) with (2.9) and (2.8) gives the desired inequality

E(Zρτ,ξ,α(wη)|Fτ)≤Zττ,ξ,α(wη).

To show thatw+ satisfies the last assertion of Theorem 1.2, we study its behavior near the points ofΓ. Fixx∈Γ. By the definition ofΓthere existsα1∈A(x)such that

τ= inf{t >0 :Xtx,α1∈G}= 0 a.s. (2.13) Forε >0consider the predictable set

E={(t, ω) :Xtx,α1(ω)∈G, t∈(0, ε]}=K0, εK∩ Xx,α1−1

(G)

and its projection:D={ω: (t, ω)∈E for some t∈[0,∞)}. The equality (2.13) means that P(D) = 1. By the section theorem [4, Theorem 16.12] there exist an F-stopping timeσεsuch that

{(σε(ω), ω) :ω∈Ω, σε(ω)<∞} ⊂E, P(σε<∞)≥1−ε. (2.14) PutDε={σε≤ε}={σε<∞}. Then (2.14) means that

Xσx,αε 1 ∈G on Dε, P(Dε)≥1−ε.

Let wbe a stochastic supersolution, bounded from above by the constantf /β. Put ξε=IDεXσx,αε 1 ∈G and take aw-suitable controlα2∈A(σε, ξε). Then

α=α1I{t<σε}2I{t≥σε}∈A(x).

Taking into account thatσε=∞onDεc, by the definitions ofvandwwe obtain:

v(x)≤E Z σε

0

e−βtf(Xtx,α1, αt1)dt+E Z

σε

e−βtf(Xtσεε2, α2t)dt Fσε

! ,

≤E Z σε

0

e−βtf(Xtx,α1, αt1)dt+e−βσεw(ξε)

!

It easily follows that v(x)≤ f

β

1−Ee−βσε

+Ee−βσεw(ξε)IDε+f

β(1−P(Dε)). (2.15)

(10)

Moreover, by Lemma 2.4 and the monotone convergence theorem we can changewto w+in this inequality.

Takeεn such thatP(Dεc

n)≤1/2n. By the Borel-Cantelli lemma for allω in some set Ω0withP(Ω0) = 1we haveω∈Dεn for sufficiently largen. Thus,

IDεn →1, ξεn→x, σεn→0 on Ω0, and from (2.15) we obtain the estimatev(x)≤lim supG3y→xw+(y).

3 Stochastic subsolutions

Definition 3.1.With the notation of Section 2 we callu∈Cb(G)a stochastic subsolu- tionif

E(Zρτ,ξ,α(u)|Fτ)≥Zττ,ξ,α(u) =e−βτu(ξ) (3.1) for any randomized initial condition(τ, ξ), admissible control processα ∈A(τ, ξ)and stopping timeρ≥τ.

Any stochastic subsolutionuis a lower bound forv: forτ= 0,ξ=x,ρ=∞we have J(x, α) =EZx,α(u)≥Z0x,α(u) =u(x), α∈A(x).

Putf = inf(x,a)∈G×Af(x, a).The setV of stochastic subsolutions is non-empty and contains sufficiently large negative constantsc. Indeed, it is easy to see that

E(Zρτ,ξ,α(c)|Fτ)≥ce−βτ for c≤f /β.

Lemma 3.2.Letu1,u2be stochastic subsolutions. Thenu1∨u2is a stochastic subsolu- tion.

The proof follows from the inequality E(Zρτ,ξ,α(u1∨u2)|Fτ)≥max

i=1,2E(Zρτ,ξ,α(ui)|Fτ)≥max

i=1,2Zττ,ξ,α(ui) =e−βτ(u1∨u2)(ξ).

Lemma 3.3.There exists a sequenceun ∈ V,un(x)≤un+1(x),x∈Gsuch that

n→∞lim un(x) =u(x) := sup

u∈V

u(x).

This lemma is analogous to Lemma 2.4.

Lemma 3.4.The function

u(x) = sup

u∈V

u(x)

is a viscosity supersolution of (1.7).

Proof. Ifu is not a viscosity supersolution then there existx0 ∈G,ϕ∈C2 andε >0 such thatu(x0) =ϕ(x0),u> ϕon(Bε(x0)\{0})∩Gand

F(x0, ϕ(x0), Dϕ(x0), D2ϕ(x0))<0.

By the continuity ofF we can assume that

F(x, ϕ(x), Dϕ(x), D2ϕ(x))<0, x∈Bε(x0)∩G. (3.2) Furthermore, by the lower-semicontinuity ofuwe have

u(x)≥ϕ(x) +δ, x∈Sε:= Bε(x0)\Bε/2(x0)

∩G

(11)

for someδ >0. In the same way as in the proof of Lemma 2.5, one can show that there existu∈ V andδ0∈(0, δ)such thatu≥ϕ+δ0onSε.

Take anη ∈(0, δ0)such that (3.2) holds true forϕη =ϕ+η instead ofϕ. We have u−ϕη ≥δ0−η >0onSε.

To get a contradiction it is enough to prove that the function uη=

ϕη∨u on Bε(x0)∩G,

u otherwise

is a stochastic subsolution, sinceuη(x0) =ϕη(x0)> u(x0), contrary to the definition of u.

Clearly uη ∈ Cb(G), and we only should to verify (3.1) for any randomized initial condition(τ, ξ), control processα∈A(τ, ξ)and stopping timeρ≥τ. Put

τ1= inf{t≥τ:Xtτ,ξ,α6∈Bε/2(x0)}, ξ1=Xττ,ξ,α

1 I1<∞}, E={ξ∈Bε/2(x0)}.

We have

ξ1∈∂Bε/2(x0)∩G on E∩ {τ1<∞}; ξ1= 0 on E∩ {τ1=∞}; ξ1=ξ on Ec. Moreover,Xτ11=Xτ,ξ,αon the stochastic intervalJτ1,∞K.

PutD={ρ > τ1}. Similarly to (2.6) we get Zρτ,ξ,α(uη)ID≥ID

Z τ1

τ

e−βsf(Xsτ,ξ,α, αs)ds+IDZρτ11(u). (3.3) Applying Definition 3.1, we obtain

E(Zρτ11(u)ID|Fτ1) =E(Zρτ11

D (u)ID|Fτ1)≥IDe−βτ1u(ξ1) =IDe−βτ1uη1). (3.4) The last equality follows from the fact thatξ1, restricted to D, takes values in the set G\Bε/2(x0)whereu=uη.

From (3.3), (3.4) it follows that E(Zρτ,ξ,α(uη)ID|Fτ1)≥ID

Z τ1

τ

e−βsf(Xsτ,ξ,α, αs)ds+e−βτ1uη1)

=IDZττ,ξ,α1 (uη). (3.5)

By (3.5) we have

E(Zρτ,ξ,α(uη)|Fτ) =E(I{ρ≤τ1}Zρτ,ξ,α(uη)|Fτ) +E(I{ρ>τ1}E(Zρτ,ξ,α(uη)|Fτ1)|Fτ)

≥E(I{ρ≤τ1}Zρτ,ξ,α(uη)|Fτ) +E(I{ρ>τ1}Zττ,ξ,α

1 (uη)|Fτ)

=E(Zρ∧ττ,ξ,α1(uη)|Fτ). (3.6)

Put

U ={x∈G∩Bε/2(x0) :ϕη(x)> u(x)}, H={ξ∈U} ∈Fτ.

On the stochastic intervalJτH,(ρ∧τ1)HKthe trajectories ofXτ,ξ,αdo not leave the set Bε/2(x0)∩G. Hence, we haveuη(Xρ∧ττ,ξ,α1)IH ≥ϕη(Xρ∧ττ,ξ,α1)IHand

Zρ∧ττ,ξ,α1(uη)≥Zρ∧ττ,ξ,α1η)IH+Zρ∧ττ,ξ,α1(u)IHc. (3.7) Apply Ito’s formula (2.10) on the intervalJτ, ρ∧τ1Kwithαinstead ofa. Taking the conditional expectation and using (3.2), we get

E(Zρ∧ττ,ξ,α1η)IH|Fτ)≥e−βτϕη(ξ)IH =e−βτuη(ξ)IH=Zττ,ξ,α(uη)IH. (3.8)

(12)

Furthermore,

E(Zρ∧ττ,ξ,α1(u)|Fτ)IHc ≥Zττ,ξ,α(u)IHc =Zττ,ξ,α(uη)IHc, (3.9) and the desired inequality

E(Zρτ,ξ,α(uη)|Fτ)≥Zττ,ξ,α(uη)

follows from (3.8), (3.9), combined with (3.6), (3.7).

4 The case of a smooth domain

LetGcoincide with the closure ofG, and assume that∂Gis of classC2. Then the distance functionρfrom∂G:

ρ(x) = inf{y∈Gc :|y−x|}, x∈G

is of classC2 in a neighbourhood of∂G(see [14, Lemma 14.16]). Put−n(x) =Dρ(x), x∈G. Ifx∈∂G,n(x)is the unit outer normal to∂Gat x. It is shown in [1, Example 3.2], [2, Example 1] that condition (1.3) is reduced to the following: for any x ∈ ∂G there existsa∈Asuch that

σT(x, a)n(x) = 0, −n(x)·b(x, a) +1

2Tr σ(x, a)σT(x, a)D2ρ(x)

≥0.

To get a comparison result we need a stronger condition, presented in the next theorem.

Theorem 4.1.Assume that there exists a Borel measurable functionψ : G7→ Asuch that the functions (1.4) are globally Lipschitz continuous and

σψ(x) = 0, −n(x)·bψ(x)>0, x∈∂G. (4.1) Then the value functionv, defined by (1.2), is the unique continuous viscosity solution of the state constrained problem (1.7).

Proof. The viscosity subsolutionw+, specified in Theorem 1.2, satisfies also the linear inequality

βw+(x)−f(x, ψ(x))−(bψ·Dw+)(x)−1

2Tr (σψσψTD2w+)(x)≤0, x∈G (4.2) in the viscosity sense. Consider the function

we+(x) =

( lim sup

G3y→x

w+(y) x∈∂G, w+(x) otherwise.

Clearly,we+is a viscosity subsolution of (4.2), satisfying all conditions of Theorem 1.2.

Now we use conditions (4.1). By Lemma 4.1 of [3] the function we+ is a viscosity subsolution of (4.2) onG. Furthermore, by Theorem 4.1(ii) of [3], for anyx∈∂Gthere exists a sequencexk∈G,xk→xsuch thatwe+(x) = limk→∞we+(xk)and

lim sup

k→∞

|xk−x|

d(xk) <∞, or, equivalently,

lim sup

k→∞

(xk−x)·n(x)

|xk−x| ≤ −β

(13)

for some β ∈ (0,1). This is the nontangential upper semicontinuity property of we+, which, by the comparison result of [21] (Theorem 2.2), implies that

we+≤u onG. (4.3)

Let us prove that∂G= Γ. Forx∈∂Gdenote byX the solution of the equation Xt=x+

Z t

0

bψ(Xs)ds+ Z t

0

σψ(Xs)dWs, x∈∂G.

Since conditions (4.1) imply the viability, we get an admissible control αt = ψ(Xt): Xt=Xtx,α∈G,t≥0a.s. Takeε >0such thatρ∈C2(Bε(x))and

inf

y∈Bε(x)∩G

−n(y)·bψ(y) +1

2Tr σψ(y)σTψ(y)D2ρ(y)

>0.

Furthermore, putτ = inf{t≥0 :Xt6∈Bε(x)}. By Ito’s formula we have ρ(Xt∧τ) =ρ(x)−

Z t∧τ

0

n(Xs)·bψ(Xs)ds+1 2

Z t∧τ

0

Tr σψ(XsψT(Xs)D2ρ(Xs)

ds+Mt,

whereM is a continuous martingale with M0 = 0. From the representation ofM as a time-changed Brownian motion on an extended filtered probability space (see [16, Theorem 7.2’]) it follows that 0 is a limit point of the set{t > 0 : Mt = 0} a.s. For a sequencetk(ω)→0withMtk= 0we have

ρ(Xtk) =ρ(x) + Z tk

0

−n(Xs)·bψ(Xs)ds+1

2Tr σψ(XsTψ(Xs)D2ρ(Xs)

ds >0 a.s.

for sufficiently largek. Thus,X immediately entersG:

inf{t >0 :Xt∈G}= inf{t >0 :ρ(Xt)>0}= 0 a.s., and we conclude thatx∈Γand∂G= Γ.

This fact, together with Theorem 1.2 and inequality (4.3), implies that v≤we+≤u≤v on G.

Hence,v=we+=uis a continuous function, and it satisfies (1.7) in the viscosity sense.

Note also that the uniqueness of a continuous constrained viscosity solution is a more classical result: see [12, Theorem 7.10].

Theorem 4.1 is similar to Theorem 4.1 of [21]. Although, the second condition (4.1) is presented there in the form

−n(x)·bψ(x) +1

2Tr σψ(x)σTψ(x)D2ρ(x)

≥c >0, x∈∂G,

which is formally not comparable to ours local condition−n(x)·bψ(x)>0,x∈∂G, the result of [21] is more sophisticated. To get the comparison result in Theorem 4.1 we used only the fact that any subsolution, being suitably modified at the boundary points, possesses the nontangential upper semicontinuity property under conditions (4.1). In [21] it is shown that a subsolutionu≥v with this property exists even some diffusion in the tangent direction to∂Gis allowed: see conditions A3 of [21].

Certainly, the stochastic Perron method can be applied in the case of finite horizon as well. However, some work is required to study the parabolic problem, corresponding

(14)

to (1.7). In particular, a new boundary condition at the terminal time appears, and the viability notion should be modified. Such a problem was studied in [10] by another methods. We mention a comparison result, ensuring the continuity of the value function, proved under conditions similar to (4.1): see [10, Theorem A.1].

Acknowledgments.The author thanks the anonymous referees for useful remarks and for pointing out an error in the previous proof of Lemma 2.1. The research is supported by Southern Federal University, project 213.01-07-2014/07.

References

[1] M. Bardi and P. Goatin,Invariant sets for controlled degenerate diffusions: a viscosity so- lutions approach, Stochastic Analysis, Control, Optimization and Applications (W.M. McE- neaney, G.G. Yin, and Q. Zhang, eds.), Systems & Control: Foundations & Applications, Birkhäuser Boston, 1999, pp. 191–208. MR-1702960

[2] M. Bardi and R. Jensen,A geometric characterization of viable sets for controlled degener- ate diffusions, Set-Valued Analysis10(2002), no. 2-3, 129–141. MR-1926377

[3] G. Barles and E. Rouy, A strong comparison result for the Bellman equation arising in stochastic exit time control problems and its applications, Comm. Partial Differential Equa- tions22(1998), no. 11-12, 1995–2033. MR-1662164

[4] R.F. Bass, Stochastic processes, Cambridge University Press, Cambridge, 2011. MR- 2856623

[5] E. Bayraktar and M. Sîrbu,Stochastic Perron’s method and verification without smoothness using viscosity comparison: the linear case, Proc. Amer. Math. Soc.140 (2012), no. 10, 3645–3654. MR-2929032

[6] ,Stochastic Perron’s method for Hamilton-Jacobi-Bellman equations, SIAM J. Control Optim.51(2013), no. 6, 4274–4294. MR-3124891

[7] ,Stochastic Perron’s method and verification without smoothness using viscosity com- parison: Obstacle problems and Dynkin games, Proc. Amer. Math. Soc.142(2014), no. 4, 1399–1412. MR-3162260

[8] E. Bayraktar and Y. Zhang,Stochastic Perron’s method for the probability of lifetime ruin problem under transaction costs, Preprint arXiv:1404.7406v1 [math.OC], 24 pages, 2014.

[9] K. Bichteler,Stochastic integration andLp-theory of semimartingales, Ann. Prob.9(1981), no. 1, 49–89. MR-0606798

[10] B. Bouchard and M. Nutz,Weak dynamic programming for generalized state constraints, SIAM J. Control Optim.50(2012), no. 6, 3344–3373. MR-3024163

[11] D. Buckdahn, M. Goreac, and M. Quincampoix,Stochastic optimal control and linear pro- gramming approach, Appl. Math. Optim.63(2011), no. 2, 257–276. MR-2772196

[12] M. Crandall, H. Ishii, and P.-L. Lions,User’s guide to viscosity solutions of second-order partial differential equations, Bull. Amer. Math. Soc.27(1992), no. 1, 1–67. MR-1118699 [13] C. Dellacherie and P.-A. Meyer, Probabilities and potential, North-Holland Mathematics

Studies, vol. 29, North-Holland, Amsterdam, 1978. MR-0521810

[14] D. Gilbarg and N.S. Trudinger, Elliptic partial differential equations of second order, Springer, Berlin, 2001. MR-1814364

[15] S.W. He, J.G. Wang, and J.A. Yan,Semimartingale theory and stochastic calculus, Science Press, Beijing, New York, 1992. MR-1219534

[16] N. Ikeda and S. Watanabe,Stochastic differential equations and diffusion processes, 2nd ed., North-Holland, Amsterdam, 1989. MR-1011252

[17] H. Ishii, Perron’s method for Hamilton-Jacobi equations, Duke Math. J.55(1987), no. 2, 369–384. MR-0894587

[18] H. Ishii and P. Loreti,A class of stochastic optimal control problems with state constraint, Indiana Univ. Math. J.51(2002), no. 5, 1167–135. MR-1947872

[19] R.L. Karandikar,Pathwise solutions of stochastic differential equations, Sankhya Ser. A43 (1981), no. 2, 121–132. MR-0666376

(15)

[20] ,On pathwise stochastic integration, Stoch. Proc. Appl.57(1995), no. 1, 11–18. MR- 1327950

[21] M.A. Katsoulakis,Viscosity solutions of second order fully nonlinear elliptic equations with state constraints, Indiana Univ. Math. J.43(1994), no. 2, 493–519. MR-1291526

[22] N.V. Krylov,Controlled diffusion processes, Springer, New York, 1980. MR-0601776 [23] J.M. Lasry and P.L. Lions, Nonlinear elliptic equations with singular boundary conditions

and stochastic control with state constraints, Math. Ann.283(1989), no. 4, 583–630. MR- 0990591

[24] D.B. Rokhlin,Verification by stochastic Perron’s method in stochastic exit time control prob- lems, J. Math. Anal. Appl.419(2014), no. 1, 433–446. MR-3217159

[25] M. Sîrbu,Stochastic Perron’s method and elementary strategies for zero-sum differential games, Preprint arXiv:1305.5083 [math.OC], 17 pages, 2013. MR-3206980

[26] ,Asymptotic Perron’s method and simple Markov strategies in stochastic games and control, Preprint arXiv:1402.7030 [math.OC], 12 pages, 2014.

[27] H.M. Soner,Optimal control with state-space constraint. I, SIAM J. Control Optim.24(1986), no. 3, 552–561. MR-0838056

[28] D.W. Stroock, Probability theory. an analytic view, 2nd ed., Cambridge University Press, Cambridge, 2011. MR-2760872

[29] D.W. Stroock and S.R.S. Varadhan, Multidimensional diffusion processes, Springer, New York, 1979. MR-0532498

(16)

Electronic Communications in Probability

Advantages of publishing in EJP-ECP

• Very high standards

• Free for authors, free for readers

• Quick publication (no backlog)

Economical model of EJP-ECP

• Low cost, based on free software (OJS

1

)

• Non profit, sponsored by IMS

2

, BS

3

, PKP

4

• Purely electronic and secure (LOCKSS

5

)

Help keep the journal free and vigorous

• Donate to the IMS open access fund

6

(click here to donate!)

• Submit your best articles to EJP-ECP

• Choose EJP-ECP over for-profit journals

1OJS: Open Journal Systemshttp://pkp.sfu.ca/ojs/

2IMS: Institute of Mathematical Statisticshttp://www.imstat.org/

3BS: Bernoulli Societyhttp://www.bernoulli-society.org/

4PK: Public Knowledge Projecthttp://pkp.sfu.ca/

5LOCKSS: Lots of Copies Keep Stuff Safehttp://www.lockss.org/

参照

関連したドキュメント

In this paper, we study a more general model of the steady state cooling problem by assuming general mixed boundary conditions over the upper surface of an infinite rectangular

Formulate the investment decision as optimal stopping problem of the stochastic process, and seek for the optimal timing of new technology adoption which maximize the net

The optimal control problem is defined as a discrete-time tracking problem (nominal state 4nd nominal control trajectories are tracked) for a linear stochastic economic models with

Nakai, An Optimal Selection Problem on a Partially Observable Markov process, In Stochastic Modelling in Innovative Manufacturing, Lecture Notes in Economics and

Keywords: optimal transportation problem, Legendre transform, duality theorem, stochastic control, forward-backward stochastic

(1990) A measure-valued diffusion process describing the stepping stone.. model with infinitely many

Key words: Probability measure, infinite programming, optimal control, semi-infinite programming, generalized moment problem, communication

Hattori, Existence of an infinite particle limit of stochastic ranking process, Stochastic Processes and their Applications, 119 (2009), 966–979.