ISSN:1083-589X in PROBABILITY
Stochastic Perron’s method for optimal control problems with state constraints
Dmitry B. Rokhlin
*Abstract
We apply the stochastic Perron method of Bayraktar and Sîrbu to a general infinite horizon optimal control problem, where the stateXis a controlled diffusion process, and the state constraint is described by a closed set. We prove that the value function vis bounded from below (resp., from above) by a viscosity supersolution (resp., sub- solution) of the related state constrained problem for the Hamilton-Jacobi-Bellman equation. In the case of a smooth domain, under some additional assumptions, these estimates allow to identifyvwith a unique continuous constrained viscosity solution of this equation.
Keywords: Stochastic Perron’s method; State constraints; Viscosity solution; Comparison re- sult.
AMS MSC 2010:93E20; 49L25; 60H30.
Submitted to ECP on June 19, 2014, final version accepted on October 20, 2014.
1 Introduction and the main result
The aim of the paper is to extend the scope of applications of the stochastic Per- ron method, developed by Bayraktar and Sîrbu. This method allows to characterise the value function of a controlled diffusion problem as a viscosity solution of the cor- responding Hamilton-Jacobi-Bellman (HJB) equation, bypassing the dynamic program- ming principle. Instead it requires a comparison result, implying the uniqueness of a viscosity solution of the HJB equation. Previously this method was applied to linear parabolic equations [5], stochastic differential games [7, 25, 26], regular [6, 24] and singular control problems [8].
The method involves the construction of two familiesV−,V+of functions, bounding the value function from below and above
u≤v≤w, u∈ V−, w∈ V+.
Elements ofV−, V+ are called stochastic sub- and supersolutions. By the superposi- tion with the state process,uandwgenerate sub- and supermartingale-like processes.
Similarly to the classical Perron method [14, Sections 2.8, 6.3], the setV− (resp.,V+) is directed upward (resp., downward) with respect to the pointwise maximum (resp., minimum) operation. The essence of the method is to prove that the functions
u−(x) = sup
u∈V−
u(x), w+(x) = inf
w∈V+w(x)
*Institute of Mathematics, Mechanics and Computer Sciences, Southern Federal University, Rostov-on- Don, Russia. E-mail:[email protected]
are respectively viscosity super- and subsolutions of the related HJB equation. If a comparison result, providing the inequalityu− ≥w+, holds true, it follows thatu−=v= w+is a unique (continuous) viscosity solution. This construction differs from Perron’s method of [17], which is not linked to the value function.
In the present paper we consider the stochastic control problem with state con- straints in the form of [21]. In contrast to [23], where the drift is not assumed to be bounded, and the value function is singular near the boundary, in [21] the problem is
"regular". To achieve the regularity it is assumed that the diffusion coefficient depends on the control and degenerates at the boundary. The same problem was considered in [18, 11]. It was proved that under appropriate assumptions the value function v is a unique continuous constrained viscosity solution of the HJB equation. (The term "con- strained" means, in particular, thatvsatisfies special boundary conditions, which in the deterministic situation were introduced in [27].) Roughly speaking, it is enough to as- sume that for each boundary point there exists a control, which kills the diffusion and directs the drift strictly inside the domain.
An application of the stochastic Perron method to state constrained problems seems rather interesting, since, as it is mentioned in [21], a direct proof of the dynamic pro- gramming principle is not available due to a complicated structure of admissible control processes, retaining a phase trajectory in a predetermined domain. Different penaliza- tion and approximation procedures were used instead in [21, 18, 11, 10].
We turn to the precise statement of our main result (Theorem 1.2). Let Ω be the space C([0,∞),Rm) of continuous Rm-valued functions, endowed with the σ-algebra F◦ of cylindrical sets, and let P be the Wiener measure on F◦. So, the canonical processWs(ω) =ω(s)is the standardm-dimensional Brownian motion underP. Denote byF◦ = (Ft◦)t≥0 the natural filtration ofW, and letF= (Ft)t≥0 be the correspondent minimal augmented filtration. The extension of the Wiener measure to the completion F ofF◦is still denoted byP.
Letαbe anF-progressively measurable stochastic process with values in a compact setA⊂Rk,0∈A. Consider the system of stochastic differential equations
dXt=b(Xt, αt)dt+σ(Xt, αt)dWt, X0=x. (1.1) We assume that the drift vectorb:Rd×A7→Rd and the diffusion matrixσ:Rd×A7→
Rd×Rmare continuous and satisfy the Lipschitz condition
|b(x, a)−b(y, a)|+|σ(x, a)−σ(y, a)| ≤K|x−y|
with some constantKindependent ofx,y,a. Note, that the linear growth condition
|b(x, a)|+|σ(x, a)| ≤K0(1 +|x|)
follows from the continuity ofb, σand compactness of A. Thus, there exist a unique F-adapted strong solutionXx,αof (1.1) on[0,∞): see [22, Chapter 2, Sect. 5].
LetG⊂Rdbe a closed set with the boundary∂Gand nonempty interiorG◦. It will be convenient to assume that0∈G◦. Denote byA(x),x∈Gthe set ofF-progressively measurable control processes αwith values in A and such thatXtx,α ∈ G, t ≥ 0 a.s.
Elements ofA(x) are calledadmissible controls for the initial condition x. The cost functionalJ and the value functionvare defined as follows
J(x, α) =E Z ∞
0
e−βsf(Xsx,α, αs)ds, v(x) = inf
α∈A(x)
J(x, α), (1.2) wheref :G×A7→Ris a bounded continuous function.
We assume that for any initial condition x ∈ Gthere exists an admissible control:
A(x)6=∅. In this case the setGis calledviable. A necessary condition for the validity of this property is given in [2] (Theorem 1). Let
NG2(x) =
(p, Y)∈Rd×Sd: lim inf
G3y→x
p·(y−x)
|y−x|2 +1 2
Y(y−x)·(y−x)
|y−x|2
≥0
be the second order normal cone. HereSd is the set of symmetricd×dmatrices. If the setGis viable then for allx∈∂G,(p, Y)∈NG2(x)there exista∈Asuch that
p·b(x, a) +1
2Tr (σ(x, a)σT(x, a)Y)≥0. (1.3) See [2, Section 3] for more concrete forms of this condition.
We impose a slightly stronger requirement. For any functionψ:Rd7→Aput
bψ(x) =b(x, ψ(x)), σψ(x) =σ(x, ψ(x)). (1.4) Assumption 1.1.There exist a Borel measurable functionψ:Rd7→Asuch thatbψ,σψ
are globally Lipschitz continuous and p·bψ(x) +1
2Tr (σψ(x)σTψ(x)Y)≥0, x∈∂G, (p, Y)∈NG2(x).
Under this assumption there exist a unique strong solution of the equation
dXt=bψ(Xt)dt+σψ(Xt)dWt, X0=x (1.5) and Xt ∈ G, t ≥ 0 a.s.: see [1, Theorem 3.1]. The correspondent control process αt=ψ(Xt)is admissible forx. Hence,A(x)6=∅,x∈G.
Consider the Bellman operator F(x, r, p, Y) = sup
a∈A
βr−f(x, a)−b(x, a)·p−1
2Tr (σ(x, a)σT(x, a)Y)
,
defined onR×R×Rd×Sd. Recall that a bounded upper semicontinuous (usc) function uis called aviscosity subsolutionof the equation
F(x, u, Du, D2u) = 0 (1.6)
on a setE⊂Rdif for anyϕ∈C2(Rd)and for any local maximum pointx0ofu−ϕonE the inequality
F(x0, u(x0), Dϕ(x0), D2ϕ(x0))≤0
holds true. In the same way, a bounded lower semicontinuous (lsc) functionwis called aviscosity supersolution of (1.6) onE if for anyϕ∈C2(Rd)and for any local minimum pointx0ofw−ϕonE we have the inequality
F(x0, w(x0), Dϕ(x0), D2ϕ(x0))≥0.
In these definitions one can assume that the maximum (resp., minimum) pointx0is strict andϕ(x0) =u(x0)(resp.,ϕ(x0) =w(x0)).
It is convenient to introduce thestate constrainedproblem F(x, u, Du, D2u)≤0 on G◦,
F(x, u, Du, D2u)≥0 on G. (1.7)
We say that a bounded usc (resp., lsc) functionu, defined onG, is viscosity subsolution (resp., supersolution) of the state constrained problem (1.7) ifF(x, u, Du, D2u)≤0on G◦ (resp., F(x, u, Du, D2u) ≥ 0 on G) in the viscosity sense. A bounded function u is called a viscosity solution of (1.7) (or aconstrained viscosity solution), if its upper semicontinuous envelope u∗ is a viscosity subsolution, and its lower semicontinuous envelopeu∗is a viscosity supersolution of (1.7).
Denote byΓthe set of pointsx∈∂Gsuch that for someα∈A(x)the solutionXx,α of (1.1) immediately entersG◦with probability1:
P(inf{t >0 :Xtx,α∈G◦}= 0) = 1.
Theorem 1.2.There exist a viscosity subsolutionw+ and a viscosity supersolutionu− of the state constrained problem (1.7) such that
u−≤v on G; v≤w+ on G◦, andv(x)≤lim supG◦3y→xw+(y),x∈Γ.
The nature ofw+ andu− is not explicitly indicated here. Their construction, which is presented in Sections 2 and 3 respectively, is based on the technique of stochastic semisolutions, developed in [5, 6, 7]. The details are quite similar to [6, 24]. One only should take care of admissibility of controls.
Theorem 1.2 is useful if a sort of comparison result is available, and one can conclude thatw+≤u−. In Section 4 we consider the case of a smooth domain and, under some additional assumptions, mention that such inequality follows from the known result, concerning the boundary behavior of viscosity subsolutions of linear equations [3], and the comparison result of [21]. In combination with Theorem 1.2 this allows to identifyv with a unique continuous viscosity solution of (1.7). The related result (Theorem 4.1) is not new and is presented only to demonstrate the capabilities of the stochastic Perron method.
2 Stochastic supersolutions
ForF-stopping timesτ,σand a setD∈ Fτ denote by
Jτ, σK={(t, ω)∈[0,∞)×Ω :τ(ω)≤t≤σ(ω)}
the stochastic interval, and by
τD=τ ID+ (+∞)IDc, Dc= Ω\D
the restriction ofτ onD. Put Bε(x) ={y ∈Rd : |y−x|< ε} and denote byBε(x)the closure of this ball.
Let τ : Ω 7→[0,∞]be a stopping time and take anFτ-measurable random vectorξ such thatξI{τ <∞}is bounded andξ∈Gon{τ <∞}. For anF-progressively measurable processαwith values inAconsider the stochastic differential equation (1.1) with the randomized initial condition(τ, ξ):
Xt=ξI{t≥τ}+ Z t
τ
b(Xs, αs)ds+ Z t
τ
σ(Xs, αs)dWs, t≥0. (2.1) ByRt
τ(·)we meanRt
0I{s≥τ}(·). As is known, see [22, Chapter 2, Sect. 5], there exists a pathwise unique strong solutionXτ,ξ,α of (2.1). The trajectories of the processXτ,ξ,α are continuous on the stochastic intervalJτ,∞K. Moreover,Xτ,ξ,α= 0onJ0, τJand
Xττ,ξ,α= lim
t&τXtτ,ξ,α=ξ on {τ <∞}.
Denote byA(τ, ξ)the set of progressively measurable control processesαsuch that αt ∈Aand Xtτ,ξ,α ∈G, t∈ [τ,∞)a.s. That is,A(τ, ξ)is the set of admissible controls for a randomized initial condition(τ, ξ). We omit indexτ ifτ= 0. For instance,Xx,α= X0,x,α,A(x) =A(0, x).
Lemma 2.1.Under Assumption 1.1 the set A(τ, ξ) is non-empty for any randomized initial condition(τ, ξ).
Proof. For anF◦-stopping timeτ0theσ-algebraFτ◦0is countably generated ([29, Lemma 1.3.3]), and there exists a regular conditional probability distributionPτ0 = (Pτ0,ω)ω∈Ω of Pwith respect to Fτ◦0: see [29, Theorem 1.3.4] or [28, Theorem 9.2.1]. For each B ∈ F◦ the function ω 7→ Pτ0,ω(B) is Fτ◦0-measurable, for each ω ∈ Ω the function B7→Pτ0,ω(B)is a probability measure onF◦ such that
Pτ0,ω(B) =E(IB|Fτ◦0)(ω) P-a.s., B∈F◦. Moreover, there exists aP-null setN ∈Fτ◦0 with the property that
Pτ0,ω(C) =IC(ω) for all ω6∈N, C ∈Fτ◦0. (2.2) Consider the SDE
Xt=ξI{t≥τ}+ Z t
τ
bψ(Xs)ds+ Z t
τ
σψ(Xs)dWs, t≥0, (2.3)
whereψsatisfies Assumption 1.1. To work withPτ0, related to the raw filtrationF◦, we pass fromξI{t≥τ} to an indistinguishableF◦-adapted process of the same form. Recall that anyF-stopping time is predictable (see [4, Proposition 16.22]) and the filtrationFis quasi-left continuous (see [15, Theorem 3.40]), that is,Fτ− =Fτ for any (predictable) F-stopping timeτ. By Theorem IV.78 of [13] there exists anF◦stopping timeτ0such that P(τ0 6=τ) = 0, and for anyB ∈Fτ−=Fτthere existsB0∈Fτ◦such thatP(IB0 6=IB) = 0. It easily follows that the process ξI{t≥τ} is indistinguishable from an F◦-adapted processξ0I{t≥τ0}with someFτ◦0-measurableξ0.
PutZt0=t−t∧τ,Zt=Wt−Wt∧τ. The processZ is a continuous martingale under P, and we can rewrite equation (2.3) in the form
Xt=Ht+ Z t
0
bψ(Xs)dZs0+ Z t
0
σψ(Xs)dZs, t≥0, (2.4) whereHt=ξ0I{t≥τ0}.
Recall the pathwise construction of a strong solution, presented in [20] (see also [9, 19]). Denote byD=D([0,∞),Rd)the set of functions from[0,∞)toRd, which are right continuous and have left limits. There exist a mappingS :D×C([0,∞),Rm)7→D such that ifZis a continuous semimartingale on a filtered probability space(Ω,F,Q,F), whereFsatisfies the usual conditions, and ifHis anF-adapted process with trajectories inD, then
Xt(ω) =S(H·(ω), Z·(ω))t is a strong solution of (2.4).
Takeω∈Ω\N withτ0(ω)<∞. Note thatZ is aPτ0,ω-martingale, andZ is the stan- dardd-dimensionalPτ0,ω-Brownian motion on[τ0(ω),∞). It follows thatX is a strong solution of (2.4) underPτ0,ω with respect to thePτ0,ω-augmentation ofF◦. Moreover, by (2.2) we get
Pτ0,ω({ω:τ0(ω) =τ0(ω), ξ0(ω) =ξ0(ω)}) = 1.
Hence, underPτ0,ω, the processH is indistinguishable from ξ0(ω)I{t≥τ0(ω)}, andX is a strong solution of the SDE with a non-random initial condition:
Xt=ξ0(ω) + Z t
τ0(ω)
bψ(Xs)ds+ Z t
τ0(ω)
σψ(Xs)dWs, t≥τ0(ω).
In addition,Xt = 0,t∈[0, τ0(ω))Pτ0,ω-a.s. sinceZ0, Z,H are indistinguishable from 0 on[0, τ0(ω)).
By Assumption 1.1 the diffusion coefficientsbψ,σψsatisfy conditions of Theorem 3.1 of [1]. Since0 ∈Gandξ0(ω)∈G, we conclude thatXt∈G, t≥0Pτ0,ω-a.s. It follows thatGis invariant underP:
P(Xt∈G, t≥0) =E
I{τ0(ω)<∞}Pτ0,ω(Xt∈G, t≥0)
= 1.
The desired control processα∈A(τ, ξ)is given by the formulaα=ψ(X).
Letwbe a uniformly bounded continuous function:w∈Cb(G). Consider the stochas- tic process
Ztτ,ξ,α(w) = Z t
τ
e−βsf(Xsτ,ξ,α, αs)ds+I{t≥τ}e−βtw(Xtτ,ξ,α).
Definition 2.2.We say that a control processα∈ A(τ, ξ)isw-suitablefor(τ, ξ)if E(Zρτ,ξ,α(w)|Fτ)≤Zττ,ξ,α(w) =e−βτw(ξ)
for any stopping timeρ≥τ. A functionw∈Cb(G)is called a stochastic supersolution of (1.7) if for any randomized initial condition(τ, ξ)withξ∈G◦there exists aw-suitable controlα.
The set of stochastic supersolutions is denoted byV+. Note that in the above defi- nition the valuesX∞are irrelevant, sinceZ∞=R∞
0 e−βsf(Xsτ,ξ,α, αs)ds. We emphasize also that the conditionA(τ, ξ)6=∅for all randomized initial conditions (τ, ξ), ξ∈G◦ is necessary for the existence of stochastic supersolutions.
A stochastic supersolutionw is an upper bound for the value function (1.2) onG◦. To see this putτ = 0, ξ =x∈G◦,ρ =∞and take aw-suitable controlα ∈A(x). By Definition 2.2, with the conventionZx,α=Z0,x,α, we get
v(x)≤J(x, α) =EZ∞x,α(w)≤EZ0x,α(w) =w(x).
The setV+is non-empty and contains sufficiently large constantsc: it is easy to see that
E(Zρτ,ξ,α(c)|Fτ)≤ce−βτ =Zττ,ξ,α(c) for c≥f /β, wheref = sup(x,a)∈G×Af(x, a).
Lemma 2.3.Ifw1, w2 are stochastic supersolutions thenw = w1∧w2 is a stochastic supersolution.
Proof. Letαi∈A(τ, ξ),i= 1,2bewi-suitable controls for a randomized initial condition (τ, ξ). PutA1={w1(ξ)< w2(ξ)} ∈Fτ,A2=Ac1:= Ω\A1. We claim that
α=IA1I{τ≤t}α1+IA2I{τ≤t}α2
belongs toA(τ, ξ)and that it isw-suitable.
The process Y = P2
i=1Xtτ,ξ,αiIAi satisfy the same equation as Xτ,ξ,α. From the pathwise uniqueness property it follows that Y = Xτ,ξ,α. We have Xτ,ξ,α ∈ G, t ≥ τ P-a.s., andαisw-suitable for(τ, ξ):
E(Zρτ,ξ,α(w)|Fτ) =
2
X
i=1
E(IAiZρτ,ξ,αi(w)|Fτ)≤
2
X
i=1
IAiE(Zρτ,ξ,αi(wi)|Fτ)
≤
2
X
i=1
IAie−βτwi(ξ) =e−βτw(ξ).
The following result was used in [5, 6, 24] (see, e.g., Lemmas 2 and 4 of [24]). Its proof use only the fact thatV+ is directed downward, that is, the statement of Lemma 2.3 holds true.
Lemma 2.4.There exists a sequencewn∈ V+,wn(x)≥wn+1(x),x∈Gsuch that
n→∞lim wn(x) =w+(x) := inf
u∈V+w(x).
The next assertion is the most important part of the stochastic Perron method.
Lemma 2.5.The function
w+(x) = inf
w∈V+w(x) is a viscosity subsolution of (1.7).
Proof. If w+ is not a viscosity subsolution then there existx0 ∈ G◦, ϕ∈ C2and ε > 0 such thatw+(x0) =ϕ(x0),w+< ϕon the setBε(x0)\{0} ⊂G◦and
F(x0, ϕ(x0), Dϕ(x0), D2ϕ(x0))>0.
Hence, there exists somea∈Asuch thatβϕ(x0)−(Laϕ)(x0)−f(x0, a)>0, where (Laϕ)(x) =b(x, a)Dϕ(x) +1
2Tr σ(x, a)σT(x, a)D2ϕ(x) .
By the continuity ofb,σ,f we may assume that
βϕ(x)−(Laϕ)(x)−f(x, a)>0, x∈Bε(x0)⊂G◦ (2.5) for someε >0.
Sincew+is upper semicontinuous, we have
w+(x)−ϕ(x)≤ −δ <0, x∈Sε:=Bε(x0)\Bε/2(x0).
By Lemma 2.4 there exists a decreasing sequencewn∈ V+,wn&w+. The sets An={x∈Sε:wn(x)−ϕ(x)≥ −δ0}, δ0∈(0, δ)
are compact,An ⊃An+1 and∩∞n=1An =∅. Thus,∩Nn=1An =∅for someN. This means that there exists a functionw=wN ∈ V+such thatw−ϕ <−δ0 onSε.
Define the function ϕη = ϕ−η, whereη ∈ (0, δ0)is such that the inequality (2.5) holds true forϕη instead ofϕ. Note that
w−ϕη=w−ϕ+η <−δ0+η <0 on Sε. We claim that
wη =
ϕη∧w on Bε(x0),
w otherwise
is a stochastic subsolution. This gives a contradiction with the definition of w+ since wη(x0) =ϕη(x0) =w+(x0)−η < w+(x0).
It is clear thatwη ∈Cb(G). We only need to construct awη-suitable controlαfor a randomized initial condition(τ, ξ),ξ∈G◦. Put
U ={x∈Bε/2(x0) :w(x)> ϕη(x)}, H={ξ∈U} ∈Fτ
and define a progressively measurable process
αt= (aIH+α0tIHc)I{t≥τ}∈A, whereα0is aw-suitable control for(τ, ξ). Furthermore, put
τ1= inf{t≥τ :Xtτ,ξ,α6∈Bε/2(x0)}, αt=αtI{t≤τ1}+α1tI{t>τ1},
whereα1is aw-suitable control for(τ1, ξ1),ξ1=Xττ,ξ,α
1 I{τ1<∞}. We haveXτ,ξ,α=Xτ,ξ,α on the stochastic intervalJτ, τ1K andXτ,ξ,α =Xτ1,ξ1,α1 on Jτ1,∞K. Thus, α∈ A(τ, ξ). Note also that forE ={ξ∈Bε/2(x0)}we get
Xτ,ξ,α∈Bε/2(x0) on JτE,(τ1)EK; Xτ,ξ,α=ξ on JτEc,(τ1)EcK.
It remains to show thatαis awη-suitable control for(τ, ξ). For a stopping timeρ≥τ putD={ρ > τ1}. We have
Zρτ,ξ,α(wη)ID=ID Z τ1
τ
e−βsf(Xsτ,ξ,α, αs)ds
+ID
Z ρ
τ1
e−βsf(Xsτ1,ξ1,α1, α1s)ds+e−βρwη(Xρτ1,ξ1,α1)
≤ID
Z τ1
τ
e−βsf(Xsτ,ξ,α, αs)ds+IDZρτ1,ξ1,α1(w). (2.6) By Definition 2.2 we get
E(Zρτ1,ξ1,α1(w)ID|Fτ1) =E(Zρτ1,ξ1,α1
D (w)ID|Fτ1)≤IDe−βτ1w(ξ1)
=IDe−βτ1wη(ξ1). (2.7)
The last equality follows from the fact thatξ16∈Bε/2(x0)on the set{ρ > τ1}andw=wη onG\Bε/2(x0). From (2.6), (2.7) it follows that
E(Zρτ,ξ,α(wη)ID|Fτ1)≤ID
Z τ1
τ
e−βsf(Xsτ,ξ,α, αs)ds+e−βτ1wη(ξ1)
=IDZττ,ξ,α
1 (wη), and we obtain the estimate
E(Zρτ,ξ,α(wη)|Fτ) =E(I{ρ≤τ1}Zρτ,ξ,α(wη)|Fτ) +E(I{ρ>τ1}E(Zρτ,ξ,α(wη)|Fτ1)|Fτ)
≤E(I{ρ≤τ1}Zρτ,ξ,α(wη)|Fτ) +E(I{ρ>τ1}Zττ,ξ,α1 (wη)|Fτ)
=E(Zρ∧ττ,ξ,α1(wη)|Fτ). (2.8)
On the stochastic intervalJτH,(τ1)HKthe trajectories ofXτ,ξ,α do not leave the ball Bε/2(x0). Hence, the estimate wη(Xρ∧ττ,ξ,α1)≤ϕη(Xρ∧ττ,ξ,α1)holds true onH and we get the inequality
Zρ∧ττ,ξ,α1(wη) =Zρ∧ττ,ξ,a1(wη)IH+Zρ∧ττ,ξ,α10(wη)IHc≤Zρ∧ττ,ξ,a1(ϕη)IH+Zρ∧ττ,ξ,α10(w)IHc. (2.9)
Applying Ito’s formula Ztτ,ξ,a(ϕη) =
Z t
τ
e−βsf(Xsτ,ξ,a, a)ds+e−βtϕη(Xtτ,ξ,a)
=e−βτϕη(ξ) + Z t
τ
e−βs
f(Xsτ,ξ,a, a) + (Laϕη−βϕη)(Xsτ,ξ,a) ds
+ Z t
τ
e−βsϕηx(Xsτ,ξ,a)·σ(Xsτ,ξ,a, a)dWs. (2.10) on the intervalJτ, ρ∧τ1K, taking the conditional expectation, and using (2.5), we get
E(Zρ∧ττ,ξ,a1(ϕη)IH|Fτ)≤e−βτϕη(ξ)IH=e−βτwη(ξ)IH=Zττ,ξ,α(wη)IH. (2.11) Furthermore,
E(Zρ∧ττ,ξ,α10(w)|Fτ)IHc≤Zττ,ξ,α0(w)IHc=Zττ,ξ,α(wη)IHc (2.12) by the definition ofα0. The combination of (2.11), (2.12) with (2.9) and (2.8) gives the desired inequality
E(Zρτ,ξ,α(wη)|Fτ)≤Zττ,ξ,α(wη).
To show thatw+ satisfies the last assertion of Theorem 1.2, we study its behavior near the points ofΓ. Fixx∈Γ. By the definition ofΓthere existsα1∈A(x)such that
τ= inf{t >0 :Xtx,α1∈G◦}= 0 a.s. (2.13) Forε >0consider the predictable set
E={(t, ω) :Xtx,α1(ω)∈G◦, t∈(0, ε]}=K0, εK∩ Xx,α1−1
(G◦)
and its projection:D={ω: (t, ω)∈E for some t∈[0,∞)}. The equality (2.13) means that P(D) = 1. By the section theorem [4, Theorem 16.12] there exist an F-stopping timeσεsuch that
{(σε(ω), ω) :ω∈Ω, σε(ω)<∞} ⊂E, P(σε<∞)≥1−ε. (2.14) PutDε={σε≤ε}={σε<∞}. Then (2.14) means that
Xσx,αε 1 ∈G◦ on Dε, P(Dε)≥1−ε.
Let wbe a stochastic supersolution, bounded from above by the constantf /β. Put ξε=IDεXσx,αε 1 ∈G◦ and take aw-suitable controlα2∈A(σε, ξε). Then
α=α1I{t<σε}+α2I{t≥σε}∈A(x).
Taking into account thatσε=∞onDεc, by the definitions ofvandwwe obtain:
v(x)≤E Z σε
0
e−βtf(Xtx,α1, αt1)dt+E Z ∞
σε
e−βtf(Xtσε,ξε,α2, α2t)dt Fσε
! ,
≤E Z σε
0
e−βtf(Xtx,α1, αt1)dt+e−βσεw(ξε)
!
It easily follows that v(x)≤ f
β
1−Ee−βσε
+Ee−βσεw(ξε)IDε+f
β(1−P(Dε)). (2.15)
Moreover, by Lemma 2.4 and the monotone convergence theorem we can changewto w+in this inequality.
Takeεn such thatP(Dεc
n)≤1/2n. By the Borel-Cantelli lemma for allω in some set Ω0withP(Ω0) = 1we haveω∈Dεn for sufficiently largen. Thus,
IDεn →1, ξεn→x, σεn→0 on Ω0, and from (2.15) we obtain the estimatev(x)≤lim supG◦3y→xw+(y).
3 Stochastic subsolutions
Definition 3.1.With the notation of Section 2 we callu∈Cb(G)a stochastic subsolu- tionif
E(Zρτ,ξ,α(u)|Fτ)≥Zττ,ξ,α(u) =e−βτu(ξ) (3.1) for any randomized initial condition(τ, ξ), admissible control processα ∈A(τ, ξ)and stopping timeρ≥τ.
Any stochastic subsolutionuis a lower bound forv: forτ= 0,ξ=x,ρ=∞we have J(x, α) =EZ∞x,α(u)≥Z0x,α(u) =u(x), α∈A(x).
Putf = inf(x,a)∈G×Af(x, a).The setV− of stochastic subsolutions is non-empty and contains sufficiently large negative constantsc. Indeed, it is easy to see that
E(Zρτ,ξ,α(c)|Fτ)≥ce−βτ for c≤f /β.
Lemma 3.2.Letu1,u2be stochastic subsolutions. Thenu1∨u2is a stochastic subsolu- tion.
The proof follows from the inequality E(Zρτ,ξ,α(u1∨u2)|Fτ)≥max
i=1,2E(Zρτ,ξ,α(ui)|Fτ)≥max
i=1,2Zττ,ξ,α(ui) =e−βτ(u1∨u2)(ξ).
Lemma 3.3.There exists a sequenceun ∈ V−,un(x)≤un+1(x),x∈Gsuch that
n→∞lim un(x) =u−(x) := sup
u∈V−
u(x).
This lemma is analogous to Lemma 2.4.
Lemma 3.4.The function
u−(x) = sup
u∈V−
u(x)
is a viscosity supersolution of (1.7).
Proof. Ifu− is not a viscosity supersolution then there existx0 ∈G,ϕ∈C2 andε >0 such thatu−(x0) =ϕ(x0),u−> ϕon(Bε(x0)\{0})∩Gand
F(x0, ϕ(x0), Dϕ(x0), D2ϕ(x0))<0.
By the continuity ofF we can assume that
F(x, ϕ(x), Dϕ(x), D2ϕ(x))<0, x∈Bε(x0)∩G. (3.2) Furthermore, by the lower-semicontinuity ofu−we have
u−(x)≥ϕ(x) +δ, x∈Sε:= Bε(x0)\Bε/2(x0)
∩G
for someδ >0. In the same way as in the proof of Lemma 2.5, one can show that there existu∈ V− andδ0∈(0, δ)such thatu≥ϕ+δ0onSε.
Take anη ∈(0, δ0)such that (3.2) holds true forϕη =ϕ+η instead ofϕ. We have u−ϕη ≥δ0−η >0onSε.
To get a contradiction it is enough to prove that the function uη=
ϕη∨u on Bε(x0)∩G,
u otherwise
is a stochastic subsolution, sinceuη(x0) =ϕη(x0)> u−(x0), contrary to the definition of u−.
Clearly uη ∈ Cb(G), and we only should to verify (3.1) for any randomized initial condition(τ, ξ), control processα∈A(τ, ξ)and stopping timeρ≥τ. Put
τ1= inf{t≥τ:Xtτ,ξ,α6∈Bε/2(x0)}, ξ1=Xττ,ξ,α
1 I{τ1<∞}, E={ξ∈Bε/2(x0)}.
We have
ξ1∈∂Bε/2(x0)∩G on E∩ {τ1<∞}; ξ1= 0 on E∩ {τ1=∞}; ξ1=ξ on Ec. Moreover,Xτ1,ξ1,α=Xτ,ξ,αon the stochastic intervalJτ1,∞K.
PutD={ρ > τ1}. Similarly to (2.6) we get Zρτ,ξ,α(uη)ID≥ID
Z τ1
τ
e−βsf(Xsτ,ξ,α, αs)ds+IDZρτ1,ξ1,α(u). (3.3) Applying Definition 3.1, we obtain
E(Zρτ1,ξ1,α(u)ID|Fτ1) =E(Zρτ1,ξ1,α
D (u)ID|Fτ1)≥IDe−βτ1u(ξ1) =IDe−βτ1uη(ξ1). (3.4) The last equality follows from the fact thatξ1, restricted to D, takes values in the set G\Bε/2(x0)whereu=uη.
From (3.3), (3.4) it follows that E(Zρτ,ξ,α(uη)ID|Fτ1)≥ID
Z τ1
τ
e−βsf(Xsτ,ξ,α, αs)ds+e−βτ1uη(ξ1)
=IDZττ,ξ,α1 (uη). (3.5)
By (3.5) we have
E(Zρτ,ξ,α(uη)|Fτ) =E(I{ρ≤τ1}Zρτ,ξ,α(uη)|Fτ) +E(I{ρ>τ1}E(Zρτ,ξ,α(uη)|Fτ1)|Fτ)
≥E(I{ρ≤τ1}Zρτ,ξ,α(uη)|Fτ) +E(I{ρ>τ1}Zττ,ξ,α
1 (uη)|Fτ)
=E(Zρ∧ττ,ξ,α1(uη)|Fτ). (3.6)
Put
U ={x∈G∩Bε/2(x0) :ϕη(x)> u(x)}, H={ξ∈U} ∈Fτ.
On the stochastic intervalJτH,(ρ∧τ1)HKthe trajectories ofXτ,ξ,αdo not leave the set Bε/2(x0)∩G. Hence, we haveuη(Xρ∧ττ,ξ,α1)IH ≥ϕη(Xρ∧ττ,ξ,α1)IHand
Zρ∧ττ,ξ,α1(uη)≥Zρ∧ττ,ξ,α1(ϕη)IH+Zρ∧ττ,ξ,α1(u)IHc. (3.7) Apply Ito’s formula (2.10) on the intervalJτ, ρ∧τ1Kwithαinstead ofa. Taking the conditional expectation and using (3.2), we get
E(Zρ∧ττ,ξ,α1(ϕη)IH|Fτ)≥e−βτϕη(ξ)IH =e−βτuη(ξ)IH=Zττ,ξ,α(uη)IH. (3.8)
Furthermore,
E(Zρ∧ττ,ξ,α1(u)|Fτ)IHc ≥Zττ,ξ,α(u)IHc =Zττ,ξ,α(uη)IHc, (3.9) and the desired inequality
E(Zρτ,ξ,α(uη)|Fτ)≥Zττ,ξ,α(uη)
follows from (3.8), (3.9), combined with (3.6), (3.7).
4 The case of a smooth domain
LetGcoincide with the closure ofG◦, and assume that∂Gis of classC2. Then the distance functionρfrom∂G:
ρ(x) = inf{y∈Gc :|y−x|}, x∈G
is of classC2 in a neighbourhood of∂G(see [14, Lemma 14.16]). Put−n(x) =Dρ(x), x∈G. Ifx∈∂G,n(x)is the unit outer normal to∂Gat x. It is shown in [1, Example 3.2], [2, Example 1] that condition (1.3) is reduced to the following: for any x ∈ ∂G there existsa∈Asuch that
σT(x, a)n(x) = 0, −n(x)·b(x, a) +1
2Tr σ(x, a)σT(x, a)D2ρ(x)
≥0.
To get a comparison result we need a stronger condition, presented in the next theorem.
Theorem 4.1.Assume that there exists a Borel measurable functionψ : G7→ Asuch that the functions (1.4) are globally Lipschitz continuous and
σψ(x) = 0, −n(x)·bψ(x)>0, x∈∂G. (4.1) Then the value functionv, defined by (1.2), is the unique continuous viscosity solution of the state constrained problem (1.7).
Proof. The viscosity subsolutionw+, specified in Theorem 1.2, satisfies also the linear inequality
βw+(x)−f(x, ψ(x))−(bψ·Dw+)(x)−1
2Tr (σψσψTD2w+)(x)≤0, x∈G◦ (4.2) in the viscosity sense. Consider the function
we+(x) =
( lim sup
G◦3y→x
w+(y) x∈∂G, w+(x) otherwise.
Clearly,we+is a viscosity subsolution of (4.2), satisfying all conditions of Theorem 1.2.
Now we use conditions (4.1). By Lemma 4.1 of [3] the function we+ is a viscosity subsolution of (4.2) onG. Furthermore, by Theorem 4.1(ii) of [3], for anyx∈∂Gthere exists a sequencexk∈G◦,xk→xsuch thatwe+(x) = limk→∞we+(xk)and
lim sup
k→∞
|xk−x|
d(xk) <∞, or, equivalently,
lim sup
k→∞
(xk−x)·n(x)
|xk−x| ≤ −β
for some β ∈ (0,1). This is the nontangential upper semicontinuity property of we+, which, by the comparison result of [21] (Theorem 2.2), implies that
we+≤u− onG. (4.3)
Let us prove that∂G= Γ. Forx∈∂Gdenote byX the solution of the equation Xt=x+
Z t
0
bψ(Xs)ds+ Z t
0
σψ(Xs)dWs, x∈∂G.
Since conditions (4.1) imply the viability, we get an admissible control αt = ψ(Xt): Xt=Xtx,α∈G,t≥0a.s. Takeε >0such thatρ∈C2(Bε(x))and
inf
y∈Bε(x)∩G
−n(y)·bψ(y) +1
2Tr σψ(y)σTψ(y)D2ρ(y)
>0.
Furthermore, putτ = inf{t≥0 :Xt6∈Bε(x)}. By Ito’s formula we have ρ(Xt∧τ) =ρ(x)−
Z t∧τ
0
n(Xs)·bψ(Xs)ds+1 2
Z t∧τ
0
Tr σψ(Xs)σψT(Xs)D2ρ(Xs)
ds+Mt,
whereM is a continuous martingale with M0 = 0. From the representation ofM as a time-changed Brownian motion on an extended filtered probability space (see [16, Theorem 7.2’]) it follows that 0 is a limit point of the set{t > 0 : Mt = 0} a.s. For a sequencetk(ω)→0withMtk= 0we have
ρ(Xtk) =ρ(x) + Z tk
0
−n(Xs)·bψ(Xs)ds+1
2Tr σψ(Xs)σTψ(Xs)D2ρ(Xs)
ds >0 a.s.
for sufficiently largek. Thus,X immediately entersG◦:
inf{t >0 :Xt∈G◦}= inf{t >0 :ρ(Xt)>0}= 0 a.s., and we conclude thatx∈Γand∂G= Γ.
This fact, together with Theorem 1.2 and inequality (4.3), implies that v≤we+≤u−≤v on G.
Hence,v=we+=u−is a continuous function, and it satisfies (1.7) in the viscosity sense.
Note also that the uniqueness of a continuous constrained viscosity solution is a more classical result: see [12, Theorem 7.10].
Theorem 4.1 is similar to Theorem 4.1 of [21]. Although, the second condition (4.1) is presented there in the form
−n(x)·bψ(x) +1
2Tr σψ(x)σTψ(x)D2ρ(x)
≥c >0, x∈∂G,
which is formally not comparable to ours local condition−n(x)·bψ(x)>0,x∈∂G, the result of [21] is more sophisticated. To get the comparison result in Theorem 4.1 we used only the fact that any subsolution, being suitably modified at the boundary points, possesses the nontangential upper semicontinuity property under conditions (4.1). In [21] it is shown that a subsolutionu≥v with this property exists even some diffusion in the tangent direction to∂Gis allowed: see conditions A3 of [21].
Certainly, the stochastic Perron method can be applied in the case of finite horizon as well. However, some work is required to study the parabolic problem, corresponding
to (1.7). In particular, a new boundary condition at the terminal time appears, and the viability notion should be modified. Such a problem was studied in [10] by another methods. We mention a comparison result, ensuring the continuity of the value function, proved under conditions similar to (4.1): see [10, Theorem A.1].
Acknowledgments.The author thanks the anonymous referees for useful remarks and for pointing out an error in the previous proof of Lemma 2.1. The research is supported by Southern Federal University, project 213.01-07-2014/07.
References
[1] M. Bardi and P. Goatin,Invariant sets for controlled degenerate diffusions: a viscosity so- lutions approach, Stochastic Analysis, Control, Optimization and Applications (W.M. McE- neaney, G.G. Yin, and Q. Zhang, eds.), Systems & Control: Foundations & Applications, Birkhäuser Boston, 1999, pp. 191–208. MR-1702960
[2] M. Bardi and R. Jensen,A geometric characterization of viable sets for controlled degener- ate diffusions, Set-Valued Analysis10(2002), no. 2-3, 129–141. MR-1926377
[3] G. Barles and E. Rouy, A strong comparison result for the Bellman equation arising in stochastic exit time control problems and its applications, Comm. Partial Differential Equa- tions22(1998), no. 11-12, 1995–2033. MR-1662164
[4] R.F. Bass, Stochastic processes, Cambridge University Press, Cambridge, 2011. MR- 2856623
[5] E. Bayraktar and M. Sîrbu,Stochastic Perron’s method and verification without smoothness using viscosity comparison: the linear case, Proc. Amer. Math. Soc.140 (2012), no. 10, 3645–3654. MR-2929032
[6] ,Stochastic Perron’s method for Hamilton-Jacobi-Bellman equations, SIAM J. Control Optim.51(2013), no. 6, 4274–4294. MR-3124891
[7] ,Stochastic Perron’s method and verification without smoothness using viscosity com- parison: Obstacle problems and Dynkin games, Proc. Amer. Math. Soc.142(2014), no. 4, 1399–1412. MR-3162260
[8] E. Bayraktar and Y. Zhang,Stochastic Perron’s method for the probability of lifetime ruin problem under transaction costs, Preprint arXiv:1404.7406v1 [math.OC], 24 pages, 2014.
[9] K. Bichteler,Stochastic integration andLp-theory of semimartingales, Ann. Prob.9(1981), no. 1, 49–89. MR-0606798
[10] B. Bouchard and M. Nutz,Weak dynamic programming for generalized state constraints, SIAM J. Control Optim.50(2012), no. 6, 3344–3373. MR-3024163
[11] D. Buckdahn, M. Goreac, and M. Quincampoix,Stochastic optimal control and linear pro- gramming approach, Appl. Math. Optim.63(2011), no. 2, 257–276. MR-2772196
[12] M. Crandall, H. Ishii, and P.-L. Lions,User’s guide to viscosity solutions of second-order partial differential equations, Bull. Amer. Math. Soc.27(1992), no. 1, 1–67. MR-1118699 [13] C. Dellacherie and P.-A. Meyer, Probabilities and potential, North-Holland Mathematics
Studies, vol. 29, North-Holland, Amsterdam, 1978. MR-0521810
[14] D. Gilbarg and N.S. Trudinger, Elliptic partial differential equations of second order, Springer, Berlin, 2001. MR-1814364
[15] S.W. He, J.G. Wang, and J.A. Yan,Semimartingale theory and stochastic calculus, Science Press, Beijing, New York, 1992. MR-1219534
[16] N. Ikeda and S. Watanabe,Stochastic differential equations and diffusion processes, 2nd ed., North-Holland, Amsterdam, 1989. MR-1011252
[17] H. Ishii, Perron’s method for Hamilton-Jacobi equations, Duke Math. J.55(1987), no. 2, 369–384. MR-0894587
[18] H. Ishii and P. Loreti,A class of stochastic optimal control problems with state constraint, Indiana Univ. Math. J.51(2002), no. 5, 1167–135. MR-1947872
[19] R.L. Karandikar,Pathwise solutions of stochastic differential equations, Sankhya Ser. A43 (1981), no. 2, 121–132. MR-0666376
[20] ,On pathwise stochastic integration, Stoch. Proc. Appl.57(1995), no. 1, 11–18. MR- 1327950
[21] M.A. Katsoulakis,Viscosity solutions of second order fully nonlinear elliptic equations with state constraints, Indiana Univ. Math. J.43(1994), no. 2, 493–519. MR-1291526
[22] N.V. Krylov,Controlled diffusion processes, Springer, New York, 1980. MR-0601776 [23] J.M. Lasry and P.L. Lions, Nonlinear elliptic equations with singular boundary conditions
and stochastic control with state constraints, Math. Ann.283(1989), no. 4, 583–630. MR- 0990591
[24] D.B. Rokhlin,Verification by stochastic Perron’s method in stochastic exit time control prob- lems, J. Math. Anal. Appl.419(2014), no. 1, 433–446. MR-3217159
[25] M. Sîrbu,Stochastic Perron’s method and elementary strategies for zero-sum differential games, Preprint arXiv:1305.5083 [math.OC], 17 pages, 2013. MR-3206980
[26] ,Asymptotic Perron’s method and simple Markov strategies in stochastic games and control, Preprint arXiv:1402.7030 [math.OC], 12 pages, 2014.
[27] H.M. Soner,Optimal control with state-space constraint. I, SIAM J. Control Optim.24(1986), no. 3, 552–561. MR-0838056
[28] D.W. Stroock, Probability theory. an analytic view, 2nd ed., Cambridge University Press, Cambridge, 2011. MR-2760872
[29] D.W. Stroock and S.R.S. Varadhan, Multidimensional diffusion processes, Springer, New York, 1979. MR-0532498
Electronic Communications in Probability
Advantages of publishing in EJP-ECP
• Very high standards
• Free for authors, free for readers
• Quick publication (no backlog)
Economical model of EJP-ECP
• Low cost, based on free software (OJS
1)
• Non profit, sponsored by IMS
2, BS
3, PKP
4• Purely electronic and secure (LOCKSS
5)
Help keep the journal free and vigorous
• Donate to the IMS open access fund
6(click here to donate!)
• Submit your best articles to EJP-ECP
• Choose EJP-ECP over for-profit journals
1OJS: Open Journal Systemshttp://pkp.sfu.ca/ojs/
2IMS: Institute of Mathematical Statisticshttp://www.imstat.org/
3BS: Bernoulli Societyhttp://www.bernoulli-society.org/
4PK: Public Knowledge Projecthttp://pkp.sfu.ca/
5LOCKSS: Lots of Copies Keep Stuff Safehttp://www.lockss.org/