• 検索結果がありません。

2.1 Maximum principle with transversality conditions

N/A
N/A
Protected

Academic year: 2022

シェア "2.1 Maximum principle with transversality conditions"

Copied!
22
0
0

読み込み中.... (全文を見る)

全文

(1)

curvilinear integral cost

Constantin Udri¸ste

Abstract.Recently we have created a multitime maximum principle gath- ering together some concepts in Mechanics, Field Theory, Differential Ge- ometry, and Control Theory. The basic tools of our theory are variational PDE systems, adjoint PDE systems, Hamiltonian PDE systems, duality, multitime maximum principle, incavity on manifolds etc. Now we jus- tify the multitime maximum principle for curvilinear integral cost using the m-needle variations. Section 1 recalls the multitime control theory and proves the equivalence between curvilinear integral costs and multi- ple integral costs. Section 2 formulates variants of multitime maximum principle using control Hamiltonian 1-forms produced by a curvilinear in- tegral cost and a controlledm-flow evolution. Section 3 refers to original proofs of the multitime maximum principle using simple and multiple mul- titimem-needle control variations. The key is to use completely integrable first order PDEs (controlled evolution and variational PDEs) and their ad- joint PDEs. Section 4 formulates and proves sufficient conditions that the multitime maximum principle be true.

M.S.C. 2010: 49J20, 49K20, 93C20.

Key words: multitime maximum principle, curvilinear integral cost, variational PDEs, adjoint PDEs,m-needle variations.

1 Multitime control theory

Themultitime control theoryis concerned with partial derivatives dynamical systems and their optimization over multitime [15]-[30]. Such problems are well-known also as the multidimensional control problems of Dieudonn´e-Rashevsky type [9], [10], [31], [32], but both techniques and results in these papers are different from ours. We confirm the expectations of Lev Pontryaguin, Lawrence Evans and Jacques-Louis Lions regarding the analogy between optimal control of systems governed by first order PDEsand optimal control of systems containing first order ODEs. The ideas we use were stimulated by the original point of view of Lawrence C. Evans [4] on

Balkan Journal of Geometry and Its Applications, Vol.16, No.1, 2011, pp. 128-149.

°c Balkan Society of Geometers, Geometry Balkan Press 2011.

(2)

(single-time) Pontryaguin maximum principle [11] and by the papers [1]-[3], [5]-[10], [31], [32].

The multidimensional optimal control arise in the description of torsion of pris- matic bars in the elastic case as well as in the elastic-plastic case. Another instance are optimization problems for convex bodies under geometrical restrictions, e.g., max- imization of the area surface for given width and diameter.

Let (T, h) be anm-dimensionalCRiemannian manifold, (M, g) be ann-dimensional C Riemannian manifold and J1(T, M) the associatedjet bundle of first order. Let (U, η, M) be acontrol fiber bundle. The manifoldM is calledstate manifoldand the componentsxi,i= 1, . . . , nof a pointx∈M are calledstate variables. Then (xi, ua), i = 1, . . . , n; a = 1, . . . , k are adapted coordinates in U, and (tα, xi, xiα = ∂xi

∂tα), i= 1, . . . , n; α= 1, . . . , m are natural coordinates inJ1(T, M). The components ua of the pointu∈Ux=η−1(x) are calledcontrols.

LetXα :U →J1(T, M),Xα=Xαi(x, u)

∂xi be a C fibered mapping, over the identity in the state manifoldM, which produces a continuous control PDEs system (controlled m-flow)

∂xi

∂tα(t) =Xαi(x(t), u(t)), i= 1, . . . , n; α= 1, . . . , m,

where t = (t1, . . . , tm) is the multi-parameter of evolution (multitime). This PDEs system has solutions if and only if the complete integrability conditions

(CIC) ∂Xαi

∂xj Xβj+∂Xαi

∂ua

∂ua

∂tβ = ∂Xβi

∂xj Xαj +∂Xβi

∂ua

∂ua

∂tα are satisfied. These determine the set ofadmissible controls

U ={u(·) :R+m→U| u(·) is measurable and satisfies CIC}.

The evolution of the state manifold is totally characterized by the image setS = Im(Xα)⊂J1(T, M) which is described by the control equations

xi=xi, xiα=Xαi(x, u), i= 1, . . . , n α= 1, . . . , m.

Remark 1.1. A problem of controlled evolution can be thought as a controlled im- mersion if m < n, controlled diffeomorphism if m = n or controlled submersion if m > n. Particularly, if m = n, taking the trace after the indices i, α, we find a divergence type evolution (conservation laws).

To simplify, we replace the manifolds M and T and the fiber Ux by their local representativesRn, Rm, U ⊂Rk respectively. More precisely, for multitimes we use the orthantRm+. Having this in mind, for the multitimes s = (s1, ..., sm) and t = (t1, ..., tm), we denote s t if and only if sα tα, α = 1, ..., m (product order).

Then the parallelepiped Ω0t0, fixed by the diagonal opposite points 0(0, ...,0) and t0 = (t10, ..., tm0), is equivalent to the closed interval 0≤t≤t0. Givenu(·)∈ U, the statex(·) is the solution of the evolution system

(P DE) ∂xi

∂tα(t) =Xαi(x(t), u(t)), x(0) =x0, t∈0t0 ⊂Rm+.

(3)

This multitime evolution system is used as a constraint when we want to optimize a multitime cost functional. On the other hand, the cost functionals can be introduced at least in two equivalent ways:

- either using a curvilinear integral, P(u(·)) =

Z

Γ0t0

Xβ(x(t), u(t))dtβ+g(x(t0)),

where Γ0t0 is an arbitraryC1curve joining the points 0 andt0, therunning costsω= Xβ(x(t), u(t))dtβis a closed (completely integrable) 1-form (autonomous Lagrangian 1-form), andg is theterminal cost;

- or using the multiple integral, Q(u(·)) =

Z

0t0

X(x(t), u(t))dt1...dtm+g(x(t0)),

where therunning costsX(x(t), u(t)) is a continuous function (autonomous Lagrangian), andgis theterminal cost.

Theorem 1.2. The controlled multiple integral

I(t0) = Z

0t0

X0(x(t), u(t))dt1...dtm,

withX0(x(t), u(t))as continuous function, is equivalent to the controlled curvilinear integral

J(t0) = Z

Γ0t0

Xβ0(x(t), u(t))dtβ,

whereω =Xβ0(x(t), u(t))dtβ is a closed (completely integrable)1-form and the func- tionsXβ0 have partial derivatives of the form

∂tα,

∂tα∂tβ(α < β), ... , m−1

∂t1...∂tˆα...∂tm. Thehat symbolposed over∂tαdesignates that∂tα is omitted.

Proof. The multiple integralI(t0) suggests to introduce a new coordinate x0(t) =

Z

0,t

X0(x(t), u(t))dt1...dtm, t∈0t0, x0(t0) =I(t0).

Taking

Xα0(x(t), u(t)) = ∂x0

∂tα(t), we can writex0(t) as the curvilinear integral

x0(t) = Z

γ0t

Xα0(x(s), u(s))dsα, x0(t0) =J(t0),

(4)

whereγ0t is an arbitraryC1 curve joining the points 0 andtin Ω0t0. Also

m−1Xα0

∂t1...∂tˆα...∂tm = mx0

∂t1...∂tα...∂tm.

Conversely, the curvilinear integralJ(t0) suggests to define a new coordinate by x0(t) =

Z

γ0t

Xα0(x(s), u(s))dsα, x0(t0) =J(t0),

where γ0t is an arbitrary C1 curve joining the points 0 and t in Ω0t0, and ω = Xα0(x(s), u(s))dsαis a completely integrable 1-form. SinceXα0 = ∂x0

∂tα, we can define X0= m−1Xα0

∂t1...∂tˆα...∂tm = mx0

∂t1...∂tα...∂tm. Then the new coordinate can be written as

x0(t) = Z

0t

X0(x(t), u(t))dt1...dtm, t∈0,t0, x0(t0) =I(t0).

¤ In this paper, we shall develop the optimization problems using cost functionals as path independent curvilinear integrals and constraints asm-flows, where the complete integrability conditons are piecewise satisfied.

Remark 1.3. New aspects of control theory are developed in the papers [9], [10], [31]

and [32] using weak derivatives instead of usual partial derivatives.

Remark 1.4. We can extend the holonomic controlled evolution to a nonholonomic controlled evolution, using the Pfaff system dxi =Xαi(x, u)dtα. In the noholonomic case, the dimension of evolution is smaller thanm.

2 Maximum principle for multitime control theory based on a curvilinear integral cost

Acurvilinear integral costand amultitime flowwere introduced in theoptimal control theoryby our papers [12]-[30].

Multitime optimal control problem. Find maxu(·) P(u(·)) =

Z

Γ0t0

Xβ(x(t), u(t))dtβ+g(x(t0)) subject to ∂xi

∂tα(t) =Xαi(t, x(t), u(t)), i= 1, ..., n, α= 1, ..., m, u(t)∈ U, t∈0t0, x(0) =x0, x(t0) =xt0.

Themultitime maximum principle(necessary condition) will assert the existence of a costate vector function(p0, p)(·) = (p0(·), pi(·)) which, together with the optimal

(5)

m-sheetx(·), satisfies an appropriate PDEs system and a maximum condition. All these conditions can be written using the appropriatecontrol Hamiltonian 1-form

Hα(x, p0, p, u) =p0Xα0(x, u) +piXαi(x, u).

Theorem 2.1. (multitime maximum principle) Suppose u(·) is optimal for (P),(P DE)and that x(·) is the corresponding optimal m-sheet. Then there exists a function(p0, p) = (p0, pi) : Ω0t0→Rn+1 such that

(P DE) ∂x∗i

∂tα(t) =∂Hα

∂pi (x(t), p0(t), p(t), u(t)),

(ADJ) ∂pi

∂tα(t) =−∂Hα

∂xi (x(t), p0(t), p(t), u(t)) and

(M) Hα(x(t), p0(t), p(t), u(t)) = max

u∈Ux

Hα(x(t), p0(t), p(t), u), t. Also, the functionst→Hα(x(t), p0(t), p(t), u(t))are constants.

(t0) p0(t0) =a0, pi(t0) = ∂g

∂xi(x(t0)) are satisfied.

We callx(·) thestateof the optimally controlled system and (p0, p(·)) thecostate vector.

Remark 2.2. (P DE)means the identities

∂x∗i

∂tβ (t) =Xβi(x(t), u(t)), β= 1, . . . , m; i= 1, . . . , n, (controlled evolution PDEs).

Remark 2.3. (ADJ)means the identities

∂pi

∂tβ (t) = Ã

p0(t)∂Xβ0

∂xi +pj(t)∂Xβj

∂xi

!

(x(t), u(t))

(adjoint PDEs).

Remark 2.4. The relations (M)represent the maximization principle and the rela- tion(t0) means the terminal (transversality) condition.

Remark 2.5. The multitime maximum principle states necessary conditions that must hold on an optimalm-sheet of evolution.

(6)

Example. Consider the problem of a mine owner who must decide at what rate to extract a complex ore from his mine. He owns rights to the ore from two-time date 0 = (0,0) to two-time dateT = (T, T). Atwo-time datecan be the pair (date, useful component frequence). At two-time date 0 there isx0= (xi0) ore in the ground, and the instantaneous stock of orex(t) = (xi(t)) declines at the rateu(t) = (uiα(t)) the mine owner extracts it. The mine owner extracts ore at costqiuiα(t)2

xi(t) and sells ore at a constant pricep= (pi). He does not value the ore remaining in the ground at time T (there is no ”scrap value”). He chooses the rateu(t) of extraction in two-time to maximize profits over the period of ownership with no two-time discounting.

Solution (continuous two-time version). The manager want to maximizes the profit (curvilinear integral)

P(u(·)) = Z

γ0T

Ã

piuiα(t)−qiuiα(t)2 xi(t)

! dtα

subject to the law of evolution ∂x

∂tγ(t) =−uγ(t). Form the Hamiltonian 1-form Hα=piuiα(t)−qiuiα(t)2

xi(t) −λi(t)uiα(t), differentiate and write the equations

∂Hα

∂uβ = µ

pi2qiuiα xi −λi

δβα= 0, no sum after the index i;

∂λi

∂tα(t) =−∂Hα

∂xi =−qi

µuiα(t) xi(t)

2

, no sum after the index i.

As the mine owner does not value the ore remaining at timeT, we haveλi(T) = 0.

Using the above equations, it is easy to solve for the differential equations governing u(t) andλi(t):

2qiuiα(t)

xi(t) =pi−λi(t), ∂λi

∂tα(t) =−qi

µuiα(t) xi(t)

2

no sum after i

and using the initial and turn-T conditions, the equations can be solved numerically.

Free multitime, fixed endpoint problem. Given a controlu(·)∈ U, the state x(·) is the solution of the initial value problem

(P DE) ∂xi

∂tα(t) =Xαi(x(t), u(t)), x(0) =x0, t∈0t0 ⊂Rm+.

Suppose that a target pointx1 is given. Then we consider the cost functional (path independent curvilinear integral)

(P) P(u(·)) =

Z

γ

Xβ0(x(t), u(t))dtβ,

(7)

whereγis an arbitraryC1curve joining the points 0 andτin Ω0t0, forτ =τ(u(·))

∞, τ = (τ1, ..., τm) as the first multitime the solution of (P DE) hits the target point x1. We ask to find an optimal controlu(·) such that

P(u(·)) = max

u(·)∈UP(u(·)).

The control Hamiltonian 1-form is

Hα(x, p0, p, u) =p0Xα0(x, u) +piXαi(x, u).

Theorem 2.6. (multitime maximum principle) Suppose u(·) is optimal for (P),(P DE)and that x(·) is the corresponding optimal m-sheet. Then there exists a function(p0, p) = (p0, pi) : Ω→Rn+1 such that

(P DE) ∂x∗i

∂tα(t) =∂Hα

∂pi

(x(t), p0(t), p(t), u(t)),

(ADJ) ∂pi

∂tα(t) =−∂Hα

∂xi (x(t), p0(t), p(t), u(t)), and

(M) Hα(x(t), p0(t), p(t), u(t)) = max

u∈Ux

Hα(x(t), p0(t), p(t), u), t. Moreover,

Hα(x(t), p0(t), p(t), u(t))| = 0,

whereτ denotes the first multitime them-sheetx(·)hits the target point x1. We callx(·) thestateof the optimally controlled system and (p0, p(·)) thecostate matrix.

Remark 2.7. A more careful statement of the multitime maximum principle is: there exist the constantp0 and the function (pi) : Ω0t→Rn such that (P DE), (ADJ), and(M)hold. The vector functionp(·)is a Lagrange multiplier, which appears owing to the constraint that the optimal m-sheetx(·)must satisfy (P DE).

Remark 2.8. If the number p0 is0, then the control Hamiltonian 1-form does not depend on the corresponding running costsXα0and in this case the maximum principle must be reformulated. Such a problem will be called abnormal problem.

Remark 2.9. The previous theory can be extended to the nonautonomous case

(P) P(u(·)) =

Z

γ

Xβ0(t, x(t), u(t))dtβ and

(P DE) ∂xi

∂tα(t) =Xαi(t, x(t), u(t)), t∈Rm+,

using the idea of reducing to the previous case by introducing new variables xα = tα, α= 1, ..., m.

(8)

2.1 Maximum principle with transversality conditions

We look again at the dynamics

(P DE) ∂xi

∂tα(t) =Xαi(x(t), u(t)), t∈Rm+,

when the initial pointx0belongs to the subsetM0⊂Rn and the terminal pointx1is constrained to lie in the subsetM1⊂Rn. In other word, we must choose the starting pointx0∈M0in order to maximize

(P) P(u(·)) =

Z

γ

Xβ0(x(t), u(t))dtβ,

whereγ is an arbitraryC1curve joining the points 0 andτ in Ω0t0, forτ=τ(u(·)) as the first multitime we hitM1.

Assumption. The subsets M0 and M1 are smooth submanifolds of Rn. In this context, we can use the tangent spacesTx0M0andTx1M1.

Theorem 2.10. (more transversality conditions)If the functionsu(·)andx(·) solve the previous problem, withx0=x(0), x1=x(τ), then there exists the function p(·) : Ω→Rn such that(P DE),(ADJ)and(M)hold fort∈. Also, (t0) the vector p)is orthogonal to Tx1M1,

the vector p(0)is orthogonal to Tx0M0.

Remark 2.11. Let Γ0t0 be an arbitraryC1 curve joining the diagonal points 0 and t0. According the terminal/transversality condition, fort0>0 and

P(u(·)) = Z

Γ0t0

Xβ0(x(t), u(t))dtβ+g(x(t0)),

the condition(t0)means pi(t0) = ∂g

∂xi(x(t0)).

Remark 2.12. Suppose M1 = {x|fk(x) = 0, k = 1, ..., `}. Since the normal space (orthogonal complement of the tangent space Tx1M1) is generated by the vec- tors ∂fk

∂xi(x1), k = 1, ..., `, we must have pi) = λk∂fk

∂xi(x1), for some parameters (constants, Lagrange multipliers)λ1, ..., λ`.

2.2 Maximum principle with state constraints

Let us return to the problem

(P DE) ∂xi

∂tα(t) =Xαi(x(t), u(t)), x(0) =x0, t∈0t0 ⊂Rm+.

(P) P(u(·)) =

Z

γ

Xβ0(x(t), u(t))dtβ,

(9)

forτ=τ(u(·))≤ ∞, τ = (τ1, ..., τm) as the first multitime withx(τ) =x1, andγ

an arbitrary C1 curve joining the points 0 and τ in Ω0t0. In this sense, we have a fixed endpoint problem.

State constraints. Suppose our multitime dynamics remains in the submanifold N = {x Rn|f(x) 0}, where f : Rn R is a differentiable function. The functionsf andXαi define new functions

cα(x, u) = ∂f

∂xi(x)Xαi(x, u).

Ifx(t)∈∂N fort∈s0s1, thencα(x(t), u(t)) = 0.

Theorem 2.13. (maximum principle for state constraints)Supposeu(·), x(·) solve the previous control theory problem, and thatx(t)∈∂N fort s0s1. Then there exist the costate vector function p(·) : Ωs0s1 Rn and there exist λγβ(·) : Ωs0s1 →R such that

(P DE) ∂x∗i

∂tα(t) =∂Hα

∂pi

(x(t), p0(t), p(t), u(t)),

(ADJ0) ∂pi

∂tβ (t) =−∂Hβ

∂xi (x(t), p(t), u(t)) +λ∗γβ(t)∂cγ

∂xi(x(t), u(t)) (M0) Hβ(x(t), p(t), u(t)) = max

u∈Ux

{Hβ(x(t), p(t), u)|cα(x(t), u) = 0}

hold, for multitimest∈s0s1.

Remark 2.14. Let q1, ..., qs be differentiable functions on U which determine the subset

A={u∈U|q1(u)0, ..., qs(u)0}

in the control set (the condition m s is necessary). In this case, instead of the relation(M0) appear(M00):

∂Hβ

∂u (x(t), p(t), u(t)) =λ∗γβ(t)∂cγ

∂u(x(t), u(t)) +µ∗rβ(t)∂qr

∂u(u(t)).

The functionsλ∗γβ(·) are those appearing in (ADJ0). If x(t)lies in the interior of N, for say the multitimes t∈0s0, then the ordinary multitime maximum principle holds.

Remark 2.15. (Jump conditions)Let s0be a multitime thatp hits the boundary

∂N. Then p(s00) =p(s0+ 0). This means that there is no jump inp when we hit∂N. However,

p(s1+ 0) =p(s10)−λ∗ββ(s1)∂f

∂x(x(s1)),

i.e., there is possibly a jump inp when we leave ∂N. Of course, these statements are true when the gluing (contact) sheets, i.e., the unconstrained evolution sheet and the evolution boundary∂N have the same dimension.

(10)

3 Proofs of the multitime maximum principle

The proofs of multitime maximum principle relies either on control variations similar to those in variational calculus (for interior solutions, see also [15]-[30]) or on control m-needle variations. These give rise to variations of a referencem-sheet. In spite of the fact that these variations are not equivalent, they assert similar statements.

Here we use them-needle variations. To explain their meaning, we consider a can- didate optimal controlu(·), the corresponding optimalm-sheetx(·), and a multitime pointsofapproximate continuityfor the functionsXα(x(·), u(·)) and u(·)∈ U. An m-needle variationis a family of controls u²(·) obtained replacingu(·) withu(·) on the set Ω0s\0s−ε. Anym-needle variation gives rise to a gradient variation yα of an m-sheet x(t) satisfying the variational PDE ∂yiα

∂tβ(t) = yjα(t)∂Xβi

∂xj (x(t), u(t)), in the classical sense, only after the multitimes. Of course, the last PDEs satisfies the complete integrability conditions.

3.1 Simple controls variations

The response x(·) to a given control u(·) is the unique solution of the completely integrable PDEs system

(P DE) ∂xi

∂tα(t) =Xαi(x(t), u(t)), x(0) =x0, t∈0t0 ⊂Rm+.

Let us find how the m-needle changes in the control affect the response. In this sense, we fix the multitime s = (s1, ..., sm), sα > 0, α = 1, ..., m, and a control v(·)∈ U. We select ²= (²1, ..., ²m), ²α>0 with the property 0< sα−²α < sα and define the modified control

u²(t) =

½ v(t) ift∈0s\0s−²

u(t) otherwise,

which is called a simple m-needle variation of u(·). We denote by x²(·) the corre- sponding response of our system

(22) ∂xi²

∂tα(t) =Xαi(x²(t), u²(t)), x²(0) =x0, t∈0t0 ⊂Rm+.

Let us try to understand how the choices ofsandv(·) causex²(·) to differ fromx(·), for small||²||>0.

Lemma 3.1. (changing initial conditions)Ifx²(·)is a solution of the initial value problem

∂xi²

∂tα(t) =Xαi(x²(t), u(t)), x²(0) =x0+²αyα0+o(²), t∈Rm+,

thenx²(t) =x(t) +²αyα(t) +o(²)as²→0, uniformly fortin compact subsets ofRm+, whereyα= (yiα) =

µ∂xi²

∂²α|²=0

is the solution of the initial value variational problem

∂yαi

∂tβ(t) =yαj(t)∂Xβi

∂xj (x(t), u(t)), yα(0) =yα0, t∈Rm+.

(11)

Coming back to the dynamics (22) and simple controlm-needle variation, we find Lemma 3.2. (dynamics and simple control variations) If u²(·) is a simple variation of the controlu(·), then

x²(t) =x(t) +²αyα(t) +o(²)as²→0

uniformly fortin compact subsets ofRm+, where yα(t) = 0, t0s and (23) ∂yiα

∂tβ(t) =yαj(t)∂Xβi

∂xj (x(t), u(t)), yα(s) =yαs, t∈Rm+ \0s, for

yαs=Xα(x(s), v(s))−Xα(x(s), u(s)).

Proof. For simplicity, let us drop the superscript∗. First we remark thatx²(t) =x(t) fort∈0s−² and hence yα(t) = 0 fort∈0s−². For the multitimet∈0s\0s−², we can use the curvilinear integral

x²(t)−x(t) = Z

Γs−²t

(Xα(x(s), v(s))−Xα(x(s), u(s)))dsα=o(²),

where Γs−²tis aC curve joining the pointss−²andt. Hence we can putyα(t) = 0 fort∈0s\0s−².

We set t=s. Since the curvilinear integral is independent of the path, we select Γ as being the straight line joining the pointss−²ands, i.e.,

Γ :tα=sα−²α+²ατ, α= 1, ..., m, τ [0,1].

Consequently,

x²(s)−x(s) = (Xα(x(s), v(s))−Xα(x(s), u(s)))²α+o(²).

On the multitime boxRm+\0s, the functionsx(.) andx²(.) are solutions for the same PDEs, but with different initial conditions: x(0) =x0andx²(s) =x(s)+²αyαs+o(²), foryαs defined by (23). The Lemma of changing initial conditions shows that

x²(t) =x(t) +²αyα(t) +o(²)

foryα(·) solving (23) and fort∈Rm+ \0s. ¤

3.2 Free endpoint problem, no running cost

Statement. Let us consider again the multitime dynamics

(P DE) ∂xi

∂tα(t) =Xαi(x(t), u(t)), x(0) =x0, t∈0t0⊂Rm+ together the terminal cost functional

(P) P(u(·)) =g(x(t0)),

(12)

which must be maximized with respect to the control. We denote byu(·) respectively x(·) the optimal control and the optimalm-sheet of this problem. Since the running cost is zero, the control Hamiltonian 1-form is

Hα(x, p, u) =piXαi(x, u).

It remains to find a vector functionp= (pi) : Ω0t0 →Rn such that

(ADJ) ∂pi

∂tα(t) =−∂Hα

∂xi (x(t), p(t), u(t)), t0t0, and

(M) Hα(x(t), p(t), u(t)) = max

u∈U Hα(x(t), p(t), u).

For simplicity, let us drop the superscript∗. Also, we take into account the control variationu²(.).

The costate. We define the costate p : Ω0t0 Rn, p = (pi), as the unique solution of the terminal value problem

(24) ∂pi

∂tβ(t) =−pj(t)∂Xβj

∂xi (x(t), u(t)), t0t0, pi(t0) = ∂g

∂xi(x(t0)).

The solutions of the Cauchy problems (23)+(24) determine an 1-form of components piyβi. The PDEs (24) are calledadjoint equationssince we can verify by computation that the components (scalar products) piyβi are first integrals of the PDEs system (23)+(24). The costate will be used to calculate the variation of the terminal cost.

Lemma 3.3. (variation of terminal cost)The partial variations of the terminal cost are

∂²βP(u²(·))|²=0=pi(s)¡

Xβi(x(s), v(s))−Xβi(x(s), u(s))¢ .

Proof. Since P(u²(·)) = g(x(t0) +²βyβ(t0) +o(²)), where y(·) satisfies the previous Lemmas, we find

∂²βP(u²(·))|²=0= ∂g

∂xi(x(t0))yiβ(t0).

Since the componentspiyiβare first integrals of the PDEs system (23)+(24), we obtain

∂g

∂xi(x(t0))yiβ(t0) =pi(t0)yβi(t0) =pi(s)yβi(s), ∀s∈0t0.

Finally, the functionsyβ(s) =Xβ(x(s), v(s))−Xβ(x(s), u(s)) give the desired formula.

¤

We restore the superscriptand we formulate the next

Theorem 3.4. (multitime maximum principle) There exists a function p : Ω0t0 →Rn satisfying the adjoint dynamics (ADJ), the maximization principle (M) and the terminal (transversality) condition(t0).

(13)

Proof. The adjoint dynamics and the terminal condition appear in the Cauchy prob- lem (24). To prove (M), we fixs∈int Ω0t0 andv(·)∈ U, as above. Since the function

²→P(u²(·)), ²0t0,has a maximum at²= 0, we must have 0 ∂P

∂²β(u²(·))|²=0=pi(s)¡

Xβi(x(s), v(s))−Xβi(x(s), u(s))¢ . Consequently

Hβ(x(s), p(s), v(s)) =pi(s)Xβi(x(s), v(s))

≤pi(s)Xβi(x(s), u(s)) =Hβ(x(s), p(s), u(s)),

for eachs int Ω0t0 and v(·)∈ U. Since the function v(·) is arbitrary, so it is the valuev(s) and therefore

Hα(x(t), p(t), u(t)) = max

u∈U Hα(x(t), p(t), u).

¤

3.3 Free endpoint problem with running costs

Let us consider that the cost functional include a running cost, i.e.,

(P) P(u(·)) =

Z

Γ0t0

Xβ0(x(t), u(t))dtβ+g(x(t0)),

where Γ0t0 is an arbitraryC1curve joining the points 0 andt0, therunning costdx0= Xβ0(x(t), u(t))dtβis a closed (completely integrable) 1-form, andgis theterminal cost.

In this case thecontrol Hamiltonian 1-frommust have the form Hα(x, p0, p, u) =p0Xα0(x, u) +piXαi(x, u),

under the condition that we can built a costate functionp(·) = (p0(·), pi(·)) satis- fying (ADJ), (M) and (t0).

Adding a new variable. Introducing a new variable x0, we convert the theory to the previous case.

Letx0: Ω0t0→Rbe the solution of initial problem

(25) ∂x0

∂tα(t) =Xα0(x(t), u(t)), x0(0) = 0, t0t0, wherex(·) = (xi(·)) is the solution of (P DE). Introduce

x= (x1, ..., xn), x= (x0, x), x0= (0, x0), x(·) = (x0(·), x(·)), and

Xα(x, u) = (Xα0(x, u), Xα(x, u)), g(x) =g(x) +a0x0. Then (P DE) and (25) give the dynamics

(P DE) ∂x

∂tα(t) =Xα(x(t), u(t)), x(0) =x0, t∈0t0.

(14)

Consequently, the actual control problem transforms into a new control problem with no running cost and the terminal cost

(P) P(u(·)) =g(x(t0)).

We apply the previous theorem of multitime maximum principle to obtainp: Ω0t0 Rn+1, p= (pi) satisfying (Mα) for the control Hamiltonian 1-form

Hα(x, p, u) =piXiα(x, u).

The adjoint equations (ADJ) hold for the terminal transversality condition (t0) pj(t0) = ∂g

∂xj(x(t0)), j= 0,1,2, ..., n.

SinceXαdo not depend upon the variablex0, the 0-th equation in the adjoint equa- tions (ADJ) is ∂p0

∂tβ = 0. On the other hand, the relation ∂g

∂x0 =a0 impliesp0 =a0. ConsequentlyHβ(x, p, u) andp(·) = (pi(·)) satisfy (ADJ) and (M).

3.4 Multitime multiple control variations

To formulate and prove the multitime maximum principle for the next fixed endpoint problem, we need to introduce a multiple variation of the control.

Let us find how multiple changes in the control affect the response. We fix the multitimes sA = (s1A, ..., smA), A = 1, ..., N, with 0 < s1 < ... < sN, the control parametersvA(·)∈ U and strictly positive numbersλA,A= 1,2, ..., N. Select² >0 so small that the domains Ω0sA\0sA−λA²do not overlap. Define the modified control

u²(t) =

½ vA(t) ift∈0sA\0,sA−λA², A= 1, ..., N u(t) otherwise,

which is called amultiplem-needle variationof the controlu(·). We denotex²(·) the corresponding response of the Cauchy problem

(26) ∂xi²

∂tα(t) =Xαi(x²(t), u²(t)), x²(0) =x0, t∈Rm+.

Let us try to understand how the choices ofsA and vA(·) cause x²(·) to differ from x(·), for small² >0. Firstly, we setyα(t) =Yα(t, s)yαs, t∈s∞, for the solution of the Cauchy variational problem (linear PDE system)

∂yiα

∂tβ(t) =yαj(t)∂Xβi

∂xj (x(t), u(t)), yα(s) =yαs, t∈s∞, where the pointsyαs∈Rn are given andYα(t, s) is thetransition matrix.

We define

yαsA=Xα(x(sA), vA(s))−Xα(x(sA), u(sA)), A= 1,2, ..., N.

and we replace the Lemma of dynamics and simple control variations with

(15)

Lemma 3.5. (dynamics and multiple control variations)Ifu²(.)is a multiple variation of the controlu(·), then

x²(t) =x(t) +²αyα(t) +o(²)as²→0 uniformly fortin compact subsets ofRm+, where



yα(t) = 0 t∈0s1

yα(t) =PP

A=1λAYα(t, sA)yαsA t∈0sA+1\0sA, P = 1,2, ..., N1 yα(t) =PN

A=1λAYα(t, sA)yαsA t∈sN.

Definition 3.1. (cones of variations) Let 0 < s1 s2 ... sN < t and yαsA ∈Rn, A= 1, ..., N. For eachα, the set

Kα(t) ={ XN

A=1

λAYα(t, sA)yαsA|N = 1,2, ...;λA>0}.

is called thecone of variations at multitimet.

We remark that each Kα(t) is a convex cone inRn, consisting in all changes in the statex(t) (up to order ²) we can make by multiple variations of the controlu(·) (see the previous Lemma). To study the geometry ofKα(t), we need the following topological Lemma:

Lemma 3.6. (zeroes of a vector field)LetSbe a closed, bounded, convex subset of Rn andp∈IntS. IfY :S→Rn is a continuous vector field satisfying||Y(x)−x||<

||x−p||, ∀x∈∂S, then there exists a pointx∈S such that Y(x) =p.

Proof. For the general case, we assume after a translation that p= 0, and 0IntS.

We mapSontoB(0,1) by a radial dilation, and mapY by rigid motion. This process convert the general case to the next case.

Suppose thatS is the unit ballB(0,1) andp= 0. The inequality in hypothesis is equivalent to (Y(x), x)>0, ∀x∈∂B(0,1). Consequently, for smallt, the continuous mappingZ(x) =x−tY(x) mapsB(0,1) into itself. According Brouwer Fixed Point Theorem, the mappingZ has a fixed point, let sayZ(x) =x, and henceY(x) = 0.

¤

3.5 Fixed endpoint problem

The fixed endpoint problem is characterized by the constraint x(τ) = x1, where τ =τ(u(·)) is the first multitime thatx(·) hits the target point x1. In this context, the cost functional is

P(u(·)) = Z

γ

Xβ0(x(t), u(t))dtβ.

Adding a new variable. We define againx0: Ω0t0 →Ras the solution of initial problem

∂x0

∂tα(t) =Xα0(x(t), u(t)), x0(0) = 0, t,

参照

関連したドキュメント

Here is the “surprise”: the validity of assumption (2.14) on Claim 2.3 for some hyperbolic/Petrowski-type systems is verified (see Section 4) by precisely the same hard analysis

Key words: stochastic differential equations, Brownian motion, Law of the Iterated Logarithm, Motoo’s theorem, stochastic comparison principle, stationary processes,

In this paper we study one dimensional backward stochastic differential equations (BSDEs) with random terminal time not necessarily bounded or finite when the generator F (t, Y, Z)

Key words: Barabási-Albert model, sublinear preferential attachment, dynamic random graphs, maximal degree, degree distribution, large deviation principle, moderate

In this section we will discuss some relations between the validity of (Weak) Corestriction Principles for connecting maps of various types. For simplicity we consider only

Key words: anisotropic variable exponent Sobolev spaces, weak solution, critical point, Mountain Pass Theorem, variational methods..

Solving PDEs via RBF-PS Methods M ATLAB

Key words: Perturbed Empirical Distribution Functions, Strong Mixing, Almost Sure Representation, U-statistic, Law of the Iterated Logarithm, Invariance Principle... AMS