• 検索結果がありません。

1.Introduction HuilingWu Time-ConsistentStrategiesforaMultiperiodMean-VariancePortfolioSelectionProblem ResearchArticle

N/A
N/A
Protected

Academic year: 2022

シェア "1.Introduction HuilingWu Time-ConsistentStrategiesforaMultiperiodMean-VariancePortfolioSelectionProblem ResearchArticle"

Copied!
14
0
0

読み込み中.... (全文を見る)

全文

(1)

Volume 2013, Article ID 841627,13pages http://dx.doi.org/10.1155/2013/841627

Research Article

Time-Consistent Strategies for a Multiperiod Mean-Variance Portfolio Selection Problem

Huiling Wu

China Institute for Actuarial Science, Central University of Finance and Economics, Beijing 100081, China

Correspondence should be addressed to Huiling Wu; [email protected] Received 1 January 2013; Accepted 28 February 2013

Academic Editor: Francis T. K. Au

Copyright © 2013 Huiling Wu. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

It remained prevalent in the past years to obtain the precommitment strategies for Markowitz’s mean-variance portfolio optimization problems, but not much is known about their time-consistent strategies. This paper takes a step to investigate the time-consistent Nash equilibrium strategies for a multiperiod mean-variance portfolio selection problem. Under the assumption that the risk aversion is, respectively, a constant and a function of current wealth level, we obtain the explicit expressions for the time-consistent Nash equilibrium strategy and the equilibrium value function. Many interesting properties of the time-consistent results are identified through numerical sensitivity analysis and by comparing them with the classical pre-commitment solutions.

1. Introduction

Since the pioneering work of [1] in a single period, mean- variance formulation has been one of focused topics of portfolio selection optimization and has stimulated hundreds of extensions and applications. The interested readers can refer to [2,3] for detailed information. The objective of quite a number of existing mean-variance portfolio selection models is seeking an optimal strategy 𝜋(⋅) which maximizes the mean-variance utility𝐸[𝑋𝜋𝑇] − 𝜔Var[𝑋𝜋𝑇], where𝑋𝜋𝑇 is the terminal wealth. But it is well known that this mean-variance criterion lacks of iterated-expectation property, which gives rise to time-inconsistent investment strategy in the sense that Bellman optimality principle is not available any more.

The so-called time-inconsistent strategy means that optimal strategy obtained at time 𝑛 does not agree with optimal strategy derived at time 𝑚 where 𝑚 > 𝑛. Therefore, the optimal strategy in the classical time-inconsistent models is just optimal from the viewpoint of the initial time, and decision makers at any time 𝑘 after the initial time must commit themselves to the initial optimal strategy even if it is not optimal at time𝑘. So, the time-inconsistent optimal strategy in the classical mean-variance model is called the precommitment strategy. But this precommitment has been

criticized for lacking rationality. For one simple example, investment psychology and tastes will often change over time, and the decision maker at later time may not commit themselves to following a strategy which is not optimal at their current time. The work in [4] analyzed the incentives which induce the investor to revise her optimal strategy at subsequent dates under mean-variance criterion.

For this reason, we want to find an optimal strategy with time consistency which is necessary for a rational individual. The analysis of inconsistency can be traced back to [5] which pointed out that “optimal plan of the present moment is generally one which will not be obeyed” and the time-inconsistent problem can be solved by precom- mitment strategy or alternatively time-consistent strategy.

The authors of [5, 6] devoted themselves to identifying an intertemporal consumption programme that would be “the best plan that an agent would actually follow.” The work in [7] questioned the generality of the existence of Strotz- Pollak equilibrium and gave an alternative criterion of Nash equilibrium. More recently, it is of interest to study time- inconsistent problems. The work of [8, 9] investigated a time-consistent strategy for a consumption and investment problem with nonexponential discounting. The work in [10] gave general approaches to handle time-inconsistent

(2)

problems by viewing them as a game theoretic framework and looking for Nash subgame perfect equilibrium points. They formally defined the continuous time equilibrium concept and derived the extension of HJB equation and its verification theorem for a very general objective functions. The work in [11] studied a continuous-time mean-variance portfolio optimization model on the assumption that the risk aversion factor depended dynamically on the current wealth. In view of the extension of HJB equation developed in [10], they obtained the time-consistent equilibrium control and equilibrium value function. The work in [12] provided explicit solutions to a series of cases, including mean-standard deviation in continuous-time setting. The work in [13] inves- tigated optimal mean-variance time-consistent investment and reinsurance policies for an insurer under continuous- time setting. The work in [14] developed a fully numerical scheme to determine time-consistent mean-variance strategy based on piecewise constant policy technique. As for the discrete-time mean-variance models, the work in [15] gave a complicated backwards recursive relationship about time- consistent investment strategy but had not found analytical expression for the strategy.

To the best of our knowledge, no existing literature has given time-consistent equilibrium strategy and equilib- rium value function in closed form for discrete-time mean- variance asset allocation. Our research will fill the gap. We view this decision-making process as a noncooperative game and suppose that there is one decision maker, referred to as

“decision maker𝑛”, for each point of time𝑛. This assumption is reasonable. On one hand, in the real world, there are often quite different persons who will join in the decision-making process especially when the investment horizon is long; on the other hand, we can image decision-maker𝑛as the future incarnation of themselves at time𝑛considering that the tastes of the decision maker will change over time. So our work in this paper is listed as follows: (a) derive the analytical expressions for the time-consistent equilibrium strategy and equilibrium value function when the risk aversion is assumed to be a constant and a function of current wealth, respectively;

(b) when risk aversion factor is a constant, compare our time-consistent results with the precommitment ones in [2]

and present the particular properties of the time-consistent results;(c)study the problem in discrete-time setting with nonconstant risk aversion which is a function of current wealth and identify the properties of the investment propor- tion by numerical analysis.

The rest of the paper is organized as follows. InSection 2, the problem formulation is presented, and the recursive formula of the equilibrium value function is derived. In Section 3, equilibrium strategy and equilibrium value func- tion for mean-variance model with constant risk aversion are obtained. Comparison of our time-consistent results with the precommitment ones in [2] is also given in this section.

InSection 4, the equilibrium results are investigated on the assumption that the risk aversion depends dynamically on the current wealth and numerical analysis is given to demonstrate the properties of investment proportion.Section 5presents our conclusions.

2. Problem Formulation

In this paper, we assume that investors join the market at time 0with an initial wealth𝑥0and plan to process the investment in𝑇consecutive time periods. There are one risk-free asset and𝑚1risky assets in the market. The𝑚1risky assets have random returns𝑅𝑛= (𝑅𝑛,1, 𝑅𝑛,2, . . . , 𝑅𝑛,𝑚1)󸀠at period𝑛(time interval [𝑛, 𝑛 + 1)) where 𝑅𝑛,𝑘 denotes the random return of the 𝑘th asset at period 𝑛and superscript “󸀠” stands for the transpose of a matrix or vector. Denote by𝑟𝑛𝑓the return of the risk-free asset at period 𝑛, and denote by 𝑋𝑛 and 𝜋𝑛 = (𝜋𝑛,1, . . . , 𝜋𝑛,𝑚1)󸀠the wealth available for investment and the amounts invested in𝑚1risky asset at time𝑛, respectively.

Then the wealth dynamics is

𝑋𝑛+1= (𝑋𝑛−∑𝑚1

𝑘=1

𝜋𝑛,𝑘) 𝑟𝑓𝑛 +∑𝑚1

𝑘=1

𝜋𝑛,𝑘𝑅𝑛,𝑘= 𝑋𝑛𝑟𝑛𝑓+ 𝜋󸀠𝑛𝑅𝑒𝑛, (1)

where𝑅𝑒𝑛,𝑘 = 𝑅𝑛,𝑘− 𝑟𝑛𝑓, 𝑅𝑛𝑒 = (𝑅𝑒𝑛,1, 𝑅𝑒𝑛,2, . . . , 𝑅𝑒𝑛,𝑚1)󸀠for𝑛 = 0, 1, . . . , 𝑇 − 1.

As we know, the classic mean-variance optimization problem is as follows:

{𝜋0,𝜋max1,...,𝜋𝑇−1}𝐸 (𝑋𝜋𝑇) − 𝜔Var(𝑋𝜋𝑇) , (2)

which results in a time-inconsistent strategy, that is, precom- mitment strategy. Therefore, as mentioned inSection 1, this paper aims to solve this problem from another perspective and to look for the time-consistent Nash equilibrium invest- ment strategy. To this end, we first give the definition of Nash equilibrium strategy according to [10] and the references therein.

Let𝜋(𝑛) = {𝜋𝑛, 𝜋𝑛+1, . . . , 𝜋𝑇−1}be the policy made at time 𝑛and

𝐽𝑛(F𝑛, 𝜋 (𝑛)) = 𝐸 (𝑋𝜋(𝑛)𝑇 |F𝑛) − 𝜔 (F𝑛)Var(𝑋𝜋(𝑛)𝑇 |F𝑛) , (3)

where 𝑋𝜋(𝑛)𝑇 is the terminal wealth corresponding to the investment strategy𝜋, andF𝑛 is the information at time𝑛, such as wealth level. A natural assumption is that risk aversion 𝜔is a function ofF𝑛.

Definition 1. Let ̂𝜋 be a fixed control law. For an arbitrary point𝑛 (𝑛 = 0, 1, . . . , 𝑇 − 1), one selects an arbitrary control value𝜋𝑛and define the strategy𝜋(𝑛) = (𝜋𝑛, ̂𝜋𝑛+1, . . . , ̂𝜋𝑇−1).

Then̂𝜋is said to be a subgame perfect Nash equilibrium strategy (or simply equilibrium strategy) if for all𝑛 < 𝑇, it satisfies

max𝜋𝑛 𝐽𝑛(F𝑛; 𝜋 (𝑛)) = 𝐽𝑛(F𝑛; ̂𝜋 (𝑛)) . (4)

(3)

In addition, if equilibrium strategŷ𝜋exists, the equilibrium value function is defined as

𝑉𝑛(F𝑛) = 𝐽𝑛(F𝑛; ̂𝜋 (𝑛)) . (5)

Let̂𝜋(𝑛)be the Nash equilibrium strategy at time𝑛, then Definition 1 makes it possible to solve the problem by the following procedure:

(a) ̂𝜋(𝑇 − 1) = ̂𝜋𝑇−1 = arg max𝜋𝑇−1[𝐸(𝑋𝜋𝑇𝑇−1 | F𝑇−1) − 𝜔(F𝑇−1)Var(𝑋𝜋𝑇𝑇−1 |F𝑇−1)];

(b) given that the decision maker𝑇 − 1 will use ̂𝜋𝑇−1,

̂𝜋𝑇−2is the optimal control that optimizes objective function𝐽𝑇−2(F𝑇−2; (𝜋𝑇−2, ̂𝜋𝑇−1));

(c) generally,̂𝜋𝑛 is obtained by letting decision maker𝑛 choose𝜋𝑛to maximize𝐽𝑛given that the forthcoming decision makers𝑛+1, . . . , 𝑇−1will choose the strategy

̂𝜋(𝑛 + 1) = (̂𝜋𝑛+1, . . . , ̂𝜋𝑇−1); that is,

̂𝜋𝑛=arg max

𝜋𝑛 𝐽𝑛(F𝑛; (𝜋𝑛, ̂𝜋𝑛+1, . . . , ̂𝜋𝑇−1)) . (6) Now we try to derive the time-consistent Nash equilib- rium strategy and value function, but first we need to give the following notations and assumptions throughout this paper:

(N1)𝑟𝑛𝑒 = 𝐸(𝑅𝑒𝑛), 𝑛 = 0, 1, . . . , 𝑇 − 1;

(N2)𝜎𝑛 = 𝐸(𝑅𝑒𝑛𝑅𝑒𝑛󸀠) − 𝑟𝑒𝑛𝑟𝑛𝑒󸀠, 𝑛 = 0, 1, . . . , 𝑇 − 1;

(N3)𝜅𝑛= 𝑟𝑒𝑛󸀠(𝜎𝑛)−1𝑟𝑛𝑒, 𝑛 = 0, 1, . . . , 𝑇 − 1;

(N4)𝑔𝜋(𝑛)𝑛 (F𝑛) = 𝐸[𝑋𝜋(𝑛)𝑇 | F𝑛], 𝑔𝑛(F𝑛) = 𝐸[𝑋𝑇̂𝜋(𝑛) | F𝑛], 𝑛 = 0, 1, . . . , 𝑇 − 1;

(N5)ℎ𝜋(𝑛)𝑛 (F𝑛) = 𝐸[(𝑋𝜋(𝑛)𝑇 )2 | F𝑛], ℎ𝑛(F𝑛) = 𝐸[(𝑋̂𝜋(𝑛)𝑇 )2 | F𝑛], 𝑛 = 0, 1, . . . , 𝑇 − 1.

(A1) The distribution function of the random returns𝑅𝑛 is𝐹𝑛, and {𝑅𝑛, 𝑛 = 0, 1, . . . , 𝑇 − 1}is assumed to be statistically independent.

(A2)𝜎𝑛is assumed to be positive definite.

(A3) Short selling is allowed for all risky assets in all peri- ods. Unlimited borrowing and lending are permitted.

Transaction costs are not taken into account.

(A4) Capital additions or withdrawals are forbidden for all assets in all periods.

With the notations above, we can obtain the recursions of 𝐽𝑛and𝑉𝑛. For the sake of convenience, we define𝐸𝑛,F𝑛(⋅) = 𝐸[⋅ |F𝑛]and Var𝑛,F𝑛(⋅) =Var(⋅ |F𝑛), then

𝐽𝑛(F𝑛; 𝜋 (𝑛))

= 𝐸𝑛,F𝑛[𝐽𝑛+1(F𝑛+1; 𝜋 (𝑛 + 1))]

− {𝐸𝑛,F𝑛[𝐸𝑛+1,F𝑛+1(𝑋𝜋(𝑛+1)𝑇 )

−𝜔 (F𝑛+1)Var𝑛+1,F𝑛+1(𝑋𝜋(𝑛+1)𝑇 )]

−𝐸𝑛,F𝑛(𝑋𝜋(𝑛)𝑇 ) + 𝜔 (F𝑛)Var𝑛,F𝑛(𝑋𝜋(𝑛)𝑇 )}

= 𝐸𝑛,F𝑛[𝐽𝑛+1(F𝑛+1; 𝜋 (𝑛 + 1))]

− {𝐸𝑛,F𝑛[𝐸𝑛+1,F𝑛+1(𝑋𝜋(𝑛+1)𝑇 )] − 𝐸𝑛,F𝑛(𝑋𝜋(𝑛)𝑇 )}

+ 𝐸𝑛,F𝑛[ 𝜔 (F𝑛+1)Var𝑛+1,F𝑛+1(𝑋𝜋(𝑛+1)𝑇 )]

− 𝜔 (F𝑛)Var𝑛,F𝑛(𝑋𝑇𝜋(𝑛))

= 𝐸𝑛,F𝑛[𝐽𝑛+1(F𝑛+1; 𝜋 (𝑛 + 1))]

+ 𝐸𝑛,F𝑛[ 𝜔 (F𝑛+1)Var𝑛+1,F𝑛+1(𝑋𝜋(𝑛+1)𝑇 )]

− 𝜔 (F𝑛)Var𝑛,F𝑛(𝑋𝑇𝜋(𝑛))

= 𝐸𝑛,F𝑛[𝐽𝑛+1(F𝑛+1; 𝜋 (𝑛 + 1))]

+ 𝐸𝑛,F𝑛[𝜔 (F𝑛+1) 𝐸𝑛+1,F𝑛+1(𝑋𝜋(𝑛+1)𝑇 )2

−𝜔 (F𝑛+1) [ 𝐸𝑛+1,F𝑛+1(𝑋𝜋(𝑛+1)𝑇 )]2]

− 𝜔 (F𝑛) [𝐸𝑛,F𝑛(𝑋𝜋(𝑛)𝑇 )2− [ 𝐸𝑛,F𝑛(𝑋𝜋(𝑛)𝑇 )]2]

= 𝐸𝑛,F𝑛[𝐽𝑛+1(F𝑛+1; 𝜋 (𝑛 + 1))]

+ 𝐸𝑛,F𝑛[𝜔 (F𝑛+1) 𝐸𝑛+1,F𝑛+1((𝑋𝜋(𝑛+1)𝑇 )2)]

− 𝜔 (F𝑛) 𝐸𝑛,F𝑛((𝑋𝜋(𝑛)𝑇 )2)

− 𝐸𝑛,F𝑛[𝜔 (F𝑛+1) [ 𝐸𝑛+1,F𝑛+1(𝑋𝜋(𝑛+1)𝑇 )]2] + 𝜔 (F𝑛) {𝐸𝑛,F𝑛[𝐸𝑛+1,F𝑛+1(𝑋𝜋(𝑛+1)𝑇 )]}2

= 𝐸𝑛,F𝑛[𝐽𝑛+1(F𝑛+1; 𝜋 (𝑛 + 1))]

+ 𝐸𝑛,F𝑛[ 𝜔 (F𝑛+1) ℎ𝜋(𝑛+1)𝑛+1 (F𝑛+1)]

− 𝜔 (F𝑛) 𝐸𝑛,F𝑛(ℎ𝜋(𝑛+1)𝑛+1 (F𝑛+1))

− 𝐸𝑛,F𝑛[𝜔 (F𝑛+1) [ 𝑔𝜋(𝑛+1)𝑛+1 (F𝑛+1)]2] + 𝜔 (F𝑛) [ 𝐸𝑛,F𝑛(𝑔𝑛+1𝜋(𝑛+1)(F𝑛+1))]2.

(7)

(4)

ByDefinition 1, we have 𝑉𝑛+1(F𝑛+1) = 𝐽𝑛+1(F𝑛+1; ̂𝜋𝑛+1), and hence (7) implies the following recursion for equilibrium value function𝑉𝑛(F𝑛):

𝑉𝑛(F𝑛) =max

𝜋𝑛 {𝐸𝑛,F𝑛[𝑉𝑛+1(F𝑛+1)]

+ 𝐸𝑛,F𝑛[𝜔 (F𝑛+1) ℎ𝑛+1(F𝑛+1)]

− 𝜔 (F𝑛) 𝐸𝑛,F𝑛(ℎ𝑛+1(F𝑛+1))

− 𝐸𝑛,F𝑛[ 𝜔 (F𝑛+1) [𝑔𝑛+1(F𝑛+1)]2] +𝜔 (F𝑛) [ 𝐸𝑛,F𝑛(𝑔𝑛+1(F𝑛+1))]2} , 𝑛 = 0, 1, . . . , 𝑇 − 1,

(8)

𝑉𝑇(F𝑛) = 𝑥, (9)

where the recursions of𝑔𝑛(F𝑛)andℎ𝑛(F𝑛)are as follows:

𝑛(F𝑛) = 𝐸𝑛,F𝑛[(𝑋𝑇̂𝜋(𝑛))2]

= 𝐸𝑛,F𝑛[𝐸𝑛+1,F𝑛+1((𝑋𝑇̂𝜋(𝑛))2)]

= 𝐸𝑛,F𝑛(ℎ𝑛+1(F𝑛+1)) , 𝑛 = 0, 1, . . . , 𝑇 − 1, ℎ𝑇(F𝑇) = 𝑥2,

𝑔𝑛(F𝑛) = 𝐸𝑛,F𝑛(𝑋̂𝜋(𝑛)𝑇 )

= 𝐸𝑛,F𝑛[𝐸𝑛+1,F𝑛+1(𝑋̂𝜋(𝑛)𝑇 )] = 𝐸𝑛,F𝑛(𝑔𝑛+1(F𝑛+1)) , 𝑛 = 0, 1, . . . , 𝑇 − 1, 𝑔𝑇(F𝑇) = 𝑥.

(10)

3. Nash Equilibrium Strategy with Constant Risk Aversion

3.1. Equilibrium Strategy and Equilibrium Value Function.

When the risk aversion is a constant,𝐽𝑛is of the form 𝐽𝑛(𝑥; 𝜋 (𝑛)) = 𝐸 (𝑋𝑇𝜋(𝑛)| 𝑋𝑛= 𝑥) − 𝜔Var(𝑋𝜋(𝑛)𝑇 | 𝑋𝑛= 𝑥)

𝑉𝑛(F𝑛) = 𝑉𝑛(𝑋𝑛 = 𝑥) = 𝐽𝑛(𝑥; ̂𝜋 (𝑛)) .

(11) According to (8), the recursion of equilibrium value function 𝑉𝑛(𝑥)is simplified as

𝑉𝑛(𝑥) =max

𝜋𝑛 {𝐸𝑛,𝑥[𝑉𝑛+1(𝑋𝜋𝑛+1𝑛 )]

−𝜔Var𝑛,𝑥[𝑔𝑛+1(𝑋𝜋𝑛+1𝑛 )]} , 𝑛 = 0, 1, . . . , 𝑇 − 1,

(12)

𝑉𝑇(𝑥) = 𝑥. (13)

Recursion (12) indicates that𝑉𝑛(𝑥)does not depend onℎ𝑛, and then we only need to find the explicit expression of𝑔𝑛by the following recursion:

𝑔𝑛(𝑥) = 𝐸𝑛,𝑥(𝑋𝑇̂𝜋(𝑛))

= 𝐸𝑛,𝑥[𝑔𝑛+1(𝑋𝑛+1̂𝜋𝑛 )] , 𝑛 = 0, 1, . . . , 𝑇 − 1, (14)

𝑔𝑇(𝑥) = 𝑥. (15)

The following theorem gives the explicit expressions of̂𝜋 and𝑉𝑛(𝑥)

Theorem 2. When the risk aversion is a constant, the Nash equilibrium strategy is given by

̂𝜋𝑛 = 1 2𝜔

1

𝑇−1𝑘=𝑛+1𝑟𝑘𝑓(𝜎𝑛)−1𝑟𝑛𝑒, 𝑛 = 0, 1, . . . , 𝑇 − 1. (16) The corresponding equilibrium value function is

𝑉𝑛(𝑥) = 𝑥𝑇−1

𝑘=𝑛

𝑟𝑘𝑓+ 1 4𝜔

𝑇−1

𝑘=𝑛

𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑒𝑘, 𝑛 = 0, 1, . . . , 𝑇, (17)

𝑔𝑛(𝑥) = 𝑥𝑇−1

𝑘=𝑛

𝑟𝑘𝑓+ 1 2𝜔

𝑇−1

𝑘=𝑛

𝑟𝑒𝑘󸀠(𝜎𝑘)−1𝑟𝑒𝑘, 𝑛 = 0, 1, . . . , 𝑇. (18)

Proof. Obviously (17) and (18) hold true for𝑛 = 𝑇. Then for 𝑛 = 𝑇 − 1,

𝑉𝑇−1(𝑥) =max𝜋

𝑇−1 {𝐸𝑇−1,𝑥[𝑉𝑇(𝑋𝜋𝑇𝑇−1)]

−𝜔Var𝑇−1,𝑥[𝑔𝑇(𝑋𝜋𝑇𝑇−1)]}

=max𝜋

𝑇−1 [𝐸𝑇−1,𝑥(𝑋𝜋𝑇𝑇−1) − 𝜔Var𝑇−1,𝑥(𝑋𝜋𝑇𝑇−1)]

= 𝑥𝑟𝑓𝑇−1+sup

𝜋𝑇−1

[𝜋󸀠𝑇−1𝑟𝑒𝑇−1− 𝜔𝜋󸀠𝑇−1𝜎𝑇−1𝜋𝑇−1] . (19)

Since𝜎𝑇−1is positive definite, the optimal solution exists and is given by

̂𝜋𝑇−1= 1

2𝜔(𝜎𝑇−1)−1𝑟𝑒𝑇−1. (20) Substituting (20) into (19) yields

𝑉𝑇−1(𝑥) = 𝑥𝑟𝑓𝑇−1+ 1

4𝜔𝑟𝑇−1𝑒 󸀠𝜎𝑇−1−1 𝑟𝑇−1𝑒 , 𝑔𝑇−1(𝑥) = 𝐸𝑇−1,𝑥[𝑋𝑇̂𝜋𝑇−1] = 𝑥𝑟𝑇−1𝑓 + 1

2𝜔𝑟𝑇−1𝑒 󸀠𝜎−1𝑇−1𝑟𝑇−1𝑒 . (21)

(5)

Hence (16), (17), and (18) hold true for𝑛 = 𝑇 − 1. Now we assume that (17) and (18) are true for𝑛 + 1, then for𝑛,

𝑉𝑛(𝑥) =max

𝜋𝑛 {𝐸𝑛,𝑥[𝑉𝑛+1(𝑋𝜋𝑛+1𝑛 )]

−𝜔Var𝑛,𝑥[𝑔𝑛+1(𝑋𝜋𝑛+1𝑛 )]}

=max𝜋

𝑛 {𝐸𝑛,𝑥[(𝑥𝑟𝑓𝑛 + 𝑅𝑛𝑒󸀠𝜋𝑛) 𝑇−1

𝑘=𝑛+1

𝑟𝑘𝑓

+ 1 4𝜔

𝑇−1

𝑘=𝑛+1

𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑒𝑘]

− 𝜔Var𝑛,𝑥[(𝑥𝑟𝑓𝑛 + 𝑅𝑒𝑛󸀠𝜋𝑛) 𝑇−1

𝑘=𝑛+1

𝑟𝑘𝑓

+ 1 2𝜔

𝑇−1

𝑘=𝑛+1

𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒]}

=max

𝜋𝑛 {𝑥𝑇−1

𝑘=𝑛

𝑟𝑓𝑘 + 𝑟𝑛𝑒󸀠𝜋𝑛 𝑇−1

𝑘=𝑛+1

𝑟𝑘𝑓

+ 1 4𝜔

𝑇−1

𝑘=𝑛+1

𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒

−𝜔𝑇−1

𝑘=𝑛+1

(𝑟𝑓𝑘)2𝜋󸀠𝑛𝜎𝑛𝜋𝑛} .

(22)

It is obvious that the optimal solution of (22) exits and is given by

̂𝜋𝑛 = 1 2𝜔

1

𝑇−1𝑘=𝑛+1𝑟𝑘𝑓(𝜎𝑛)−1𝑟𝑛𝑒. (23)

Substituting (23) into (22), we obtain

𝑉𝑛(𝑥) = 𝑥𝑇−1

𝑘=𝑛

𝑟𝑘𝑓+ 1 4𝜔

𝑇−1

𝑘=𝑛

𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑒𝑘, (24)

and according to (14),

𝑔𝑛(𝑥) = 𝐸𝑛,𝑥[𝑔𝑛+1(𝑋𝑛+1̂𝜋𝑛 )]

= 𝐸𝑛,𝑥[𝑋𝑛+1̂𝜋𝑛 𝑇−1

𝑘=𝑛+1

𝑟𝑓𝑘

+ 1 2𝜔

𝑇−1

𝑘=𝑛+1

𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑒𝑘]

= (𝑥𝑟𝑓𝑛 + ̂𝜋𝑛𝑟𝑛𝑒) 𝑇−1

𝑘=𝑛+1

𝑟𝑘𝑓

+ 1 2𝜔

𝑇−1

𝑘=𝑛+1

𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒

= 𝑥𝑇−1

𝑘=𝑛

𝑟𝑓𝑘 + 1

2𝜔𝑟𝑛𝑒󸀠(𝜎𝑛)−1𝑟𝑛𝑒 + 1

2𝜔

𝑇−1

𝑘=𝑛+1

𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒

= 𝑥𝑇−1

𝑘=𝑛

𝑟𝑓𝑘 + 1 2𝜔

𝑇−1

𝑘=𝑛

𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒.

(25) Equations (23), (24), and (25) mean that (16), (17), and (18) hold true for 𝑛. By induction, the proof of Theorem 2 is complete.

3.2. Comparison to the Precommitment Results

3.2.1. About the Value Function. In view of (17), we know that the equilibrium value function at initial time0is

𝑉0(𝑥0) = 𝑥0𝑇−1

𝑘=0

𝑟𝑘𝑓+ 1 4𝜔

𝑇−1

𝑘=0

𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒, (26)

and the precommitment value function of [2] is 𝐸0,𝑥0(𝑋𝑇) − 𝜔Var0,𝑥0(𝑋𝑇)

= 𝑥0𝑇−1

𝑘=0

𝑟𝑘𝑓

+ 1 4𝜔

1 − ∏𝑇−1𝑘=0(1 − 𝑟𝑒𝑘󸀠𝐸−1(𝑅𝑒𝑘𝑅𝑒𝑘󸀠) 𝑟𝑘𝑒)

𝑇−1𝑘=0(1 − 𝑟𝑘𝑒󸀠𝐸−1(𝑅𝑒𝑘𝑅𝑒𝑘󸀠) 𝑟𝑒𝑘) .

(27)

The relationship between (26) and (27) is summarized in the following lemma.

Lemma 3. Consider the following:

𝐸0,𝑥0(𝑋𝑇) − 𝜔Var0,𝑥0(𝑋𝑇) ≥ 𝑉0(𝑥0) . (28) Proof. First of all, we have

𝐸−1(𝑅𝑒𝑘𝑅𝑒𝑘󸀠) = (𝜎𝑘)−1−(𝜎𝑘)−1𝑟𝑘𝑒𝑟𝑘𝑒󸀠(𝜎𝑘)−1

1 + 𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑒𝑘 , (29)

(6)

then,

𝑟𝑒𝑘󸀠𝐸−1(𝑅𝑒𝑘𝑅𝑘𝑒󸀠) 𝑟𝑒𝑘

= 𝑟𝑒𝑘󸀠[(𝜎𝑘)−1−(𝜎𝑘)−1𝑟𝑘𝑒𝑟𝑒𝑘󸀠(𝜎𝑘)−1 1 + 𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒 ] 𝑟𝑒𝑘

= 𝑟𝑒𝑘󸀠(𝜎𝑘)−1𝑟𝑘𝑒[1 − 𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒 1 + 𝑟𝑒𝑘󸀠(𝜎𝑘)−1𝑟𝑒𝑘]

= 𝑟𝑒𝑘󸀠(𝜎𝑘)−1𝑟𝑒𝑘 1 + 𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒.

(30)

Now,

𝐸0,𝑥0(𝑋𝑇) − 𝜔Var0,𝑥0(𝑋𝑇) − 𝑉0(𝑥0)

= 1

4𝜔[1 − ∏𝑇−1𝑘=0(1 − 𝑟𝑒𝑘󸀠𝐸−1(𝑅𝑒𝑘𝑅𝑒𝑘󸀠) 𝑟𝑒𝑘)

𝑇−1𝑘=0(1 − 𝑟𝑘𝑒󸀠𝐸−1(𝑅𝑒𝑘𝑅𝑒𝑘󸀠) 𝑟𝑒𝑘)

𝑇−1

𝑘=0

𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒]

= 1 4𝜔

1 − ∏𝑇−1𝑘=0(1/ (1 + 𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒))

𝑇−1𝑘=0(1/ (1 + 𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒))

− 1 4𝜔

𝑇−1

𝑘=0

𝑟𝑒𝑘󸀠(𝜎𝑘)−1𝑟𝑒𝑘

= 1 4𝜔

𝑇−1

𝑘=0

(1 + 𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒)

− 1 4𝜔

𝑇−1

𝑘=0

𝑟𝑒𝑘󸀠(𝜎𝑘)−1𝑟𝑒𝑘− 1 4𝜔

= 1

4𝜔[ (1 + 𝑟𝑒0󸀠(𝜎0)−1𝑟0𝑒) ⋅ ⋅ ⋅ (1 + 𝑟𝑒𝑇−1󸀠(𝜎𝑇−1)−1𝑟𝑇−1𝑒 )

𝑇−1

𝑘=0

𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒− 1] ≥ 0.

(31)

This proof gives an important inequality needed in the later analysis as

1 − ∏𝑇−1𝑘=0(1 − 𝑟𝑘𝑒󸀠𝐸−1(𝑅𝑒𝑘𝑅𝑒𝑘󸀠) 𝑟𝑘𝑒)

𝑇−1𝑘=0(1 − 𝑟𝑘𝑒󸀠𝐸−1(𝑅𝑘𝑒𝑅𝑒𝑘󸀠) 𝑟𝑘𝑒) −𝑇−1

𝑘=0

𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒

= (1 + 𝑟0𝑒󸀠(𝜎0)−1𝑟0𝑒) ⋅ ⋅ ⋅ (1 + 𝑟𝑒𝑇−1󸀠(𝜎𝑇−1)−1𝑟𝑇−1𝑒 )

𝑇−1

𝑘=0

𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒− 1 ≥ 0.

(32)

Lemma 3shows that the precommitment value function is greater than the equilibrium value function. This is a fair game of God. The Nash equilibrium strategy gains the time consistency but at the same time destroys the welfare of the whole decision procedure because of the inability to precommit. Referring to the proof ofLemma 3, we also realize that the distance between these two value functions is amplified at longer time horizon. Specially, when𝑇 = 1, these two value functions coincide with each other. When 𝑇 = 1, the time-consistent results should be and are actually the same as the precommitment ones.

3.2.2. About the Investment Strategy. The time-consistent strategy at each period is

̂𝜋𝑛= 1 2𝜔

1

𝑇−1𝑘=𝑛+1𝑟𝑓𝑘(𝜎𝑛)−1𝑟𝑛𝑒, 𝑛 = 0, 1, . . . , 𝑇 − 1. (33)

Referring to [2], the precommitment strategy at each period is

𝜋𝑛 = − 𝑟𝑓𝑛𝐸−1(𝑅𝑒𝑛𝑅𝑒𝑛󸀠) 𝑟𝑛𝑒𝑥𝑛 + [𝑇−1

𝑘=0

𝑟𝑘𝑓𝑥0+ 1

2𝜔 (∏𝑇−1𝑘=0(1 − 𝑟𝑘𝑒𝐸−1(𝑅𝑒𝑘𝑅𝑒𝑘󸀠) 𝑟𝑘𝑒))]

× 𝑇−1

𝑘=𝑛+1

(1

𝑟𝑘𝑓) 𝐸−1(𝑅𝑒𝑛𝑅𝑒𝑛󸀠) 𝑟𝑛𝑒,

𝑛 = 0, 1, . . . , 𝑇 − 1.

(34)

The significant differences between̂𝜋and𝜋are as follows.

(a) Since the time-consistent strategy at time𝑛will not be affected by the initial information, then it has nothing to do with the initial wealth𝑥0in contrast with precommitment strategy.

(b) the time consistent is time deterministic but the precommitment one is stochastically dependent on the current wealth.

3.2.3. About the Efficient Frontier. In this Section, we want to compare our efficient frontier with the one in [2]. But first of all, we need the results inLemma 4.

(7)

Lemma 4. Under the time-consistent strategy(16), 𝐸 (𝑋𝑛̂𝜋) = 𝑥0𝑛−1

𝑘=0

𝑟𝑓𝑘 + 1 2𝜔

1

𝑇−1𝑘=𝑛𝑟𝑘𝑓

𝑛−1

𝑘=0

𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑒𝑘, (35)

𝐸 [(𝑋̂𝜋𝑛)2]

= 𝑥0𝑛−1

𝑘=0

(𝑟𝑓𝑘)2+ 1 𝜔

𝑛−1𝑘=0𝑟𝑒𝑘󸀠(𝜎𝑘)−1𝑟𝑘𝑒

𝑇−1𝑘=𝑛(𝑟𝑘𝑓)2 𝑥0𝑇−1

𝑘=0

𝑟𝑘𝑓

+ 1 4𝜔2

1

𝑇−1𝑘=𝑛(𝑟𝑘𝑓)2

𝑛−1

𝑘=0

𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒

× [1 + 𝑘−1

𝑚=0𝑟𝑒𝑚󸀠(𝜎𝑚)−1𝑟𝑒𝑚+ ∑𝑘

𝑚=0𝑟𝑚𝑒󸀠(𝜎𝑚)−1𝑟𝑚𝑒] . (36)

Proof. Substituting (16) into (1) yields 𝑋𝑛̂𝜋= 𝑋𝑛−1̂𝜋 𝑟𝑓𝑛−1+ 1

2𝜔 1

𝑇−1𝑘=𝑛𝑟𝑓𝑘

× 𝑅𝑒𝑛−1󸀠(𝜎𝑛−1)−1𝑟𝑒𝑛−1, 𝑛 = 1, 2, . . . , 𝑇, (37)

and for𝑛 = 1, 2, . . . , 𝑇, (𝑋𝑛̂𝜋)2= (𝑋𝑛−1̂𝜋 )2(𝑟𝑓𝑛−1)2

+ 1

𝜔𝑋𝑛−1̂𝜋 𝑟𝑛−1𝑓 1

𝑇−1𝑘=𝑛𝑟𝑘𝑓𝑅𝑒𝑛−1󸀠(𝜎𝑛−1)−1𝑟𝑛−1𝑒 + 1

4𝜔2 1

𝑇−1𝑘=𝑛(𝑟𝑘𝑓)2𝑟𝑛−1𝑒 󸀠(𝜎𝑛−1)−1

× 𝑅𝑒𝑛−1𝑅𝑒𝑛−1󸀠(𝜎𝑛−1)−1𝑟𝑒𝑛−1.

(38)

Hence,

𝐸 (𝑋𝑛̂𝜋) = 𝐸 (𝑋̂𝜋𝑛−1) 𝑟𝑛−1𝑓 + 1

2𝜔 1

𝑇−1𝑘=𝑛𝑟𝑓𝑘𝑟𝑛−1𝑒 󸀠(𝜎𝑛−1)−1𝑟𝑛−1𝑒 , 𝑛 = 1, 2, . . . , 𝑇,

(39)

and for𝑛 = 1, 2, . . . , 𝑇, 𝐸 [(𝑋̂𝜋𝑛)2]

= 𝐸(𝑋𝑛−1̂𝜋 )2(𝑟𝑓𝑛−1)2+ 1

𝜔𝐸 (𝑋𝑛−1̂𝜋 ) 𝑟𝑛−1𝑓

× 1

𝑇−1𝑘=𝑛𝑟𝑘𝑓𝑟𝑛−1𝑒 󸀠(𝜎𝑛−1)−1𝑟𝑛−1𝑒 + 1

4𝜔2 1

𝑇−1𝑘=𝑛(𝑟𝑘𝑓)2𝑟𝑛−1𝑒 󸀠(𝜎𝑛−1)−1𝑟𝑛−1𝑒

× [ 1 + 𝑟𝑛−1𝑒 󸀠(𝜎𝑛−1)−1𝑟𝑒𝑛−1] .

(40)

By repeatedly using recursive equation (39), we can obtain (35). Substituting (35) into (40) yields

𝐸 [(𝑋𝑛̂𝜋)2] = 𝐸 [(𝑋̂𝜋𝑛−1)2] (𝑟𝑛−1𝑓 )2 + 1

𝜔

𝑟𝑛−1𝑒 󸀠(𝜎𝑛−1)−1𝑟𝑛−1𝑒

𝑇−1𝑘=𝑛(𝑟𝑓𝑘)2 𝑥0𝑇−1

𝑘=0

𝑟𝑘𝑓

+ 1 4𝜔2

𝑟𝑛−1𝑒 󸀠(𝜎𝑛−1)−1𝑟𝑛−1𝑒

𝑇−1𝑘=𝑛(𝑟𝑓𝑘)2

× [1 +𝑛−2

𝑘=0

𝑟𝑒𝑘󸀠(𝜎𝑘)−1𝑟𝑒𝑘+𝑛−1

𝑘=0

𝑟𝑒𝑘󸀠(𝜎𝑘)−1𝑟𝑘𝑒] . (41)

Repeatedly using the above recursive equation yields (36).

Then according to (35) and (36), we can obtain the efficient frontier under the time-consistent strategy.

Theorem 5. The efficient frontier under the time-consistent strategy(16)is

Var(𝑋𝑇̂𝜋) = [ 𝐸 (𝑋̂𝜋𝑇) − 𝑥0𝑇−1𝑘=0𝑟𝑓𝑘]2

𝑇−1𝑘=0𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒 . (42)

Proof. When𝑛 = 𝑇, (35) becomes

𝐸 (𝑋𝑇̂𝜋) = 𝑥0𝑇−1

𝑘=0

𝑟𝑘𝑓+ 1 2𝜔

𝑇−1

𝑘=0

𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑒𝑘. (43)

Equation (43), as we expect, coincides with the expression of 𝑔0(𝑥)in (18).

When𝑛 = 𝑇, (36) becomes

𝐸 [(𝑋𝑇̂𝜋)2]

= (𝑥0)2𝑇−1

𝑘=0

(𝑟𝑘𝑓)2+ 1 𝜔

𝑇−1

𝑘=0

𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒𝑥0𝑇−1

𝑘=0

𝑟𝑘𝑓

+ 1 4𝜔2

𝑇−1

𝑘=0

𝑟𝑘𝑒󸀠(𝜎𝑘)−1𝑟𝑘𝑒[1 +𝑘−1

𝑚=0𝑟𝑚𝑒󸀠(𝜎𝑚)−1𝑟𝑚𝑒 +∑𝑘

𝑚=0

𝑟𝑚𝑒󸀠(𝜎𝑚)−1𝑟𝑚𝑒]

参照

関連したドキュメント

We study the existence and uniqueness of weak solutions for a Cauchy problem of a viscous Burgers equation with a time dependent reaction term involving Dirac measure.. After applying

Podlubny, Fractional Differential Equations: An Introduction to Fractional Derivatives, Fractional Di ff erential Equations, to Methods of Their Solution and Some of Their

A Black-Scholes type model for American options will be considered where the underlying asset price experiences Brownian motion with random jumps.. The mathematical problems is

A problem of the first passage of a cumulative random process with generally distributed discrete or continuous increments over a fixed level is con- sidered in the article as

The author of [8] derived precise energy decay estimates for the initial-boundary value problem to the wave equation with a localized nonlinear dissipation which depended on the time

Ex- ponential decay rates for the solutions of Euler-Bernoulli equations with boundary dissipation occurring in the moments only was investigated by Lasiecka [11], and the

Under some assumptions, we show that the solution of the above problem quenches in a finite time and estimate its quenching time.. Finally, we give some numerical results to

Finally, we give an example to show how the generalized zeta function can be applied to graphs to distinguish non-isomorphic graphs with the same Ihara-Selberg zeta