東北大学機関リポジトリTOUR

(1)

capacity per unit cost

著者

日合文雄

journal or

publication title

Journal of mathematical physics

volume

48 number

9 page range

092102

year

2007

URL

http://hdl.handle.net/10097/46856

doi: 10.1063/1.2779138

(2)

Limit relation for quantum entropy and channel capacity

per unit cost

Imre Csiszára兲

Alfréd Rényi Institute of Mathematics, P.O. Box 127, H-1364 Budapest, Hungary Fumio Hiaib兲

Graduate School of Information Sciences, Tohoku University, Aoba-ku, Sendai 980-8579, Japan

Dénes Petzc兲

Alfréd Rényi Institute of Mathematics, P.O. Box 127, H-1364 Budapest, Hungary

共Received 10 July 2007; accepted 13 August 2007; published online 13 September 2007兲 In a thermodynamic model, Diósi et al. 关Int. J. Quantum Inf. 4, 99–104 共2006兲兴 arrived at a conjecture stating that certain differences of von Neumann entropies converge to relative entropy as system size goes to infinity. The conjecture is proven in this paper for density matrices. The analytic proof uses the quantum law of large numbers and the inequality between the Belavkin-Staszewski and Umegaki relative entropies. Moreover, the concept of channel capacity per unit cost is intro-duced for classical-quantum channels. For channels with binary input alphabet, this capacity is shown to equal the relative entropy. The result provides a second proof of the conjecture and a new interpretation. Both approaches lead to generalizations of the conjecture. © 2007 American Institute of Physics.

关DOI:10.1063/1.2779138兴

I. INTRODUCTION

It was conjectured by Diósi et al.4that the von Neumann entropy of a quantum state equal to a mixture,

Rnª

1

n共␴丢␳

丢共n−1兲₊_␳_丢_␴_丢_␳丢共n−2兲₊ _{¯ +}_␳丢共n−1兲_丢_␴_兲,

exceeds the entropy of a component asymptotically by the Umegaki relative entropy S共␴储␳兲, that is,

S共Rn兲 − 共n − 1兲S共␳兲 − S共␴兲 → S共␴储␳兲, 共1兲

as n→⬁. Here␳ and␴are density matrices acting on a finite dimensional Hilbert space. Recall that S共␴兲=−Tr␴log␴ and

S共␴储␳兲 =

再

Tr␴共log␴− log␳兲 if supp␴艋 supp␳

+⬁ otherwise.

冎

Concerning the background of quantum entropy quantities, we refer to Refs.12and14. The set of bounded linear operators on a Hilbert spaceH is denoted by B共H兲. When H is d dimensional, d finite, B共H兲 is identified as usual with the set Md共C兲 of d⫻d matrices with complex entries.

a兲_{Electronic mail: [email protected]} b兲_{Electronic mail: [email protected]} c兲_{Electronic mail: [email protected]}

JOURNAL OF MATHEMATICAL PHYSICS 48, 092102共2007兲

48, 092102-1

(3)

In Ref.4, a composite system consisting of n molecules has been considered, originally each in a quantum state␳, and interaction with environment changed the state of one molecule to␴. Irreversibility has been introduced via a completely positive mapM acting as

M共␴丢␳丢共n−1兲兲 =1

n共␴丢␳

丢共n−1兲₊_␳_丢_␴_丢_␳丢共n−2兲₊ _{¯ +}_␳丢共n−1兲_丢_␴_兲, _共2兲

interpreted as total randomization over the n subsystems共molecules兲. A thermodynamical argu-ment showed that the thermodynamical entropy of the system increased by S共␴储␳兲. This moti-vated the conjecture that the increase of the “informatic entropy,” given by the left-hand side of Eq.共1兲, also equals S共␴储␳兲, at least in the limit n→⬁.

The quantum formulation includes the case where both␳ and␴ are diagonal matrices. This will be referred to as the classical case. If␳ and␴commute, then in an appropriate basis both of them will be diagonal. Apparently, no exact proof of Eq. 共1兲 has been published even for the classical case, although for that case a heuristic proof was offered in Ref.4.

In this paper, first an analytic proof of Eq.共1兲 is given, see Theorem 1, using an inequality between the Umegaki and the Belavkin-Staszewski relative entropies and the law of large numbers in the quantum case. The idea is based on the identity

Rn=共␳1/2兲丢n

冉

1

n共X丢I

丢共n−1兲_{+ I}_丢_X_丢_I丢共n−2兲₊ _{¯ + I}丢共n−1兲_丢_X_兲

冊

_共_␳1/2_兲丢n

,

where X =␳−1/2_␴␳−1/2_{. The limit of the term in the middle can be computed by the}_{共quantum兲 law}

of large numbers. For readers not familiar with the required tools, the arguments are simplified to the classical case, where the ordinary law of large numbers is used, see Theorem 2.

In the second part of this paper, we recognize that S共Rn兲−共n−1兲S共␳兲−S共␴兲 is a particular

Holevo quantity or a quantum mutual information. The Holevo capacity of classical-quantum channels is well understood.5,8,10 Channel capacity per unit cost has been studied in classical information theory, see primarily Ref. 15, but not in quantum information theory. An indirect approach to capacity per unit cost is possible via the concept of capacity with constrained inputs, which is available for classical-quantum channels.7We take a direct approach which—as in the classical case15—appears preferable.

We will consider 共memoryless兲 classical-quantum channels with binary input alphabet X =兵0,1其 which assigns to 共classical兲 input sequences 共x1, x2, . . . , xn兲苸Xn output quantum states ␳x₁丢␳x₂丢¯丢␳x_n, where ␳0=␴ and␳1=␳, assuming that the cost of an input sequence is the

number of characters 0 in it. Considerations similar to Ref.15simultaneously provide a proof of Eq. 共1兲, and the result that the capacity per unit cost of the above channel equals S共␴储␳兲, see Theorems 3 and 4.

We note that our analytic proof of Eq.共1兲requires the assumption that supp␴艋supp␳, while the proof given in Sec. III does not共neither does the simplified version of the analytic proof to the classical case兲. It is remarkable that the two proofs lead to different generalizations of Eq.共1兲. The second proof is based on a purely information theoretic interpretation, nevertheless, the result of Theorem 3 admits also a thermodynamical interpretation as in Ref.4, see the discussion after the proof of Theorem 3.

II. AN ANALYTIC PROOF OF THE CONJECTURE

Assuming that supp␴艋supp␳ for the support projections of␴ and␳, one can simply com-pute

S共Rn储␳丢n兲 = Tr共Rnlog Rn− Rnlog␳丢n兲 = − S共Rn兲 − 共n − 1兲Tr␳log␳− Tr␴log␳.

(4)

S共Rn储␳丢n兲 = − S共Rn兲 + 共n − 1兲S共␳兲 + S共␴储␳兲 + S共␴兲,

holds. It follows that conjecture共1兲is equivalent to the statement

S共Rn储␳丢n兲 → 0 as n → ⬁,

when supp␴艋supp␳.

Recall the Belavkin-Staszewski relative entropy

SBS共␻储␳兲 = Tr共␻log共␻1/2␳−1␻1/2兲兲 = − Tr共␳␩共␳−1/2␻␳−1/2兲兲,

if supp␻艋supp␳, where ␩共t兲ª−t log t, see Refs. 1 and 12. 关The equality of the above two expressions is easily seen from the fact that Xf共X*_X_{兲= f共XX}*_{兲X for a matrix X and for a polynomial}

f.兴 It was proved by Hiai and Petz that

S共␻储␳兲艋 SBS共␻储␳兲, 共3兲

see Ref.6 or Proposition 7.11 in Ref.12.

Theorem 1: If supp␴艋supp␳, then S共Rn兲−共n−1兲S共␳兲−S共␴兲→S共␴储␳兲 as n→⬁.

Proof: We want to use the quantum law of large numbers, see Proposition 1.17 in Ref. 12. Assume that␳ and␴are d⫻d density matrices and we may suppose that␳ is invertible. Due to the GNS construction with respect to the limit␸_⬁of the product states ␸n on the n-fold tensor

product Md共C兲丢n, n苸N, defined by␸n共A兲=Tr␳丢nA, all finite tensor products Md共C兲丢n are

em-bedded into a von Neumann algebraM acting on a Hilbert space H. If␥ denotes the right shift and Xª␳−1/2_␴␳−1/2_{, then R} nis written as Rn=共␳1/2兲丢n

冉

1 n

兺

_i=0 n−1 ␥i_共X兲

冊

_共_␳1/2_兲丢n_. By inequality 共3兲, we get 0艋 S共Rn储␳丢n兲艋 SBS共Rn储␳丢n兲 = − Tr共␳丢n␩共共␳−1/2兲丢nRn共␳−1/2兲丢n兲兲 =

冓

⍀,␩

冉

1 n

兺

_i=0 n−1 ␥i_共X兲

冊

_⍀

冔

, 共4兲 where⍀ is the cyclic vector in the GNS construction.

The law of large numbers, see Proposition 1.17 in Eq. 共12兲, gives 1

n

兺

i=0 n−1

␥i_{共X兲 → I,}

in the strong operator topology in B共H兲, since␸共X兲=Tr␳␳−1/2_␴␳−1/2_{= 1.}

Since the continuous functional calculus preserves the strong convergence共simply due to the approximation by polynomials on a compact set兲, we obtain

␩

冉

1 n

兺

_i=0

n−1

␥i_共X兲

冊

_→_␩_{共I兲 = 0 strongly.}

This shows that upper bound共4兲converges to 0 and the proof is complete. 䊐

By the same proof, one can obtain that for

R_ᐉ,nª 1

n −ᐉ + 1共␴

丢ᐉ_丢_␳丢共n−ᐉ兲₊_␳_丢_␴丢ᐉ_丢_␳丢共n−ᐉ−1兲₊ _{¯ +}_␳丢共n−ᐉ兲_丢_␴丢ᐉ_兲,

the limit relation

(5)

S共R_ᐉ,n兲 − 共n − ᐉ兲S共␳兲 − ᐉS共␴兲 → ᐉS共␴储␳兲共5兲

holds as n→⬁ when ᐉ is fixed.

In the next theorem, we treat the classical case in a matrix language.The proof includes the case where supp␴艋supp␳is not true. Those readers who are not familiar with the tools used in the proof of the previous theorem are suggested to follow the arguments below.

Theorem 2: Assume that ␳and ␴are commuting density matrices. Then S共Rn兲 − 共n − 1兲S共␳兲 − S共␴兲 → S共␴储␳兲

as n→⬁.

Proof: We may assume that ␳= Diag共␮1, . . . ,␮ᐉ, 0 , . . . , 0兲 and␴= Diag共␭1, . . . ,␭d兲 are d⫻d

diagonal matrices,␮1, . . . ,␮ᐉ⬎0 and ᐉ⬍d. 共We may consider ␳,␴in a matrix algebra of bigger

size if ␳ is invertible.兲 If supp␴艋supp␳, then ␭_ᐉ+1=¯ =␭d= 0; this will be called the regular

case. When supp␴艋supp␳ is not true, we may assume that␭d⬎0 and we refer to the singular

case.

The eigenvalues of Rn correspond to elements共i1, . . . , in兲 of 兵1, ... ,d其n,

1

n共␭i1␮i2¯␮in+␮i1␭i2␮i3¯␮in+ ¯ +␮i1¯␮in−1␭in兲. 共6兲

We divide the eigenvalues in three different groups as follows: 共a兲 A corresponds to 共i1, . . . , in兲苸兵1, ... ,d其nwith 1艋i1, . . . , in艋ᐉ;

共b兲 B corresponds to 共i1, . . . , in兲苸兵1, ... ,d其nwhich contains exactly one d;

共c兲 C is the rest of the eigenvalues. If eigenvalue共6兲 is in group A, then it is

共␭i₁/␮i₁兲 + ¯ + 共␭i_n/␮i_n兲 n ␮i1␮i2¯␮in. First, we compute

兺

␬苸A␩共␬兲 =i₁

兺

,. . .,in ␩

冉

共␭i1/␮i1兲 + ¯ + 共␭in/␮in兲 n ␮i1¯␮in

冊

.

Below, the summations are over 1艋i1, . . . , in艋ᐉ,

兺

i₁,. . .,in ␩

冉

共␭i1/␮i1兲 + ¯ + 共␭in/␮in兲 n ␮i1¯␮in

冊

= −

兺

i₁,. . .,in

冉

共␭i₁/␮i₁兲 + ¯ + 共␭i_n/␮i_n兲 n ␮i1¯␮in

冊

log共␮i1¯␮in兲 + Qn = −1 n

兺

_k=1 n

冉

兺

i₁,. . .,in ␭i₁␮i₂¯␮i_nlog␮i_k+

兺

i₁,. . .,in ␭i₂␮i₁¯␮i_nlog␮i_k + ¯ +

兺

i₁,. . .,in ␭i_n␮i₁¯␮i_n−1log␮i_k

冊

+ Qn = −1 n

兺

k=1 n

冉

共n − 1兲

兺

i_k ␮i_klog␮i_k+

兺

i_k ␭i_klog␮i_k

冊

+ Qn=共n − 1兲S共␳兲 −

兺

i=1 ᐉ ␭ilog␮i+ Qn, where

(6)

Qnª

兺

i₁,. . .,in

共␮i₁¯␮i_n兲␩

冉

共␭i₁/␮i₁兲 + ¯ + 共␭i_n/␮i_n兲

n

冊

.

Note that Qnis equal to the expected value of ␩

冉

X1+ ¯ + Xn

n

冊

for independent and identically distributed random variables X₁, X₂, . . . defined on the product probability space共兵1,2, ... ,ᐉ其,共␮₁,␮₂, . . . ,␮_ᐉ兲兲N_{, where X}

ntakes the value␭i/␮ion a sequence in

兵1,2, ... ,ᐉ其N _{whose nth component is equal to i.}

By the strong law of large numbers,

X1+ ¯ + Xn n → E共X1兲 =

兺

i=1 ᐉ

冉

␭i ␮i

冊

␮i=

兺

i=1 ᐉ ␭i almost surely.

Since ␩共共X₁+¯ +Xn兲/n兲 is uniformly bounded, the Lebesgue bounded convergence theorem

implies that Qn→␩

冉

兺

i=1 ᐉ ␭i

冊

, as n→⬁.

In the regular case, all nonzero eigenvalues are in group A; hence,

S共Rn兲 − 共n − 1兲S共␳兲 − S共␴兲 = −

兺

i=1 ᐉ ␭ilog␮i+

兺

i=1 ᐉ ␭ilog␭i+ Qn= S共␴储␳兲 + Qn.

As兺_i=1ᐉ ␭i= 1 implies Qn→0, the statement follows.

Next, we consider the singular case, when the contributions of the eigenvalues in A is

兺

␬苸A␩共␬兲 = 共n − 1兲S共␳兲 + O共1兲,

and we turn to eigenvalues in B. If an eigenvalue corresponding to共i1, . . . , in兲苸兵1, ... ,d其n is in

group B and i1= d, then the eigenvalue is

1

n␭d␮i2¯␮in.

Summation of such eigenvalues gives

−

兺

i₂,. . .,in

冉

␭d␮i₂¯␮i_n n

冊

log

冉

␭d␮i₂¯␮i_n n

冊

= − ␭d n i₂,. . .,i

兺

n 共␮i₂¯␮i_n兲log共␮i₂¯␮i_n兲 − ␭d n log ␭d n =␭d n共n − 1兲S共␳兲 − ␭d n log ␭d n .

When ij= d for some 2艋 j艋n, we get the same quantity, so this should be multiplied with n,

兺

␬苸B␩共␬兲 = ␭d共n − 1兲S共␳兲 − ␭dlog

␭d n .

It follows that

(7)

S共Rn兲 − 共n − 1兲S共␳兲 − S共␴兲艌

兺

␬苸A␩共␬兲 +␬苸B

兺

␩共␬兲 − 共n − 1兲S共␳兲 − S共␴兲艌 ␭d共n − 1兲S共␳兲 + ␭dlog n

+ O共1兲 → + ⬁,

as n→⬁. 䊐

III. CHANNEL CAPACITY PER UNIT COST

A classical-quantum channel with classical input alphabetX transfers the input x苸X into the output W共x兲⬅␳xwhich is a density matrix acting on a Hilbert spaceK. We restrict ourselves to the

case whenX is finite and K is finite dimensional.

If a classical random variable X is chosen to be the input, with probability distribution P =兵p共x兲:x苸X其, then the corresponding output is the quantum state ␳Xª兺x苸Xp共x兲␳x. When a

measurement is performed on the output quantum system, it gives rise to an output random variable Y which is jointly distributed with the input X. If a partition of unity兵Fy: y苸Y其 in B共K兲

describes the measurement, then

Prob共Y = y兩X = x兲 = Tr␳xFy 共x 苸 X,y 苸 Y兲. 共7兲

The Holevo bound says that the classical mutual information

I共X ∧ Y兲 ª H共Y兲 − H共Y兩X兲 = H共X兲 − H共X兩Y兲

共expressed by the classical Shannon entropy H兲 satisfies9,10 I共X ∧ Y兲艋 I共X,W兲 ª S共␳X兲 −

兺

x苸X

p共x兲S共␳x兲. 共8兲

This bound is a simple consequence of the monotonicity of relative entropy under state transformation12,13but has been proved earlier than monotonicity. Here I共X,W兲 is called a Holevo quantity or a classical-quantum mutual information, and it satisfies the identity

兺

x_苸X

p共x兲S共␳x储␳兲 = I共X,W兲 + S共␳X储␳兲, 共9兲

where␳ is an arbitrary density matrix.

When the channel W :X→B共K兲 is used to transfer sequences x=共x₁, x₂, . . . , xn兲苸Xn in a

memoryless manner, a sequence x =共x1, x2, . . . , xn兲苸Xnis transferred into the quantum state

W丢n共x兲 =␳xª␳x₁丢␳x₂丢 ¯ 丢␳x_n. 共10兲

Formally, this defines a new channel W丢n:Xn→B共Kn兲 called the nth extension of W.

A code of block length n is defined by a subset An傺Xn called the code word set and by a

measurement 兵Fy: y苸Bn其 called the decoder. For technical convenience, the set Bn of possible

decoding results may be different from An, and only An傺Bn is required. The probability of error

is Prob共X⫽Y兲, where the input random variable X is uniformly distributed on Anand the output

random variable Y is as in Eq.共7兲 with x and y replaced by x and y.

The essential observation is that S共Rn兲−共n−1兲S共␳兲−S共␴兲 in the conjecture equals the Holevo

quantity I共X,W丢n_{兲 for the nth extension of the channel W with input alphabet X=兵0,1其,} _␳

0=␴,

␳1=␳ and with X uniformly distributed on those length-n binary sequences that contain exactly

one 0. More generally, we shall consider Holevo quantities

I共A,␳0,␳1兲 ª S

冉

1 兩A兩x

兺

_苸A ␳x

冊

− 1 兩A兩x

兺

_苸A S共␳x兲, 共11兲

(8)

The concept related to the conjecture we study is the channel capacity per unit cost which is defined next for simplicity only in the case where X=兵0,1其, the cost of a character 0苸X is 1, while the cost of 1苸X is 0. Given such channel and ␧⬎0, a number R⬎0 is called an ␧-achievable rate per unit cost if for every␦⬎0 and for any sufficiently large T there exists a code of block length n⬎T with at least eT共R−␦兲_{code words such that each of the code words contains at}

most T 0’s and the error probability is at most␧. The largest R which is an ␧-achievable rate per unit cost for every␧⬎0 is the channel capacity per unit cost.

The next theorem is our main result of this section.

Theorem 3: Let the classical-quantum channel W :X=兵0,1其→B共K兲 be defined by W共0兲

=␳0⬅␴and W共1兲=␳1⬅␳. Let An傺兵0,1其n, n = 1 , 2 , . . ., be sets such that for some natural number

ᐉ and for some real number c⬎0,

(a) each element x =共x1, x2, . . . , xn兲苸An contains at mostᐉ copies of 0;

(b) log兩An兩/log n→c as n→⬁; (c) c共An兲 ª 1 兩An兩x

兺

_苸A_n 兩兵i:xi= 0其兩 → c as n → ⬁. Then lim n_→⬁ I共An,␴,␳兲 = cS共␴储␳兲.

The proof of the theorem is divided into lemmas.

Lemma 1: For an arbitrary A傺兵0,1其n_,

I共A,␳0,␳1兲艋 c共A兲S共␳0储␳1兲

holds.

Proof: Let c共x兲ª兩兵i:xi= 0其兩 for x苸A. Since I共A,␳0,␳1兲=I共X,W丢n兲, we can use identity共9兲to

get an upper bound 1 兩A兩x

兺

_苸A S共␳x储␳1丢 n_{兲 =} 1 兩A兩x

兺

_苸A c共x兲S共␳0储␳1兲 = c共A兲S共␳0储␳1兲 for I共A,␳₀,␳₁兲. 䊐

Lemma 2: If A傺兵0,1其n _{is a code word set of a code whose probability of error does not} exceed a given 0⬍␧⬍1, then

共1 − ␧兲log兩A兩 − log 2 艋 I共A,␳0,␳1兲.

Proof: For the input and output random variables corresponding to the given code, the

clas-sical mutual information I共X∧Y兲 is bounded above by I共X,W丢n_兲=I共A,_␳

0,␳1兲, see Eq.共8兲. Since

the error probability Prob共X⫽Y兲 does not exceed ␧, the Fano inequality 共see, e.g., Ref.3兲 gives

H共X兩Y兲艋 ␧ log兩A兩 + log 2.

Therefore,

I共X ∧ Y兲 = H共X兲 − H共X兩Y兲艌共1 − ␧兲log兩A兩 − log 2,

and the proof is complete. 䊐

We need the direct part of the so-called quantum Stein lemma obtained in Ref. 6, see also Refs.2,5,11, and14.

Lemma 3: Given arbitrary density matrices␳and␴in B共K兲,␩⬎0, and 0⬍R⬍S共␴储␳兲, if N sufficiently large, there is a projection E苸B共K丢N_{兲 such that}

(9)

␣关E兴 ª Tr␴丢N_{共I − E兲 ⬍}_␩ and

␤关E兴 ª Tr␳丢N

E⬍ e−NR.

Here E 关or the measurement 共E,I−E兲兴 is interpreted as a test of the null hypothesis that the state is␴丢N_{, against the alternative hypothesis that it is}_␳丢N_{. This test incorrectly accepts the null}

hypothesis 共error of the first kind兲 with probability␣关E兴, and incorrectly rejects it 共error of the second kind兲 with probability␤关E兴.

Lemma 4: Assume that ␧⬎0, 0⬍R⬍S共␳₀储␳₁兲, and ᐉ is a positive integer. If n is large enough, then for any set Anof sequences x苸兵0,1其nthat contain at mostᐉ copies of 0, there exists a code with error probability smaller than ␧ whose code words are the N-fold repetitions xN

=共x,x, ... ,x兲 of the sequences x苸An, where N is the smallest integer satisfying

N艌 1 R log

2n ␧ .

Proof: We follow the probabilistic construction in Ref.15. The output states corresponding to input sequences of length nN are density matrices acting on the Hilbert spaceK丢Nn_⬅共K丢n_兲丢N_{. We}

decompose this Hilbert space into an N-fold tensor product in a different way. For each 1艋i 艋n, let Kibe the tensor product of the factors i , i + n , i + 2n , . . . , i +共N−1兲n. So K丢Nnis identified

withK1丢K2丢¯丢Kn.

We construct a decoder to the code word set in the lemma as follows. For each 1艋i艋n, we test the null hypothesis that the ith component of the actually chosen x苸An is 0, against the

alternative that it is 1, based on the channel outputs at time instances i , i + n , . . . , i +共N−1兲n. More exactly, let the projection Ei苸B共Ki兲 be a test of the null hypothesis␴丢n against the alternative ␳丢n_{. According to the quantum Stein lemma}_{共Lemma 3兲, applied with} ␩₌_{␧/2ᐉ and the given 0}

⬍R⬍S共␴储␳兲, for N sufficiently large, there exists a test Eisuch that the probability of error of the

first kind is smaller than ␩, while the probability of error of the second kind is smaller than e−NR_{艋␧/2n. The projections E}

iand I − Eiform a partition of unity in the Hilbert spaceKi, and the n-fold tensor product of these commuting projections will give a partition of unity inK丢Nn_{. For}

y苸兵0,1其n_{, set F}

yª丢i=1 n

Fy

i, where Fyi= Eiif yi= 0 and Fyi= I − Eiif yi= 1, and let the decoder be

the measurement 兵Fy: y苸兵0,1其n其. Thus, the result of the decoding will be an arbitrary 0–1

se-quence in兵0,1其n.

The error probability should be estimated, Prob共Y ⫽ X兩X = x兲 =

兺

y:y⫽x Tr␳x丢 N Fy=

兺

y:y⫽x

兿

i=1 n Tr␳x丢_i N Fy i艋

兺

i=1 n

兺

y:yi⫽xi

兿

j=1 n Tr␳x丢_j N Fy j 艋

兺

i=1 n Tr␳x丢_i N_{共I − F} x_i兲. If xi= 0, then Tr␳x丢_i N_{共I − F} x_i兲 = Tr␳0丢 N_{共I − E} i兲 =␣共Ei兲艋 ␧ 2ᐉ, and if xi= 1, Tr␳x丢_i N_{共I − F} x_i兲 = Tr␳1丢 N Ei=␤共Ei兲艋 e−RN艋 ␧ 2n.

As xi= 0 holds for at most ᐉ indices, it follows that the probability of error of this code is

(10)

Proof of Theorem 3: Since Lemma 1 gives the upper bound,

lim sup

n→⬁

I共An,␳0,␳1兲艋 cS共␴储␳兲,

it remains to prove that

lim inf

n→⬁

I共An,␳0,␳1兲艌 cS共␴储␳兲.

By Lemma 4, the set兵xN_{: x}_苸A

n其 with N given there is the code word set of a code with error

probability smaller than␧. According to Lemma 2 and Eq.共11兲, this implies 共1 − ␧兲log兩An兩 − log 2 艋 S共␳XN兲 −

1 兩An兩x

兺

苸A_n

S共␳xN兲,

where X is uniformly distributed on An and XNdenotes its N-fold repetition.

From the subadditivity of the von Neumann entropy, we have

S共␳XN兲艋 NS共␳X兲

and

S共␳xN兲 = NS共␳_x兲

holds due to the additivity for product. It follows that 共1 − ␧兲log兩An N − 1 N艋 S共␳X兲 − 1 兩An兩x

兺

_苸A_n S共␳x兲 = I共An,␳0,␳1兲.

From the choice of N in Lemma 4, we have

Rlog兩An

log n

log n + log 2 − log␧艋 log兩An

N ,

and the lower bound is arbitrarily close to cR. Since R⬍S共␳0储␳1兲 was arbitrary, the proof is

complete. 䊐

Assume that Anis the set of all x苸兵0,1其ncontaining exactlyᐉ 0’s for a fixed natural number

ᐉ. Then c共An兲=ᐉ and from the Stirling formula, one can easily check log兩An兩 /log n→ᐉ.

Conse-quently, Theorem 3 proves that

S共Rn共ᐉ兲兲 − 共n − ᐉ兲S共␳兲 − ᐉS共␴兲 → ᐉS共␴储␳兲共12兲

holds as n→⬁ when ᐉ is fixed and

Rn共ᐉ兲 ª

冉

n ᐉ

冊

−1

兺

x苸A_n ␳x₁丢␳x₂丢 ¯ 丢␳x_n 共␳0=␴,␳1=␳兲.

In particular, whenᐉ=1, conjecture共1兲 is proven in full generality. We have two generalizations 关Eqs.共5兲and共12兲兴 of Eq.共1兲, which are similar but different.

We note that Eq.共12兲admits a thermodynamical interpretation, analogous of that of Eq.共1兲in Ref. 4, sketched in the Introduction. Indeed, suppose that interaction with the environment changes the state not of 1 but ᐉ molecules to ␴ and irreversibility is introduced again by total randomization. The new state of the system will be Rn共ᐉ兲 above and Eq.共12兲says that “informatic

entropy” increases byᐉ times the relative entropy 共in the limit as system size goes to infinity兲.

Theorem 4: The capacity per unit cost of the channel with a binary input alphabet and

W共0兲=␳₀, W共1兲=␳₁is equal to the relative entropy S共␳₀储␳₁兲.

(11)

Proof: Assume that R⬎0 is an ␧-achievable rate per unit cost. For every␦⬎0 and T⬎0, there

is a code A傺兵0,1其n _{for which we get by Lemmas 1 and 2,}

TS共␳₀储␳₁兲艌 c共A兲S共␳₀储␳₁兲艌 I共A,␳₀,␳₁兲艌共1 − ␧兲log兩A兩− log 2 艌共1 − ␧兲T共R −␦兲 − log 2.

Since T is arbitrarily large and␧,␦ are arbitrarily small, R艋S共␳0储␳1兲 follows.

Let An be the set of x苸兵0,1其n containing exactly one 0, and consider the N-times repeated

code words given in Lemma 4. Then each of the n code words contains exactly N 0’s. For every

R⬍S共␳0储␳1兲 and ␧,␦⬎0, if N is chosen as in Lemma 4, we have

n艌␧ 2e NR =␧e N␦ 2 e N_共R−␦_兲_{⬎ e}N_共R−␦_兲

as long as n is so large that N satisfies␧eN␦_{/ 2}_{⬎1. This implies that R is an ␧-achievable rate per}

unit cost for every␧⬎0. Hence, the result follows. 䊐

ACKNOWLEDGMENTS

The first author 共I.C.兲 was partially supported by the Hungarian Research Grants OTKA T046376 and T068258. The second author 共F.H.兲 was partially supported by Grant-in-Aid for Scientific Research 共B兲17340043. The third author was partially supported by the Hungarian Research Grant T068258.

1_{Belavkin, V. P., and Staszewski, P., “C}*_{-algebraic generalization of relative entropy and entropy,” Ann. Inst. Henri}

Poincare, Sect. A 37, 51–58共1982兲.

2_{Bjelaković, I., Deuschel, J., Krüger, T., Seiler, R., Siegmund-Schultze, R., and Szkoła, A., “A quantum version of}

Sanov’s theorem,” Commun. Math. Phys. 260, 659–671共2005兲.

3_{Cover, T. M., and Thomas, J. A., Elements of Information Theory, 2nd ed.}_{共Wiley-Interscience, Hoboken, NJ, 2006兲.} 4_{Diósi, L., Feldmann, T., and Kosloff, R., “On the exact identity between thermodynamic and informatic entropies in a}

unitary model of friction,” Int. J. Quantum Inf. 4, 99–104共2006兲.

5_{Hayashi, M., Quantum Information: An Introduction}_{共Springer, New York, 2006兲.}

6_{Hiai, F., and Petz, D., “The proper formula for relative entropy and its asymptotics in quantum probability,” Chem.}

Mater. 143, 99–114共1991兲.

7_{Holevo, A. S., e-print arXiv:quant-ph/9705054.}

8_{Holevo, A. S., “Quantum coding theorems,” Russ. Math. Surveys 53, 1295–1331}_共1998兲.

9_{Holevo, A. S., “Some estimates for the amount of information transmittable by a quantum communication channel}_共in

Russian兲,” Problemy Peredachi Informacii 9, 3–11 共1973兲.

10_{Nielsen, M. A., and Chuang, I. L., Quantum Computation and Quantum Information}_{共Cambridge University Press,}

Cambridge, 2000兲.

11_{Ogawa, T., and Nagaoka, H., “Strong converse and Stein’s lemma in quantum hypothesis testing,” IEEE Trans. Inf.}

Theory 46, 2428–2433共2000兲.

12_{Ohya, M., and Petz, D., Quantum Entropy and its Use}_{共Springer, New York, 1993兲.}

13_{Ohya, M., Petz, D., and Watanabe, N., “On capacities of quantum channels,” Probab. Math. Stat. 17, 179–196}_共1997兲. 14_{Petz, D., Quantum Information Theory and Quantum Statistics, to be published.}