Volume 2009, Article ID 323615,15pages doi:10.1155/2009/323615
Research Article
On Some Improvements of the Jensen Inequality with Some Applications
M. Adil Khan,
1M. Anwar,
1J. Jak ˇseti ´c,
2and J. Pe ˇcari ´c
1, 31Abdus Salam School of Mathematical Sciences, GC University, 5400 Lahore, Pakistan
2Faculty of Mechanical Engineering and Naval Architecture, University of Zagreb, 1000 Zagreb, Croatia
3Faculty of Textile Technology, University of Zagreb, 1000 Zagreb, Croatia
Correspondence should be addressed to M. Adil Khan,[email protected] Received 23 April 2009; Accepted 10 August 2009
Recommended by Sever Silvestru Dragomir
An improvement of the Jensen inequality for convex and monotone function is given as well as various applications for mean. Similar results for related inequalities of the Jensen type are also obtained. Also some applications of the Cauchy mean and the Jensen inequality are discussed.
Copyrightq2009 M. Adil Khan et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1. Introduction
The well-known Jensen’s inequality for convex function is given as follows.
Theorem 1.1. IfΩ,A, μis a probability space and iff ∈L1μis such thata≤ ft ≤bfor all t∈Ω, −∞ ≤a < b≤ ∞,
φ
Ωftdμt
≤
Ωφ ft
dμt 1.1
is valid for any convex functionφ :a, b → R. In the case whenφis strictly convex ona, bone has equality in1.1if and only iffis constant almost everywhere onΩ.
Here and in the whole paper we suppose that all integrals exist. By considering the difference of 1.1for functional in 1Anwar and Peˇcari´c proved an interesting result of log-convexity. We can define this result for integrals as follows.
Theorem 1.2. LetΩ,A, μbe a probability space andf ∈L1μis such thata≤ ft ≤bfor all t∈Ω, −∞ ≤a < b≤ ∞. Define
Λs
⎧⎪
⎪⎪
⎪⎪
⎪⎪
⎪⎪
⎨
⎪⎪
⎪⎪
⎪⎪
⎪⎪
⎪⎩ 1 ss−1
Ω
fts dμt−
Ωftdμt s
, s /0,1,
log
Ωftdμt
−
Ωlog ft
dμt, s 0,
Ω
ft log
ft dμt−
Ωftdμt
log
Ωftdμt
, s 1,
1.2
and letΛsbe positive. ThenΛsis log-convex, that is, for−∞< r < s < u <∞,the following is valid
Λsu−r ≤Λru−sΛus−r. 1.3
The following improvement of1.1was obtained in2.
Theorem 1.3. Let the conditions ofTheorem 1.1be fulfilled. Then
Ωφ ft
dμt−φ
Ωftdμt
≥
Ω
φ ft
−φ
fdμt−φ f
Ω
ft−fdμt ,
1.4
whereφxrepresents the right-hand derivative ofφand
f
Ωftdμt. 1.5
Ifφis concave, then left-hand side of 1.4should beφ
Ωftdμt−
Ωφftdμt.
In this paper, we give another proof and extension of Theorem 1.2 as well as improvements ofTheorem 1.3for monotone convex function with some applications. Also we give applications of the Jensen inequality for divergence measures in information theory and related Cauchy means.
2. Another Proof and Extension of Theorem 1.2
In fact,Theorem 1.2forΩ a, band 0 < r < s < u, r, s, u /1 was first of all initiated by Simi´c in3.
Moreover, in his proof, he has used convex functions defined onI −∞,0∪0,1∪ 1,∞ see3, Theorem 1. In his proof, he has used the following function:
λx v2 xs
ss−12vw xr
rr−1w2 xu
uu−1, 2.1
wherer su/2 andv, w, r, s, uare real withr, s, t∈I.
In1we have given correct proof by using extension of2.1, so that it is defined on R.
Moreover, we can give another proof so that we use only 2.1 but without using convexity as in3.
Proof ofTheorem 1.2. Consider the functionλxdefined, as in3, by2.1.
Now
λx
vxs/2−1wxu/2−12
≥0, forx >0, 2.2
that is,λxis convex. By using1.1we get
v2Λs2vwΛrw2Λu≥0. 2.3
Therefore,2.3is valid for alls, r, u∈I. Now since left-hand side of2.3is quadratic form, by the nonnegativity of it, one has
Λ2su/2 Λ2r ≤ΛsΛu. 2.4
Since we have lims→0Λs Λ0and lims→1Λs Λ1, we also have that2.4is valid forr, s, u∈ R. Sos →Λsis log-convex function in the Jensen sense onR.
Moreover, continuity of Λs implies log-convexity, that is, the following is valid for
−∞< r < s < u <∞:
Λsu−r ≤Λru−sΛus−r. 2.5
Let us note that it was used in4to get corresponding Cauchy’s means. Moreover, we can extend the above result.
Theorem 2.1. Let the conditions of Theorem 1.2 be fulfilled and let pi i 1,2, . . . , n be real numbers. Then
Λpij
k≥0 k 1,2, . . . , n, 2.6
where|aij|kdefine the determinant of orderkwith elementsaijandpij pipj/2.
Proof. Consider the function
fx n
i,j 1
uiuj xpij pij
pij−1 2.7
forx >0 andui∈Randpij∈I.
So, it holds that
fx n
i,j 1
uiujxpij−2 n
i 1
uixpi/2−1 2
≥0. 2.8
Sofxis convex function, and as a consequence of1.1, one has n
i,j 1
uiujΛpij ≥0. 2.9
Therefore,Λpij aijdenote then×nmatrix with elementsaijis nonnegative semi definite and2.6is valid forpij∈I. Moreover, since we have continuity ofΛpijfor allpij,2.6is valid for allpi∈Ri 1,2, . . . , n.
Remark 2.2. InTheorem 2.1, if we setn 2,we getTheorem 1.2.
3. Improvements of the Jensen Inequality for Monotone Convex Function
In this section and in the following section, we denotex n
i 1pixiandPI
i∈Ipi.
Theorem 3.1. IfΩ,A, μis a probability space and iff∈L1μis sucha≤ft≤bfort∈Ω,and ifft≥ffort∈Ω⊂Ω(Ωis measurable, i.e.,Ω∈A),−∞< a < b≤ ∞,then
Ωφ ft
dμt−φ
Ωftdμt
≥
Ωsgn
ft−f φ
ft
−φ f
ft
dμt φ
f
−fφ
f
1−2μ Ω,
3.1
where
f
Ωftdμt, 3.2 for monotone convex functionφ : a, b → R. Ifφis monotone concave, then the left-hand side of 3.1should beφ
Ωftdμt−
Ωφftdμt.
Proof. Consider the case whenφis nondecreasing ona, b. Then
Ω
φ ft
−φ
fdμt
Ω
φ
ft
−φ f
dμt
Ω\Ω
φ
f
−φ
ft
dμt
Ωφ ft
dμt−
Ω\Ωφ ft
dμt−φ f
μ Ω
φ f
μ Ω\Ω
Ωsgn
ft−f φ
ft
dμt φ f
μ Ω\Ω
−μ Ω
.
3.3
Similarly,
Ω
ft−fdμt
Ωsgn
ft−f
ftdμt f μ
Ω\Ω
−μ Ω
. 3.4
Now from1.4,3.3, and3.4we get3.1.
The case whenφis nonincreasing can be treated in a similar way.
Of course a discrete inequality is a simple consequence ofTheorem 3.1.
Theorem 3.2. Letφ:a, b → Rbe a monotone convex function,xi∈a, b, pi >0, n
i 1pi 1.
Ifxi≥xfori∈I ⊂ {1,2, . . . , n} In, then
n i 1
piφxi−φ n
i 1
pixi
≥
n i 1
pisgnxi−x
φxi−xiφx
φx−xφx
1−2PI .
3.5
Ifφis monotone concave, then the left-hand side of 3.5should be
φ n
i 1
pixi
−n
i 1
piφxi. 3.6
The following improvement of the Hermite-Hadamard inequality is valid5.
Corollary 3.3. Letφ:a, b → Rbe a differentiable convex. Then ithe inequality
1 b−a
b
a
φtdt−φ ab
2
≥
1 b−a
b
a
φt−φ ab
2 dt
− b−a
4 φ
ab 2
3.7
holds.
If φ is differentiable concave, then the left-hand side of 3.7 should beφab/2− 1/b−ab
aφtdt;
iiif φ is monotone, then the inequality
1 b−a
b
a
φtdt−φ ab
2
≥
1 b−a
b
a
sgn
t−ab 2
φt−tφ ab
2
dt
3.8
holds. Ifφis differentiable and monotone concave then the left-hand side of 3.8should be φab/2−1/b−ab
aφtdt.
Proof. iSettingΩ a, b, ft t, dμt dt/b−ain1.4, we get3.7.
iiSettingft t, dμt dt/b−a, andΩ a, bin3.1, we get3.8.
4. Improvements of the Levinson Inequality
Theorem 4.1. If the third derivative offexist and is nonnegative, then for 0< xi < a, pi >01 ≤ i≤n, n
i 1pi 1 andPk k
i 1pi 2≤k≤n−1one has i
n i 1
pif2a−xi−f2a−x−n
i 1
pifxi fx
≥
n i 1
pif2a−xi−fxi−f2a−x fx
−f2a−x fxn
i 1
pi|xi−x|
,
4.1
iiifφx f2a−x−fxis monotone andxi≥xfori∈I⊂ {1,2, . . . , n} In, then
n i 1
pif2a−xi−f2a−x−n
i 1
pifxi fx
≥
n i 1
pisgnxi−x
f2a−xi−fxi xi
f2a−x fx
f2a−x−fx x
f2a−x fx
1−2PI .
4.2
Proof. iAs for 3-convex functionf :0,2a → Rthe functionφx f2a−x−fxis convex on0, a, so by settingφ f2a−x−fxin the discrete case of2, Theorem 2, we get4.1.
iiAsf2a−x−fxis monotone convex, so by settingφ f2a−x−fxin3.5, we get5.16.
Ky Fan Inequality
Letxi∈0,1/2be such thatx1≥x2 ≥ · · · ≥xk≥x≥xk1· · · ≥xn. We denoteGkandAk, the weighted geometric and arithmetic means, respectively, that is,
Ak 1 Pk
k
i 1
pixi
x, Gk
k
i 1
xipi 1/Pk
, 4.3
and also byAkandGk, the arithmetic and geometric means of 1−xi,respectively, that is,
Ak 1 Pk
k i 1
pi1−xi 1−Ak, Gk
k
i 1
1−xipi 1/Pk
. 4.4
The following remarkable inequality, due to Ky Fan, is valid6, page 5,
Gn Gn ≤ An
An, 4.5
with equality sign if and only ifx1 x2 · · · xn.
Inequality4.5has evoked the interest of several mathematicians and in numerous articles new proofs, extensions, refinements and various related results have been published 7.
The following improvement of Ky Fan inequality is valid2.
Corollary 4.2. LetAn, GnandAn, Gnbe as defined earlier. Then, the following inequalities are valid i
An/An
Gn/Gn ≥exp
n i 1
pi ln
1−xiAn
xiAn
− 1 AnAn
n i 1
pi|xi−An|
, 4.6
ii
An/An Gn/Gn ≥exp
2Pk
ln
GkAn GkAn
Ak−An
AnAn
ln GnAn
AnGn
. 4.7
Proof. iSettinga 1/2,fx lnxin4.1, we get4.6.
iiConsidera 1/2 andfx lnx,thenφx ln1−x−lnxis strictly monotone convex on the interval0,1/2and has derivative
φx − 1
xx−1. 4.8
Then the application of inequality4.2to this function is given by n
i 1
piln1−xi
xi −ln1−x x
≥
n i 1
pisgnxi−x
ln1−xi
xi xi
x1−x
ln1−x
x 1
1−x
1−2Pk .
4.9
From4.9we get4.7.
5. On Some Inequalities for Csisz ´ar Divergence Measures
LetΩ,A, μbe a measure space satisfying|A|>2 andμaσ-finite measure onΩwith values inR∪ {∞}. LetP be the set of all probability measures on the measurable spaceΩ,Awhich are absolutely continuous with respect toμ. ForP, Q ∈ P, let p dP/dμand q dQ/dμ denote the Radon-Nikodym derivatives ofPandQwith respect toμ,respectively.
Csisz´ar introduced the concept off-divergence for a convex function, f : 0,∞ →
−∞,∞that is continuous at 0 as followscf.8, see also9.
Definition 5.1. LetP, Q∈P. Then
IfQ, P
Ωpsf qs
ps
dμs, 5.1
is called thef-divergence of the probability distributionsQandP.
We give some important f-divergences, playing a significant role in Information Theory and statistics.
iThe class of χ-divergences: thef-divergences, in this class, are generated by the family of functions:
fαu |u−1|α u≥0, α≥1, IfαQ, P
Ωp1−αs|qs−ps|αdμs. 5.2 Forα 1, it gives the total variation distance:
VQ, P
Ω
qs−psdμs. 5.3
Forα 2, it gives the Karl pearsonχ2-divergence:
Iχ2Q, P
Ω
qs−ps2
ps dμs. 5.4
iiTheα-order Renyi entropy: forα∈R\ {0,1}, let
ft tα, t >0. 5.5
ThenIfgivesα-order entropy
DαQ, P
Ωqαsp1−αsdμs. 5.6 iiiHarmonic distance: let
ft 2t
1t, t >0. 5.7
ThenIfgives Harmonic distance
DHQ, P
Ω
2psqs
ps qsdμs. 5.8
ivKullback-Leibler: let
ft tlogt, t >0. 5.9
Thenf-divergence functional gives rise to Kullback-Leibler distance10
DKLQ, P
Ωqslog qs
ps
dμs. 5.10
The one parametric generalization of the Kullback-Leibler10relative information studied in a different way by Cressie and Read11.
v The Dichotomy class: this class is generated by the family of functions gα : 0,∞ → R,
gαu
⎧⎪
⎪⎪
⎪⎨
⎪⎪
⎪⎪
⎩
u−1−logu, α 0,
1
α1−ααu1−α−uα, α∈R\ {0,1},
1−uulogu, α 1.
5.11
This class gives, for particular values of α, some important divergences. For instance, for α 1/2,we have Hellinger distance and some other divergences for this class are given by
IgαQ, P
⎧⎪
⎪⎪
⎪⎪
⎨
⎪⎪
⎪⎪
⎪⎩
Q−PDKLP, Q, α 0, αQ−P P−DαQ, P
α1−α , α∈R\ {0,1}, DKLQ, P P−Q, α 1,
5.12
wherepxandqxare positive integrable functions with
Ωpsdμs P,
Ωqsdμs Q.
There are various other divergences in Information Theory and statistics such as Arimoto-type divergences, Matushita’s divergence, Puri-Vincze divergences cf. 12–14 used in various problems in Information Theory and statistics. An application ofTheorem 1.1 is the following result given by Csisz´ar and K ¨ornercf.15.
Theorem 5.2. Letf :0,∞ → Rbe convex, and letpandqbe positive integrable function with
Ωpsdμs P,
Ωqsdμs Q. Then the following inequality is valid:
IfP, Q≥Qf P
Q
, 5.13
whereIfP, Q
Ωqsfps/qsdμs.
Proof. By substitutingφs fs, fs ps/qsanddμs qsdμsinTheorem 1.1 we get5.13.
Similar consequence of Theorems 1.2and 2.1in information theory for divergence measures discussed above is the following result.
Theorem 5.3. Letpandqbe positive integrable functions with
Ωpsdμs P,
Ωqsdμs Q. Define the function
Φt
⎧⎪
⎪⎪
⎪⎪
⎪⎪
⎨
⎪⎪
⎪⎪
⎪⎪
⎪⎩ 1 t1−t
PtQ1−t−DtP, Q
, t /0,1, DKLQ, P QlogP
Q, t 0,
DKLP, Q PlogQ
P, t 1,
5.14
and letΦtbe positive. Then iit holds that
Φpij
k≥0 k 1,2, . . . , n, 5.15
where|aij|kdefine the determinant of ordernwith elementsaijandpij pipj/2, ii Φtis log-convex.
As we said in 4 we define new means of the Cauchy type, here we define an application of these means for divergence measures in the following definition.
Definition 5.4. Let p and q be positive integrable functions with
Ωpsdμs P,
Ωqsdμs Q. The mean Ms,tis defined as
Ms,t Φs
Φt
1/s−t
, s /t /0,1,
Ms,s exp
PsQ1−slogP/Q−DsP, Q
PsQ1−s−DsP, Q − 1−2s s1−s
, s /0,1,
M0,0 exp Q
logP/Q2−D0P, Q 2
QlogP/Q−D0P, Q1
,
5.16
whereD0P, Q
Ωqslogps/qsdμsandD0P, Q
Ωqslogps/qs2dμs,
M1,1 exp Q
logP/Q2
−D1P, Q 2
PlogP/Q−D1P, Q−1
, 5.17
whereD1P, Q
Ωpslogqs/psdμsandD1P, Q
Ωpslogqs/ps2dμs.
Theorem 5.5. Letr, s, t, ube nonnegative reals such thatr ≤t, s≤u,then
Mr,t≤Ms,u. 5.18
Proof. By using log convexity of Φt,we get the following result forr, s, t, u ∈ R such that r≤t, s≤uandr /s, t /u
Φs
Φr
1/s−r
≤ Φu
Φt
1/u−t
. 5.19
Also forr s, t u,we consider limiting case and the result follows from continuity of Ms,u.
An application ofTheorem 1.3in divergence measure is the following result given in 16.
Theorem 5.6. Letf :I ⊆R → Rbe differentiable convex function onIo, then
IfP, Q−Qf P
Q
≥
IfP, Q−Qf P
Q
−fP/Q
Q Q
, 5.20
where
Q
Ω
Qps−P qsdμs. 5.21
Proof. By substitutingφs fs, fs ps/qs,anddμs →qsdμsinTheorem 1.3, we get5.20.
Theorem 5.7. Let f : I ⊆ R → R be differentiable monotone convex function on Ioand let ps/qs> P/Q fors∈Ω⊂Ω
IfP, Q−Qf P
Q
≥
Ωsgn ps
qs− P Q
f
ps qs
qsdμs
−f P
Q
Ωsgn ps
qs− P Q
psdμs
Q
f P
Q
− P Q f
P Q
1 − 2Q Q
,
5.22
where
Q
Ωqsdμs, 5.23
andΩas inTheorem 5.7.
Proof. By substituting φs fs, fs ps/qs and dμs → qsdμs in
Theorem 3.1iiwe get5.22.
Corollary 5.8. It holds that DHαP, Q− 2P Q
PQ ≥
Ωsgn ps
qs − P Q
2psqs
ps qsdμs
− 2Q2 PQ2
Ωsgn ps
qs− P Q
psdμs
2P Q
PQ− 2P Q2 PQ2
1−2Q
Q
,
5.24
where
Q
Ωqsdμs, 5.25
andΩas inTheorem 5.7.
Proof. The proof follows by settingft 2t/1t, t >0 inTheorem 5.7.
Corollary 5.9. Letgα:R → Rbe as given in5.11, then iforα 0 one has
DKLQ, P Qlog P
Q
≥
Ωsgn ps
qs− P Q
ps
qs−1−log ps
qs
qsdμs
−P−Q P
Ωsgn ps
qs− P Q
psdμs Q log P
Q
1− 2Q Q
,
5.26
iiforα∈R\ {0,1}one has
PαQ1−α−DαP, Q
α1−α ≥
Ωsgn ps
qs− P Q
αps
qs 1−α−psαqs−α
qsdμs
−α
1−Pα−1Q1−α
Ωsgn ps
qs− P Q
psdμs
αP Q−αQP Q1−α−αP/Q
1−P1−α Q1−α
1−2Q/Q
α1−α ,
5.27
iiiforα 1 one has
DKLP, Q P Qlog
P Q
≥
Ωsgn ps
qs− P Q
1−ps
qsps
qslogps qs
qsdμs
−log P
Q
Ωsgn ps
qs − P Q
psdμs
Q−P
1−2Q Q
,
5.28
where
Q
Ωqsdμs, 5.29
and Ωas inTheorem 5.7.
Proof. The proof follows be settingf gαto be as given in5.11, inTheorem 3.1.
Acknowledgments
This research work is funded by the Higher Education Commission Pakistan. The research of the fourth author is supported by the Croatian Ministry of Science, Education and Sports under the Research Grants 117-1170889-0888.
References
1 M. Anwar and J. Peˇcari´c, “On logarithmic convexity for differences of power means and related results,” Mathematical Inequalities & Applications, vol. 12, no. 1, pp. 81–90, 2009.
2 S. Hussain and J. Peˇcari´c, “An improvement of Jensen’s inequality with some applications,” Asian- European Journal of Mathematics, vol. 2, no. 1, pp. 85–94, 2009.
3 S. Simi´c, “On logarithmic convexity for differences of power means,” Journal of Inequalities and Applications, vol. 2007, Article ID 37359, 8 pages, 2007.
4 M. Anwar and J. Peˇcari´c, “New means of Cauchy’s type,” Journal of Inequalities and Applications, vol.
2008, Article ID 163202, p. 10, 2008.
5 S. S. Dragomir and A. McAndrew, “Refinements of the Hermite-Hadamard inequality for convex functions,” Journal of Inequalities in Pure and Applied Mathematics, vol. 6, no. 2, article 140, 6 pages, 2005.
6 H. Alzer, “The inequality of Ky Fan and related results,” Acta Applicandae Mathematicae, vol. 38, no. 3, pp. 305–354, 1995.
7 E. F. Beckenbach and R. Bellman, Inequalities, vol. 30 of Ergebnisse der Mathematik und ihrer Grenzgebiete, N. F., Springer, Berlin, Germany, 1961.
8 I. Csisz´ar, “Information measures: a critical survey,” in Transactions of the 7th Prague Conference on Information Theory, Statistical Decision Functions and the 8th European Meeting of Statisticians, pp. 73–86, Academia, Prague, Czech Republic, 1978.
9 M. C. Pardo and I. Vajda, “On asymptotic properties of information-theoretic divergences,” IEEE Transactions on Information Theory, vol. 49, no. 7, pp. 1860–1868, 2003.
10 S. Kullback and R. A. Leibler, “On information and sufficiency,” Annals of Mathematical Statistics, vol.
22, pp. 79–86, 1951.
11 P. Cressie and T. R. C. Read, “Multinomial goodness-of-fit tests,” Journal of the Royal Statistical Society.
Series B, vol. 46, no. 3, pp. 440–464, 1984.
12 P. Kafka, F. ¨Osterreicher, and I. Vincze, “On powers off-divergences defining a distance,” Studia Scientiarum Mathematicarum Hungarica, vol. 26, no. 4, pp. 415–422, 1991.
13 F. Liese and I. Vajda, Convex Statistical Distances, vol. 95 of Teubner Texts in Mathematics, BSB B. G.
Teubner Verlagsgesellschaft, Leipzig, Germany, 1987.
14 F. ¨Osterreicher and I. Vajda, “A new class of metric divergences on probability spaces and its applicability in statistics,” Annals of the Institute of Statistical Mathematics, vol. 55, no. 3, pp. 639–653, 2003.
15 I. Csisz´ar and J. K ¨orner, Information Theory: Coding Theorems for Discrete Memoryless System, Probability and Mathematical Statistics, Academic Press, New York, NY, USA, 1981.
16 M. Anwar, S. Hussain, and J. Peˇcari´c, “Some inequalities for Csisz´ar-divergence measures,”
International Journal of Mathematical Analysis, vol. 3, no. 26, pp. 1295–1304, 2009.