Photocopying permitted by license only the Gordon and BreachSciencePublishers imprint, amember of theTaylor &FrancisGroup.
An Extension of Chebyshev’s Inequality and its Connection with Jensen’s Inequality*
CONSTANTIN P. NICULESCU
Departmentof Mathematics, University of Craiova, Craiova1100,Romania
(Received 24 November 1999;Infinalform17 February2000)
The aim of this paper is to show that Jensen’s Inequality and an extension of Chebyshev’sInequalitycomplementoneanother,sothattheyboth can beformulatedin apairingform, includingasecond inequality, thatprovidesanestimate forthe classical one.
Keywords: Convexfunctions;Subdifferential; Meanvalue Mathematics SubjectClassifications2000: Primary:26A51,26D15
1.
INTRODUCTION
The well known fact that the derivative and the integral are inverse each other has a lot ofinteresting consequences, one of them being the duality between convexity and monotonicity. The purpose of the present paper is to relate on this basis two basic inequalities in ClassicalAnalysis, precisely those due toJensenandChebyshev.
Both refer to mean values of integrable functions. Restricting ourselves to the case offinite measure spaces (X,
,
#), let us recall*Partiallysupported byMENGrant39683/1998.
e-mail: [email protected]
451
that the mean value of any #-integrable function
f:
X can bedefinedas
M(f) #(X) fd#.
A
useful remark is that the mean value ofan integrable function belongstoanyinterval that includesitsimage; see [5], page202.In the particular case of an interval [a,b], endowed with the Lebesgue measure, the two aforementioned inequalities reads as follows:
JENSEN’S INEQUALITY Suppose that
f:
[a,b]--, is an integrablefunc-
tion and99 is a convex
function defined
on an interval containing the imageoff,
such thato of
is integrabletoo. Thenp(M(f)) <_ M(p of).
CHEBYSHEV’SINEQUALITY
If
g,h"[a, b] Rare twonondecreasingfunc- tions thenM(g)M(h) <_ M(gh).
Our goal is to show that Jensen’s Inequality and an extension of Chebyshev’s Inequality complement one another, so that they both can be formulated in a pairing form, including a second inequality, that providesan estimate for the classicalone.
2.
PRELIMINARIES
Before entering the details we shall need some preparation on the smoothness properties of the two type of functions involved: the convexand the nondecreasing ones.
Suppose
thatIis an interval (withinteriorIntI)
andf:
I---.is aconvexfunction. Then
f
iscontinuous on lntIand hasfiniteleft and rightderivatives ateach point ofInt LMoreover,
x
<
y in Int/=:D-f (x) < D+f(x) <
< D-f (y) < D+f(y)
which shows thatboth
D-f
andD+f
are nondecreasing on Int L It is not difficult to prove thatf
must be differentiable except for at most countably many points. See [5], pp. 271-272. On the other hand, simpleexamples show thatat the endpoints ofIcouldappear problems even with the continuity. Notice that every discontinuous convexfunctionf:
[a,b]--, comes fromacontinuous convex functionf0:
[a, b]--,,
whose values at aand/or
b were enlarged. In fact, anyconvex function
f
defined on an interval I is either monotonic or admits a point c such thatf
is nonincreasing on (-oo, c]f3I and nondccrcasingon [c,oo)UI.Inthecase ofconvex functions, the role ofderivative isplayed by thesubdifferential,amathematicalobjectwhichforafunction
f:
Iis defined as the set
Of
of all functionso:I--,
[-oo, o0] such thato(Int I)
c
andf(x) >f(a)+ (x- a)o(a), (V)
x,a EI.Geometrically, the subdifferentialgives usthe slopes of supporting lines for thegraph
off.
Thewell known fact that adifferentiable function is convex if(and onlyif)itsderivative isnondecreasing has the following generalization in termsofsubdifferential:
LEMMA1 LetIbeaninterval. The
subdifferential of
afunction f:
isnon-empty
if
andonlyif f
isconvex. Moreover,if
qo Of, thenD-f (x) < qo(x) <_ D+f(x)
for
every x Int L Particularly, gois anondecreasingfunction.
Proof
Necessity Suppose firstthatf
is aconvex function defined on an open interval L We shall prove thatD+f Of.
For, let x,a/,with x
>
a.Thenf((1 t)a + tx) f (a)
<_f(x) f (a)
t for each (0, 1], whichyields
f(x) >f(a) + D+f(a) (x a).
If
x<a,
then a similar argument leads us tof(x)>f(a)+
D-
f (a) (x- a);
or, D-f (a) (x-
a)>
D+f (a)
(x-a), becausex-a<O.
Analogously, wecan argue that
D-f Of.
Then from(,)
we inferthat any
o Of
isnecessarily nondecreasing.IfIis notopen, say I= [a,
oo),
we can complete the definition ofo
by letting
o(a)=-oo.
Sufficiency Let x, yE
L
x:#
y, andlet E (0,1). Thenf(x) >_f(( t)x +
ty)++ t(x y). o((1 t)x + ty)
f(r) >_f(( t)x + ty)-
(1 t)(x y). o((1 t)x + ty).
Bymultiplying thefirstinequalityby 1-t,thesecondby and then adding themside by side,weget
(1 t)f(x) + tf(x) _>f((1 t)x + ty)
i.e.,
f
isconvex. []Let usconsider nowthecase of nondecreasingfunctions. Itiswell known that each nondecreasing function
o:I
has at mostcountably manydiscontinuities
(each
ofthefirstkind);less known is that qo admits primitives(also
called antiderivatives). According to Dieudonn6[2],a primitiveofo
meansanyfunction"
I which iscontinuous on/, differentiable at each point of continuity ofo, and such that
,= o
atallthose points.An
exampleofaprimitive ofo
isx
,(x) o(t)dt,
x,
abeing arbitrarilyfixed inL
Because
o
isnondecreasing,aneasy computation shows that I,is a convex function.Infact, iscontinuous,so it suttices to show that#
(x,,+2 3’7) <- #(x) + (y)2
for allx,yI;
or, the last inequalityis equivalentto
(x+y)/2
99(t)dt < 99(t)dt,
Jx +y)/2
forallx, yEI,x
<
y the later being clear becauseo
isnondecreasing.On the other hand, by Denjoy-Bourbaki Theorem(The General- ized Mean Value Theorem) any two primitives of 99 differ by a constant. See [2], 8.7.1. Consequently, all primitives of 99 must be convex too!
3.
THE
MAIN RESULTSWe can now state our first result, the complete
form of
Jensen’sInequality:
THEOREM
A
Let (X,,
lz) beafinite
measure space and letg X-, bea#-integrablefunction. If f
isaconvexfunction
given on anintervalI that includesthe imageof
gandgoOf
isafunction
such that qoogandg.(qoog)areintegrable, then the followinginequalitieshold:
0
<_ M(f
og) -f(M(g)) <_ M(g. (qo
og)) M(g)M(qo
og).
Proof
The first inequality is that ofJensen, for which we give thefollowing simple argument: IfM(g) Int/, then
f(g(x)) >_f(M(g)) + (g(x) M(g)) qo(M(g))
for allx Xand the Jensen’sinequality follows by integrating both sides over X.
Thecasewhere M(g)is anendpoint ofIisstraightforward becausein thatcase g M(g)#a.e.
Thesecond inequalitycan be obtainedfrom
f(M(g)) >_f(g(x)) + (M(g) g(x)) qo(g(x))
forallxX
by integrating bothsidesoverX.COROLLARY
(See
[3], for thecase wheref
isa smoothconvex func- tion) Letf
beaconvexfunction defined
on anopenintervalI
andletOf
Then0
<_ Okf(Xk) --f
OkXk<_
k=l k=l
k=l k=l k=l
for
everyxl,. .,xn E Iandeveryal,. .,an [0, 1],withnk=
ak 1.Corollary 1 above allowsustosay somethingmore eveninthecase ofmost familiar inequalities.
Here
are three examples, all involving concavefunctions; TheoremA
and Corollary 1 both workinthatcase, simply by reversing the inequality signs. The first example concerns the sine function and improves on a well known inequality from Trigonometry:If A,B,
C are the anglesof
a triangle (expressed inradians) then
2
sinA -A co
2
-A
cosA<sinA<
2Togetafeeling of the lowerestimatefor sin
A,
just testthecase ofthe triangle withangles A (r/2), B(r/3)
andC(r/6)!
Our second exampleconcernsthefunction
In
and improves on the AM-GMInequality:/x x <_
x+ + x <_
n
The exponent in the right hand side can be further evaluated via a classical inequality due to Schweitzer [7] (as strengthened by
Cirtoaje [1]), so we can state the AM-GM Inequality as follows:
/f
0
<
m<_
x,...,x <_
M, thenX -J-’..-J-Xn
exp
[1 (M + m)
2+ [1 + (-1)#-I](M m)
2] <
4Mm 8Mmn2
< /x... x <
xi+... + x.
n
Stirling’s Formula suggests the possibility of further improvement, asubject which willbeconsidered elsewhere.
TheoremAalso allowsustoestimatetricky integrals suchas 4
f./4
I
In(1 + tanx)dx In
2 0.34657...’d0
By Jensen’sInequality,
ln(1 + tanx)dx <_ In (1 + tanx)dx
?I’d0
which yields a pretty good upper bound for Ibecause the difference between the two sides is (approximately) 1.8952x 10
-2.
Notice that aneasy computation shows thatr/4
(1 + tanx)dx r + ln2,
Theorem
A
allows us to indicate a valuable lower bound for /, precisely,I_>
In (4fo "/4-
r(1 + tanx)dx +
+1 r
4fo
r/4(1 + tanx)dx -4 ./4(1 + tanx)-l
dx.
In fact, Maple V4 shows that I exceeds the left hand side by 1.9679 x 10-
2.
Using theaforementionedduality between theconvex functionsand the nondecreasing ones, we caninfer from Theorem
A
the following result:THEOREM
B (The
extensionofChebyshev’sInequality) Let(X, ,
#) beafinite
measurespace, let g:X bea#-integrablefunction
andleto
be a nondecreasingfunction
given on an interval that includes theimage
of
g and such thatoo
g andg.(
og) are integrablefunctions.
Then
for
everyprimitiveofo
such that ogisintegrable thefollowing inequalities holdtrue:0
<_ M(
og) (M(g)) <_ M(g. (qo
og)) M(g)M(qo
og).Inordertoshow how TheoremByieldsChebyshev’sInequalitywe havetoconsider two cases.Thefirst one concernsthesituationwhere g: [a, b] [ isincreasing and h: [a,
b]
is nondecreasing. In that case we apply Theorem Bto g and hog-.
When bothg and h are nondecreasing, we shall consider increasing perturbations of g, e.g.,g+
exfor>
0.By
the previous case,+ ,Oh) < +
for eachThe followinge
>
0andtwoit remains toinequalitiestake theare consequences of Theoremlimit as e-
0+.
B:f0 (sin) ( sinl +
cossin-lx)dX- (f
0 sin(sin 12
"(fO (sin-lx+cssinl)dx) >x fo (
\ .,/1) (/0
1)
sin-dx sin sin-dx
> O;
x x
+
sin sin,x-1 )dx
(1)
jo eSinx/Xdx ( fo sin-dx)(fo eSinx/Xdx) >
01 (sinx) (folsinx)
>
exp dx expdXx >
O.(2)
The first one corresponds to the case where g(x)=sin
(I/x)
and o(x)=x+cosx, while the second one to g(x)=((sin x)/x) and(x) eX;
the fact that the inequalities above are strict is straightfor- ward.Inboth cases, the integralsinvolved cannotbe computed exactly (i.e., viaelementary functions).Using
MAPLE
V4, we can estimate the different integrals and obtainthat(1)
lookslike0.37673
>
0.16179>
0while
(2)
looks like5.7577 x10-3 >
2.8917x 10-3>
0.4.
ANOTHER
ESTIMATEOF JENSEN’S
INEQUALITYThe following result complements Theorem Aand yields a valuable upperestimateofJensen’sInequality:
TIEOREM C Let
f
[a,b] be a continuous convexfunction, and let [ml,M],...,[mn, M,] be compact subintervalsof
[a,b]. Givena,.. CnE [0, 1] with
=
na
1, thefunction
E(Xl, ,Xn) g(Xk) f
kXkk=l k=l
attains its supremum on =[m,M]x x[mn, Mn] at a boundary point(i.e.,atapoint
of
O {m, M} x x {mn,Mn}).Moreover, the conclusion remains valid
for
every compact convex domain in[a, b]n.
Theproof of TheoremC dependsupon the followingrefinement of
Lagrange
Meaneorem:
LA2 Leth: [a, b] bea continuous
function.
Then thereexistsapointc (a, b)such that
h(c) < h(b) h(a)
< h(c).
Here the lower and respectively the upper derivative of h at c are definedby
h(c)
lim infh(x) h(c)
and
h(c)
lim suph(x) h(c).
xc X C xc X C
Proof
Asin the smooth case, weconsiderthefunctionH(x) h(x) h(b) h(a) (x a)
b-a
Clearly,Hiscontinuousand
H(a) H(b).
IfHattains itssupremum at c(a,
b),thenD__H(c) <_
0< H(c)
andtheconclusionofLemma2is immediate. The same is true when H attains its infimum at an interiorpoint of[a,
b].
If bothextremaare attained atthe endpoints, then His constant and the conclusion ofLemma 2 works for every c in(a,b).
Proof of
Theorem C Itsuffices toshowthatE(x,...,xt,... ,xn) < sup{E(x,...
,mk,...,Xn),E(Xl,... ,Mtc,..., Xn)}
for everyxk [ink,Mk], k {1,...,
n}.
By
reduetio ad absurdum,wemay assume thatE(x
x,Xn) > sup{E(ml xz, Xn), E(M
xz,xn) }.
forsomex,xz,...,x,withxk [ink,Mk] for each k
{1,... ,n}.
Letting fixed xk [mk, Mk] with
k{1,...,n},
we consider the functionh"
[m,M] -- , h(x) E(x,
x2,...,Xn).
According to Lemma 2, there exists a (ml,x0 such that
h(x)-h(m) <(x-m)h(). As h(xO>h(m),
it follows thath() >
0, equivalently,f() > f(cl +
2X2+"" + OnXn).
Or,
f
is a nondeereasing function on(a,b)
(actuallyr D+f),
which leadsto
> a+ax+
+anXn, i.e., toc2x2
+ +
O.nXn>
2+"" + n
A
newappeal toLemma2 (applied this time to hl[x,
Md), yields an(x,M)
suchthat02X2
+’" +
O.nXn c2+... + cn
Or, the latercontradicts the fact that<
r/.The following application of Theorem Cis dueto Khanin [6]: Let p
>
1, Xl,...,Xn [O,M] and a,...,tn [0,1], with’k=ln
Ok 1.Then
OtkXPk <_
OtkXkk=l k=l
+ (p 1)pP/(-P)M
p.Particularly,
n
<
n4’
whichrepresentsanadditiveconverse toCauchy-Schwarz Inequality.
In fact, according toTheoremC,thefunction
E(Xl, Xn) OkXPk
OtkXkk=l k=l
attains its supremum on [0,M] at a boundary point i.e., at apoint whosecoordinates are either 0 orM.Therefore
supE(Xl,...
,xn) <_
Me.sup{s- sP;sC [0, 11}
(p
1)pe/O-e)Me.
Another immediate consequence of Theorem C is the following fact, whichimproves on aspecial caseofaclassical inequality dueto Hardy, Littlewood and Polya
(cf.
[4], p. 89): Letf’.
[a,b]---, be acontinuousconvex
function.
Thenf(a) +f(b)
_f(a+b)
2> f(c)
2_f(c+d)2
for
every a<
c<_
d<
b; in [4], one restricts to the case where a/b c+d. The problem of extending this statement for longer familiesofnumbersisleft open.References
[1] Cirtoaje,V.(1990).Onsome inequalities with restrictions,Revistadematematica din Timisoara,1, 3-7(Romanian).
[2] Dieudonn,J.(1960).FoundationsofModern Analysis, AcademicPress.
[3] Drimbe, M. O. (1996). A connection between the inequalities ofJensen and Chebyshev,GazetaMatematica,CI(5-6),270-273(Romanian).
[4] Hardy, G. H., Littewood, J. E. and Polya, G., Inequalities, Cambridge Mathematical Library,2nd edn., 1952, Reprinted1988.
[5] Hewitt,E. and Stromberg,K. (1965).Real and Abstract Analysis,Springer-Verlag.
[6] Khanin, L. G.(1988).ProblemM1083,Kvant, 18(1),p. 35 andKvant, 18(5),p. 35.
[7] Polya,G. and Szeg6, G.,Aufgabenund Lehrsitze aus Analysis,Vol. 1, Springer Verlag, 1925, English edition, Spdnger-Verlag, 1972.