ISSN:1083-589X in PROBABILITY
On a concentration inequality for sums of independent isotropic vectors
∗Michael C. Cranston
†Stanislav A. Molchanov
‡Abstract
We consider a version of a classical concentration inequality for sums of independent, isotropic random vectors with a mild restriction on the distribution of the radial part of these vectors. The proof uses a little Fourier analysis, the Laplace asymptotic method and a conditioning idea that traces its roots to some of the original works on concentration inequalities.
Keywords:concentration inequality; isotropy.
AMS MSC 2010:Primary 60F05, Secondary 60E15.
Submitted to ECP on August 18, 2009, final version accepted on April 23, 2012.
1 Introduction
The purpose of this paper is to give a multi-dimensional version of a concentration inequality that traces its origins to P. Lévy [14]. These Lèvy type concentration results assert a rate of decay on the maximal amount of mass that the distribution of a sum of independent, nondegenerate, random variables can place in an arbitrary interval.
We remark that this is in contrast to another version of the concentration phenomena, see [13] and the vast literature listed there, where measures concentrate most of their mass on particular subsets of a measure space. However, the proof of our result relies on an early version of the latter type of concentration phenomena due to Bernstein and Hoeffding. Motivation to study the concentration inequality in the context of isotropic random vectors arises from a variety of sources. Sums of independent isotropic vectors are the so-called isotropic random flights which arise in problems in astronomy, [2], [4].
Such sums appear in random searches in the context of biological encounters, [3]. They are also used as polymer models in chemistry, [6], [8]. Yet another example of isotropic random vectors arises in a discrete model for the theory of magnetic fields generated by a turbulent media, [15], where random vectors of the formM ξappear whereM is a random element of the orthogonal group,O(3),and the random vectorξis independent ofM.
Early works on the concentration inequalities in the real valued case are due to Döblin, [5], Kolmogorov [11], Lèvy [14] and Rogozin [17]. The higher dimensional case has been treated in Kanter, [10]. In [10], the author considered independent, RN- valued, symmetric random vectors X1,· · · , Xn. Then if C ⊂ RN is a convex set and
∗Research supported by grants from NSF, DMS-1007176 and DMS-0706928.
†M. Cranston, University of California Irvine, USA. E-mail:[email protected]
‡S. Molchanov, University of North Carolina at Charlotte, USA. E-mail:[email protected]
λ=Pn
i=1P(Xi6∈C)it was proved that
P(X1+· · ·+Xn+x∈C)≤Φ(λ)
whereΦ(s) =e−s(I0(s) +I1(s))andI0andI1are the modified Bessel functions given by Ik(s) =
∞
X
m=0
1 m!(m+k)!
s 2
2m+k
.
Our work considers the convex set C = Q(l), the cube centered at Q(l) = [0l]d. As remarked in [10],Φ(n2) ≥ cn−1/2. Thus, the Kanter result doesn’t contain the dimen- sional dependence which does appear in our bound on the concentration function in Theorem 2.1.
The authors wish to thank the referee for pointing out the Lèvy decoupage technique [1], used in our conditioning argument at (3.6) below, which greatly improved an earlier version of the paper.
2 Statement of Result
Let us give a precise statement of the result proven by Kolmogorov [11]. Suppose {ξk}k≥1are independent random variables defined on some probability space(Ω,F, P).
SetSn=Pn
k=1ξk and define forl >0,the concentration function Qk(l) = sup
x
P(x≤ξk≤x+l) and
QSn(l) = sup
x
P(x≤Sn ≤x+l).
Kolmogorov’s version of the concentration inequality states that there exists a constant C >0such that if
L≥l, L2≥l2logs where
s=
n
X
k=1
(1−Qk(l)) then
QSn(L)≤ CL l√
s.
We prove a higher dimensional version of this result. Due to possible degeneracies, the higher dimensional result will not hold in general, but a natural condition to impose on random vectors under which the result turns out to be true is isotropy. We say an Rd valued random vector, X, on a probability space(Ω,F, P) is isotropic if P X−1 = P(U X)−1for everyU ∈ O(d),the group of orthogonal matrices. With this definition we have,
Theorem 2.1. LetX1, X2,· · · be independent, isotropic random vectors with values in Rd, d≥2and putSn =X1+X2+· · ·+Xn.Letl >0andL >0be given. Define, for a >0,the cube
Q(l) = [0, a]d⊂Rd. Assumepi =P(Xi ∈/ Q(l)), satisfiesPn
i=1pi → ∞.Then given any∈ (0,1),for every x∈Rd,
P(Sn∈x+Q(L))≤2 exp
−22(Pn i=1pi)2 n
+(1 +o(1)) r d
2π L
l
!d
(1−)
n
Xpi
!−d2
.
(2.1)
There various immediate corollaries that may be derived from this. We sate two such.
Corollary 2.2. Assume in addition to the conditions above that
n
X
i=1
pi>2−2/d r d
2π L
l
!2
and
1
4 > n 2(Pn
i=1pi)2ln2 12Pn i=1pid2 q
d 2π
L l
d .
Then there is a positive constantcindependent ofn, landLsuch that,
P(Sn ∈x+Q(L))≤c
n
X
i=1
pi
!−d2
L l
d
. Proof. The conditions of the corollary ensure that the solution of
2= n
2(Pn
i=1pi)2ln2 ((1−)Pn i=1pi)d2 q
d 2π
L l
d
satisfies 0 < < 12. This follows since the left hand side minus the right hand side is increasing inand this forces the solution to be less than 14. Sinceis then bounded from1we can take thisin (2.1) and find thecas claimed by usingTheorem 2.1.
Another possibility is the following.
Corollary 2.3. LetX1, X2,· · · be independent, isotropic random vectors with values in Rd, d≥2,for which
P(Xi∈/Q(1))≥ 1
2, for alli≥1.
There is a constantc(d)such that for everyxwe have P(Sn ∈x+Q(1))≤ c(d)
nd/2. Proof. In this case, Pn
i=1pi ≥ n2 and L = l = 1. Taking = 12 in Theorem 2.1 the corollary holds with a suitable choice of constantc.
3 Proof of the Concentration Inequality
We commence with two lemmas, the first is a local central limit theorem. This is likely a known result and as the proof is short we include it for completeness.
Lemma 3.1. Suppose thatYe1,· · · ,Yemare iiduniformly distributed on the unit sphere inRd.Ifpedmdenotes the density ofYe1+· · ·+Yemthen
pedm(0)∼
r d 2πm
!d
, m→ ∞.
Proof. IfY is uniformly distributed on the unit sphere inRd,then fork∈Rd,we first compute
ψd(k) =E[eihk,Yi].
Given k, one selects coordinates on the sphere φ1, φ2,· · ·, φd−1, φi ∈ [0, π),fori = 1,· · ·, d−2,andφd−1 ∈ [0,2π), so thatk= |k|cosφ1.The volume form on the sphere, normalized to have total mass one, is given in these coordinates by
dV = Γ(d2+ 1)
dπd2 sind−2φ1sind−3φ2 · · ·sinφd−2dφ1dφ2· · ·dφd−1.
Then since the characteristic function of a random vector Y which is uniformly dis- tributed on the sphere of radius1inRdis real,
E[eihk,Yi] =cd
Z π
0
cos(|k|cosφ1) sind−2φ1dφ1, wherecd= Rπ
0 sind−2φ1dφ1−1 . Ford= 2,by [12] page115,
ψ2(k) =1 π
Z π
0
cos(|k|cosφ)dφ
=1 π
Z π
0
cos(|k|sinφ)dφ
=J0(|k|),
whereJ0(z)is the0thorder Bessel function of the first kind given by J0(z) = 1−(z/2)2
(1!)2 +(z/2)4
(2!)2 −(z/2)6 (3!)2 +· · · . Ford= 3,
ψ3(k) =1 2
Z π
0
cos(|k|cosφ1) sinφ1dφ1
=1 2
Z 1
−1
cos(|k|w)dw
=sin(|k|)
|k| . Ford >3,we have by [12] page114,
ψd(k) =cd Z π
0
cos(|k|cosφ1) sind−2φ1dφ1
=2d2−1Γ d
2
|k|−d2+1Jd
2−1(|k|), whereJd
2−1is the order d2−1Bessel function of the first kind.
Since for anyd≥2,
pedm(0) = 1 (2π)d
Z
Rd
(ψd(k))mdk, (3.1)
we need to check the asymptotics of the right hand side.
Starting withd= 2,we observe thatψ2(r) =J0(r)has a unique global maximum at r = 0 withψ2(0) = 1. Also, ψ002(0) = −12 and for any > 0, c2() ≡ supr≥|ψ2(r)| <1.
Similarly, ford= 3, ψ3(r) = sinrr has a unique global maximum atr= 0withψ3(0) = 1.
This timeψ003(0) =−13 and for any >0, c3()≡supr≥|ψ3(r)|<1.Finally, ford >3,we use the the representation, from [12] page114,
2d2−1Γ d
2
r−d2+1Jd
2−1(r) = Γ d2 Γ 12
Γ d−12 Z 1
−1
(1−t2)d−32 cos(rt)dt to conclude thatψd(r) = 2d2Γ d2
r−d2+1Jd
2−1(r)has a unique global maximum atr = 0 withψd(0) = 1and
ψ00d(0) =− Γ d2 Γ 12
Γ d−12 Z 1
−1
(1−t2)d−32 t2dt
=− Γ d2 Γ 12
Γ d−12
Γ d−12 Γ 32
Γ d2+ 1
=−1 d,
and for any >0, cd()≡supr≥|ψd(r)|<1.
In order to determine the asymptotics at (3.1), we need ther → ∞decay ofψd(r).
Ford≥2from [12] page134for some positive constantc,
|ψd(r)| ≤c√
r−d+1, r→ ∞. (3.2)
Now, withωd= dπ
d 2
Γ(d2+1) being the volume ofSd−1,write 1
(2π)d Z
Rd
(ψd(k))mdk= ωd
(2π)d Z ∞
0
rd−1(ψd(r))mdr
= ωd
(2π)d Z
0
rd−1(ψd(r))mdr+ ωd
(2π)d Z ∞
rd−1(ψd(r))mdr.
(3.3)
We use the fact that in any dimension d ≥ 2, cd() ≡ supr≥|ψd(r)| < 1 to show the second integral dies off exponentially fast inm.In fact,
Z ∞
rd−1(ψd(r))mdr
≤cd()m/2 Z ∞
rd−1|ψd(r)|m/2dr (3.4) and by (3.2), the second integral is bounded inmfor m2 > d−12d .
Since for anyd≥2, ψd00(0) =−1d,in the first integral, we write (ψd(r))m=emlnψd(r)∼e−mr
2 2d
proceeding with the Laplace asymptotic method gives ωd
(2π)d Z
0
rd−1(ψd(r))mdr∼ ωd (2π)d
Z
0
|r|d−1e−mr
2 2d dr
= ωd
(2π)d rd
m
!d
Z
√m d
0
rd−1e−r
2 2 dr
∼ r d
2πm
!d
(3.5)
Thus, by (3.3), (3.4) and (3.5), ford≥2,
pedm(0)∼
r d 2πm
!d
and the lemma is proved.
Lemma 3.2. Let Y1, Y2,· · ·, Ymbe independent random vectors withYi uniformly dis- tributed on the sphere of radiusRi, i= 1,· · ·, m.Assume there is a numberl >0such that eachRi≥land theRiare non-random. Letmbe an even integer. Then forL >0 and anyx∈Rd,
P(Y1+Y2+· · ·+Ym∈x+Q(L))≤(1 +o(1)) r d
2πm L
l
!d
, m→ ∞.
Proof. It suffices to provide an appropriate bound on the L∞ norm of the density of Y1+Y2+· · ·+Ym.Notice that
E[eihk,Yii] =ψd(Rik).
Then, by the independence ofY1, Y2,· · ·, Ym, E[eihk,Y1+Y2+···+Ymi] =
m
Y
i=1
ψd(Rik).
By (3.2), for m > d−12d , this characteristic function is integrable, which means Sm = Y1+Y2+· · ·+Ym has a bounded density,pdm(x), x ∈ Rd.In fact, for m even, by the arithmetic-geometric mean and Jensen’s inequalities,
||pdm||∞≤ 1 (2π)d
Z
Rd
m
Y
i=1
ψd(Rik)
dk
≤ 1 (2π)d
1 m
m
X
i=1
Z
Rd
(ψd(Rik))mdk
= 1 (2π)d
1 m
m
X
i=1
1 Rid
Z
Rd
(ψd(k))mdk
≤ 1 (2πl)d
Z
Rd
(ψd(k))mdk
≤(1 +o(1)) r d
2πm 1
l
!d .
where the last line follows fromLemma 3.1.This completes the proof.
We can now prove the main result,Theorem 2.1.
Proof. We use the Lèvy decoupage technique [1] to decompose the random variables Xi,based on whether|Xi| ≥lor|Xi|< l.Set
pi=P(|Xi| ≥l).
Recall that we are assuming that pi ≥ 12, i = 1,2,· · · , n. For i = 1,2,· · · , n, let the random variables{Ui: 1≤i≤n}and{Vi: 1≤i≤n}be independent with
L(Ui) =L(|Xi| | |Xi| ≥l),L(Vi) =L(|Xi| | |Xi|< l).
Also take Bernoulli random variables{ηi: 1≤i≤n}independent of the{Ui: 1≤i≤n}
and the{Vi: 1≤i≤n}with
P(ηi= 1) = 1−P(ηi= 0) =pi.
Then, as above, take{Yei : 1≤i≤n}to be uniformly distributed on the unit sphere in Rdand independent of the previously defined random variables. In an obvious notation we may take the probability measureP asP =P(U,V)×Pη×P
Yeand our original random variables may be represented as
Xi= (ηiUi+ (1−ηi)Vi)Yei. For1≤i≤n,set
Yi1=ηiUiYei and
Yi0= (1−ηi)ViYei.
Notice that{Yi1: 1≤i≤n, ηi6= 0}satisfies the conditions ofLemma 3.2with respect to the probability measurePη×P
Ye a.s..Denote the vector of outcomes of the sequence {ηi: 1≤i≤n}by
ζn = (η1, η2,· · ·, ηn).
Then for a given deterministic sequence(α1, α2, α3,· · ·, αn)∈ {0,1}n,define pη(α)≡P(ζn= (α1, α2, α3,· · ·, αn)) = Πni=1pαii(1−pi)1−αi.
By Bernstein’s inequality or it’s generalization due to Hoeffding, [9], given0< <1, Pη
n
X
i=1
ηi<(1−)
n
X
i=1
pi
!
=Pη
n
X
i=1
(ηi−pi)<−
n
X
i=1
pi
!
≤Pη |
n
X
i=1
(ηi−pi)|>
n
X
i=1
pi
!
≤2 exp
−22(Pn i=1pi)2 n
.
(3.6)
We then have
Pη×P
Ye(Sn ∈x+Q(L)) = X
α∈{0,1}n
pη(α)P
Ye n
X
i=1
(αiUi+ (1−αi)Vi)eYi∈x+Q(L)
!
≤ X
α∈{0,1}n,Pn
i=1αi<(1−)Pn i=1pi
pη(α)
+ X
α∈{0,1}n,Pn
i=1αi≥(1−)Pn i=1pi
(pη(α)
×P
Ye n
X
i=1
(αiUi+ (1−αi)Vi)eYi ∈x+Q(L)
!!
.
(3.7)
It easily follows from Fubini’s theorem that ifZ is independent ofSn,then for anyQ
sup
x
P(Sm+Z ∈x+Q)≤sup
x
P(Sm∈x+Q). (3.8)
Using Lemma 3.2with m = [(1−)Pn
i=1pi] which we may assume is even, we con- clude from (3.8) that for eachα∈ {0,1}n for whichPn
i=1αi ≥(1−)Pn
i=1pi we have P(U,V) a.s.
sup
x
PYe n
X
i=1
(αiUi+ (1−αi)Vi)Yei∈x+Q(L)
!
≤sup
x
PYe n
X
i=1
αiUiYei∈x+Q(L)
!
≤(1 +o(1)) r d
2π 1 l
!d
(1−)
n
X
i=1
pi
!−d2
|Q|.
(3.9)
Thus, from (3.6), (3.7) and (3.9), it follows that asPn
i=1pi→ ∞,one hasP(U,V)a.s.
Pη×P
Ye(Sn∈x+Q(L))≤2 exp
−22(Pn i=1pi)2 n
+ (1 +o(1)) r d
2π L
l
!d
(1−)
n
X
i=1
pi
!−d2
. Now integrate with respect toP(U,V)to complete the proof.
References
[1] Aloisio Araújo and Evarist Ginè. The central limit theorem for real and Banach valued ran- dom variables.. Wiley, New York, June 1980. MR-0576407
[2] Richard Barakat. Isotropic Random Flights. J. Phys. A., Nucl. Gen., Vol. 6, 796-804, June 1973. MR-0418184
[3] Richard Barakat.Lévy flights and superdiffusion in the context of biological encounters and random searches Physics of Life Reviews, Vol. 5, 133-150, 2008.
[4] S. Chandrasekhar.Stochastic Problems in Physics and Astronomy. Rev. Mod. Phys. 15 1-89, 1943. MR-0008130
[5] W. Döblin.Sur la summe d’un grande nombres des variables aleatoires independantes. Bull.
Sc. Math, 63, pp 23-32, 35-64, 1939.
[6] M. Doi and S.F. Edwards. The Theory of Polymer Dynamics. International Monographs on Physics 1988.
[7] R. Durrett. Probability: Theory and Examples, Second Edition Duxbury Press, 1996. MR- 1609153
[8] P. J. Flory Statistical Mechanics of Chain Molecules. New York: Interscience, 1969.
[9] Wassily Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association 58 (301): 13–30, March 1963. MR-0144363
[10] Marek Kanter. Probability inequalities for convex sets and multidimensional concentration functions. J. Multivariate Anal. 6 (1976), no. 2, 222–236 MR-0478328
[11] A.N. Kolmogorov. Sur les proprietes des fonctions concentrations de M. P, Lévy. Ann. de l’Inst. Henri Poincaré, XVI, 1,pp.27-34, 1958.
[12] N.N. Lebedev. Special Functions and Their Applications. Prentice-Hall, Englewood Cliffs, N.J. 1965.MR-0174795
[13] Michel Ledoux.The Concentration of Measure Phenomenon. AMS, 2001. MR-1849347 [14] Paul Lévy.Theorie de l’Addition des Variables Aleatoires. Gauthier-Villars, Paris, 1937.
[15] S.A. Molchanov and A. Ruzmaikin.Lyapunov exponents and distributions of magnetic fields in dynamo models. The Dynkin Festschrift, 287–306, Progr. Probab., 34, BirkhŁuser Boston, Boston, MA, 1994.
[16] G. Polya and G. Szego. Problems and Theorems in Analysis. Springer Verlag, New York, 1972. MR-0344042
[17] B. Rogozin.An estimate for concentration functions. Theory of Probability and its Appl. VI, 1, pp. 94-96, 1961. MR-0131893