THE BERRY-ESSEEN THEOREM
K. NEAMMANEE
Received 24 November 2004 and in revised form 8 March 2005 Dedicated to Professor Yupaporn Kemprasit on her sixtieth birthday.
In 2001, Chen and Shao gave the nonuniform estimation of the rate of convergence in Berry-Esseen theorem for independent random variables via Stein-Chen-Shao method.
The aim of this paper is to obtain a constant in Chen-Shao theorem, where the ran- dom variables are not necessarily identically distributed and the existence of their third moments are not assumed. The bound is given in terms of truncated moments and the constant obtained is 21.44 for most values. We use a technique called Stein’s method, in particular the Chen-Shao concentration inequality.
1. Introduction and main result
LetX1,X2,. . .,Xnbe independent and not necessarily identically distributed random vari- ables with zero mean and finite variance. Define W=X1+X2+···+Xn and assume that Var(W)=1. LetFnbe the distribution function ofWandΦthe standard normal distribution function. It is well known that if the Lindeberg condition,
∀ε >0, n i=1
EXi2IXi> ε−→0 asn−→ ∞, (1.1) whereI(A) is an indicator random variable such that
I(A)=
1 ifAis true,
0 otherwise, (1.2)
is satisfied, then
∀x∈R, Fn(x)−→Φ(x) asn−→ ∞. (1.3) Furthermore, ifE|Xi|3<∞, then we have the uniform Berry-Esseen theorem
sup
x∈R
Fn(x)−Φ(x)≤C0
n i=1
EXi3, (1.4)
Copyright©2005 Hindawi Publishing Corporation
International Journal of Mathematics and Mathematical Sciences 2005:12 (2005) 1951–1967 DOI:10.1155/IJMMS.2005.1951
and the nonuniform Berry-Esseen theorem Fn(x)−Φ(x)≤ C1
1 +|x|3
n i=1
EXi3, (1.5)
where bothC0andC1are absolute constants.
Note that in caseXi’s are identically distributed, (1.4) and (1.5) were first obtained by Esseen [4] and Nagaev [8], respectively. Bikjalis [1] generalized Nagaev’s result to the case thatXi’s are not necessarily identically distributed random variables. Paditz [9,10]
calculatedC1to be 114.7 and 32 in 1977 and 1989, respectively, and Michel [7] reduced it to 30.84 for the independent and identically distributed case.
In 2001, Chen and Shao gave nonuniform and uniform bounds for independent and not necessarily identically distributed random variables without assuming the existence of third moments. Their result states as follows.
Theorem1.1 (Chen-Shao theorem). LetX1,X2,. . .,Xnbe independent random variables with zero means andni=1EXi2=1. LetW=X1+X2+···+Xnand letFnbe the distribu- tion function ofW. Then,
F(x)−Φ(x)≤C n i=1
EXi2IXi≥1 +|x|
1 +|x|2 +EXi3IXi<1 +|x| 1 +|x|3
, (1.6) F(x)−Φ(x)≤4.1
n i=1
EXi2IXi≥1+EXi3IXi<1. (1.7)
Observe that the constant 4.1 in (1.7) is smaller than 6 as obtained by Feller [5] and it was pointed out by Loh [6] that the truncation at 1 in (1.7) is optimal in the sense that
EX2I|X| ≥1+E|X|3I|X|<1=inf
A
EX2I(X∈A) +E|X|3IX∈AC. (1.8) The standard tool used Esseen [4], Nagaev [8], Bikjalis [1], Paditz [9,10], and Michel [7]
is the Fourier-analytic method. But Chen and Shao [3] proved (1.6) and (1.7) by com- bining truncation with Stein’s method [14] and the concentration inequality approach.
The concentration inequality approach was originally used by Stein for independent and identically distributed random variables. It was extended by Chen [2] to dependent and nonidentically distributed random variables with arbitrary index sets. In [3], the con- centration inequality approach is improved and extended to nonuniform bounds. The improved approach is much more effective than that in [2]. In this paper, we combine the concentration inequality in [3] with the coupling approach to calculate the constant Cin (1.6). The followings are our main results.
Theorem 1.2. LetX1,X2,. . .,Xnbe independent random variables with zero means and n
i=1EXi2=1. LetW=X1+X2+···+Xnand letFn be the distribution function ofW.
Then
Fn(x)−Φ(x)≤C0
n i=1
EXi2IXi≥1 +|x/4|
1 +|x/4|2 +EXi3IXi<1 +|x/4| 1 +|x/4|3
, (1.9)
where
C0=
21.44 if|x| ≤3or|x| ≥14,
32 if3<|x| ≤3.99or7.98<|x|<14, 60 otherwise.
(1.10)
Corollary1.3. IfXi’s inTheorem 1.1have finite third moment, then Fn(x)−Φ(x)≤C1
n
i=1EXi3
1 +|x/4|3 , (1.11)
where
C1=
21.44 if|x| ≤7.98or|x| ≥14,
32 if7.98<|x|<14. (1.12) Observe that the bound inTheorem 1.2is given in terms of truncated moments. It is worthwhile to note also that truncated moments were considered by Sazonov [13]. In his work, he gave two main methods for deriving speed of convergence results in the central limit theorem (CLT), namely, the Fourier-analytic method and the method of composition which used convolutions directly. These methods are used to derive more results for random vectors. For nonuniform bound in CLT of random vectors, one can see, for examples, Rotar [11,12].
2. Auxiliary results
In this section, we give auxiliary results in order to prove the main theorem inSection 3.
LetX1,X2,. . .,Xn,W,Fn, andΦbe defined as inTheorem 1.2. In order to use the con- centration inequality and the coupling approach, we introduce random variablesJ, ˜X1, X˜2,. . ., ˜Xn defined in the following way. The random variables J, X1,X2,. . .,Xn, ˜X1, X˜2,. . ., ˜Xn are independent, J uniformly distributed over the set{1, 2,. . .,n}, (Xi, ˜Xi) is a coupling pair, that is,Xiand ˜Xiare the same distributions. Fora >0, we also let
Yj,a=XjIXj<1 +a, Y˜j,a=X˜jIX˜j<1 +a, Sa=
n j=1
Yj,a, S˜a=Sa−YJ,a+ ˜YJ,a, αa=
n j=1
EX2jIXj≥1 +a, βa= n j=1
EXj3IXj<1 +a, δa= αa
(1 +a)2+ βa
(1 +a)3.
(2.1)
Observe that (Yj,a,Yj,a) is a coupling pair and (Sa,Sa) is an exchangeable pair in the sense that
PSa∈E,Sa∈E=PSa∈E, Sa∈E (2.2)
for arbitrary Borel sets Eand EonR. From the fact that (a+b)n≤2n−1(an+bn) for a,b≥0, we have
EYJ,a3=1 n
n j=1
EXj3IXj<1 +a=βa
n, (2.3)
EY˜J,a−YJ,a3≤8 n
n j=1
EXj3IXj<1 +a=8βa
n . (2.4)
Inproposition 2.1, we use the coupling approach to boundES2aandES4awhich are used in the proof of the concentration inequality.
Proposition2.1. (1)ESaS˜a=(1−1/n)Sa+ (1/n)ESa, whereEXYis the conditional expec- tation ofYwith respect toX.
(2)ES2a≤1 + (αa/(1 +a))2.
(3)|ESa|3≤12βa+ 3(αa/(1 +a)) + (αa/(1 +a))3.
(4)ES4a ≤53(1 +a)βa + 30βa(αa/(1 +a)) + 6(αa/(1 +a))2+ (αa/(1 +a))4+ 6βa + 6αa+ 3.
(5)If(1 +a)2αa+ (1 +a)βa<1/80anda≥3, thenES2a≤1 + (3.8×10−8)andES4a≤ 3.69.
(6)If(1 +a)2αa+ (1 +a)βa≥1/80anda≥14, thenES4a/a4≤391δa. Proof. (1)
ESaS˜a=ESaSa−YJ,a+ ˜YJ,a
=Sa−ESaYJ,a+ESaY˜J,a
=Sa−1 n
n j=1
ESaYj,a+1 n
n j=1
ESaY˜j,a
=Sa−1 n
n j=1
Yj,a+1 n
n j=1
EYj,a
=
1−1 n
Sa+1 nESa.
(2.5)
(2) Leth:R2→Rbe defined by
h(˜t,t)=t˜2−t2. (2.6)
Sincehis antisymmetric in the sense thath(˜t,t)= −h(t, ˜t) and (Sa, ˜Sa) is an exchangeable pair, by Stein [15, equation (9), page 10],
ES˜2a−S2a=EhS˜a,Sa
=0. (2.7)
From this fact and (1), we have
0=ES˜a−SaS˜a+Sa
=2ES˜a−Sa
Sa+ES˜a−Sa2
=2EESaS˜a−Sa
Sa+ES˜a−Sa2
= −2
nES2a+ES˜a−Sa
2
+2 nE2Sa,
(2.8)
which implies that ES2a=n
2EY˜J,a−YJ,a
2
+E2Sa
= n j=1
EX2jIXj<1 +a−E2XjIXj<1 +a+E2Sa
≤ n j=1
EX2jIXj<1 +a+E2Sa
≤1 + αa
1 +a 2
,
(2.9)
where we have used the fact thatnj=1EX2j =1 and ESa=
n j=1
EXjIXj<1 +a =
n j=1
EXjIXj≥1 +a ≤ αa
1 +a (2.10) in the last inequality.
(3) By the same argument of (2), withh(˜t,t)=(˜t−t)(˜t2+t2), ES3a=n
2ES˜a−SaS˜2a−S2a+ESaES2a
=n
2ES˜a−Sa2S˜a+Sa
+ESaES2a
=n
2EY˜J,a−YJ,a
2Y˜J,a−YJ,a
+ 2Sa
+ESaES2a
=n
2EY˜J,a−YJ,a3+nEY˜J,a−YJ,a2Sa+ESaES2a.
(2.11)
Hence,
ES3a≤n
2EY˜J,a−YJ,a3+nEY˜J,a−YJ,a2
Sa+ESaES2a
≤4βa+nEY˜J,a−YJ,a2Sa+ αa
1 +a
+ αa
1 +a 3
,
(2.12)
where we have used (2.4), (2.10), and (2) in the last inequality. Note that EY˜J,a−YJ,a2Sa=
1 n
n j=1
EY˜j,a−Yj,a2 n l=1
Yl,a
≤ 1
n n j=1
EY˜j,a−Yj,a
2
Yj,a
+
1 n
n j=1
EY˜j,a−Yj,a
2n l=1 l=j
EYl,a
≤1 n
n j=1
EY˜j,a−Yj,a2Yj,a+1 n
n j=1
EY˜j,a−Yj,a2ESa +1
n n j=1
EY˜j,a−Yj,a2EYj,a
≤8 n
n j=1
EYj,a3+2 n
αa
1 +a
≤8βa
n +2 n
αa
1 +a
.
(2.13) Hence, by (2.12) and (2.13),|ES3a| ≤12βa+ 3(αa/(1 +a)) + (αa/(1 +a))3.
(4) Using the same argument of (2), withh(˜t,t)=(˜t−t)(˜t3+t3), we have ES4a=n
2ES˜a−SaS˜3a−S3a+ESaES3a
=n
2ES˜a−Sa
2S˜a−Sa
2
+ 3 ˜SaSa
+ESaES3a
=n
2EY˜J,a−YJ,a4
+3n
2 EY˜J,a−YJ,a2
S2a+Y˜J,a−YJ,a Sa
+ESaES3a
≤n(1 +a)EY˜J,a−YJ,a3+3n
2 EY˜J,a−YJ,a
2
S2a + 3n(1 +a)EY˜J,a−YJ,a2
Sa+ESaES3a
≤32(1 +a)βa+ 6αa+ 12βa
αa 1 +a
+ 3
αa 1 +a
2
+ αa
1 +a 4
+3n
2 EY˜J,a−YJ,a2S2a,
(2.14)
where we have used (2.4), (2.10), (2.13), and (3) in the last inequality. From (2.14) and the fact that
EY˜J,a−YJ,a
2
S2a
=1 n
n j=1
EY˜j,a−Yj,a2
E
n l=1 l=j
Yl,a
2
+2 n
n j=1
EY˜j,a−Yj,a2
Yj,aE
n l=1 l=j
Yl,a
+1 n
n j=1
EY˜j,a−Yj,a2Y2j,a
≤2 n
n j=1
EY2j,aE
n l=1 l=j
Yl,a
2
+8 n
n j=1
EYj,a3 E
n l=1 l=j
Yl,a
+4(1 +a) n
n j=1
EYj,a3
≤2 n
n j=1
EY2j,aES2a−4 n
n j=1
EYj,a2 ESaYj,a+2 n
n j=1
EY2j,aEY2j,a
+8 n
n j=1
EYj,a3ESa+8 n
n j=1
EYj,a3EYj,a+4(1 +a) n βa
≤2 n+2
n αa
1 +a 2
+4 n
n j=1
EYj,a3
ES2a+8βa n
αa
1 +a
+14(1 +a)βa n
≤2 n+4βa
n +12βa
n αa
1 +a
+2 n
αa 1 +a
2
+14(1 +a)βa
n ,
(2.15) we have
ES4a≤53(1 +a)βa+ 30βa
αa 1 +a
+ 6 αa 1 +a
2
+ αa
1 +a 4
+ 6βa+ 6αa+ 3. (2.16) (5) Follows directly from (2) and (4).
(6)
ES4a a4 ≤
53 a3
1 +a a
βa+30βa a4
αa
1 +a
+ 6 a4
αa
1 +a 2
+ 1 a4
αa 1 +a
4
+6βa
a4 +6αa a4 + 3
a4
≤70.697βa
(1 +a)3 +0.035αa
(1 +a)2+ 3.997 (1 +a)4
≤70.697βa
(1 +a)3 +0.035αa
(1 +a)2+ 319.76αa
≤391δa,
(2.17)
where we have used the fact that a≥14, αa≤1, and (1 +a)/a≤1.072 in the second inequality and the fact that (1 +a)2αa+ (1 +a)βa≥1/80 in the last inequality.
Next, we will prove the concentration inequality.
Proposition 2.2 (concentration inequality). Let i∈ {1, 2,. . .,n} and W(i)=W−Xi. Then for3≤a < b <∞and(1 +a)2αa+ (1 +a)βa<1/80,
Pa≤W(i)≤b≤ 40.98
(1 +a)3(b−a) + 46.38δa. (2.18)
Proof. LetSi,a=Sa−Yi,a. We observe thatW(i)=Si,awhen max1≤j≤n,j=i|Xj|<1 +a. So
Pa≤W(i)≤b≤Pa≤Si,a≤b+P
max
1≤j≤n j=i
Xj≥1 +a
≤Pa≤Si,a≤b+ αa
(1 +a)2.
(2.19)
Letγ=βa/2 and f :R→Rdefined by
f(t)=
0 fort < a−γ,
(1 +t+γ)3(t−a+γ) fora−γ≤t≤b+γ, (1 +t+γ)3(b−a+ 2γ) fort > b+γ.
(2.20)
So f is a nondecreasing function satisfying f(t)≥(1 +a)3 for a−γ < t < b+γ, and f(t)≥0 otherwise. LetM(w,t)=w[I(−w≤t <0)−I(0≤t <−w)]. Hence,
ESi,afSi,a)= n j=1 j=i
EYj,afSi,a−fSi,a−Yj,a
= n j=1 j=i
EYj,a 0
−Yj,a
fSi,a+tdt
= n j=1 j=i
EYj,a
RfSi,a+tI−Yj,a≤t <0−I0< t≤ −Yj,a dt
= n j=1 j=i
E
RfSi,a+tMYj,a,tdt
≥(1 +a)3 n j=1 j=i
E
Ia≤Si,a≤b
|t|≤γMYj,a,tdt
=(1 +a)3E
Ia≤Si,a≤b n j=1 j=i
Yj,aminγ,Yj,a
≥0.46(1 +a)3Pa≤Si,a≤b−PUi≤0.46),
(2.21)
whereUi=n
j=1,j=i|Yj,a|min(γ,|Yj,a|) and we have used the fact that It1≤w≤t2
y≥c
It1≤w≤t2
−
1−y c
I(y≤c)
(2.22)
fort1,t2,y≥0,c >0 in the last inequality. Hence, Pa≤Si,a≤b≤ 1
0.46(1 +a)3ESi,afSi,a
+PUi≤0.46. (2.23) Next, we will bound the two terms on the right-hand side of (2.23). By the same argument as that inProposition 2.1, we can show thatES4i,a≤3.69 andES2i,a≤1 + (3.8×10−8). So
ESi,afSi,a≤(b−a+ 2γ)ESi,aSi,a+ (1 +γ)3
≤4b−a+βaES4i,a+|1 +γ|3ESi,a
≤4b−a+βa
$
ES4i,a+1 +βa
2 3
ES2i,a
%
≤18.85b−a+βa
.
(2.24)
By the facts that min(a,b)≥b−b2/4afora,b >0, EXi2IXi≤1 +a≤
βa2/3<0.021, αa≤ 1
80(1 +a)2 ≤7.8×10−4 fora≥3, (2.25) we have
EUi= n j=1 j=i
EYj,aminγ,Yj,a
≥ n j=1 j=i
EY2j,a−EYj,a3 4γ
≥ n j=1
EX2jIXj<1 +a−EX2jIXj≥1 +a −β γ
=
j=1 j=i
EX2jIXj<1 +a−E2XjIXj≥1 +a−0.5
=1−EXi2IXi<1 +a−2 n j=1
EX2jIXi≥1 +a−0.5
≥1−(βa)2/3−2αa−0.5
≥0.477.
(2.26)
Using the same argument as inProposition 2.1(5), we can show that EUi−EUi4≤3.69γ4=0.231β4a≤4.512×10−7 βa
(1 +a)3. (2.27)
Hence,
PUi≤0.46≤PEUi−Ui≥0.477−0.46
=PEUi−Ui≥0.017
≤EUi−EUi4 (0.017)4
≤5.402βa
(1 +a)3.
(2.28)
From (2.19), (2.23), (2.24), and (2.28), Pa≤W(i)≤b≤ 40.978
(1 +a)3
b−a+βa
+5.402βa
(1 +a)3+ αa
(1 +a)2
≤40.98(b−a)
(1 +a)3 + 46.36δa.
(2.29) Proposition2.3. Forx≥2,
Efx(W)≤ 15
(1 +x)2, (2.30)
where fxis the unique solution of the Stein equation
f(w)−w f(w)=I(w≤x)−Φ(x). (2.31) Proof. From Stein [15, pages 22 and 24], we know that
0< fx(w)<1−Φ(x) forw≤0,
0< fx(x)≤1−Φ(x)1 +√2πwe(1/2)w2Φ(x) for 0< w≤x, fx(w)≤1 ∀w∈R.
(2.32)
Hence,
Efx(W)=Efx(W)I(W≤0) +Efx(W)I
0< W≤4x 5
+Efx(W)I
W >4x 5
≤
1−Φ(x)P(W≤0) +1−Φ(x)E1 +√2πWeW2/2I
0< W≤4x 5
+ E1 +W2 1 + (4x/5)2
≤
1−Φ(x)+1−Φ(x)
$
1 +4√2π 5 xe8x2/25
%
+ 2
1 + (4x/5)2.
(2.33)
Since
1−Φ(x)≤e√−(1/2)x2
2πx forx≥0, (2.34)
(see Stein [15, equation (25), page 23]) andex2/2> xforx≥2, we have 1−Φ(x)(1 +x)2≤ 1
√2πx2 (1 +x)2= 1
√2π 1
x+ 1 2
≤0.9, (2.35) which implies that
1−Φ(x)≤ 0.9
(1 +x)2. (2.36)
From (2.34) and the fact thate9x2/50>9x2/50, we derive
√2π1−Φ(x)(1 +x)2xe8x2/25≤e−9x2/50(1 +x)2
≤50 9
1 x+ 1
2
≤12.5,
(2.37)
that is,
4√2π 5
1−Φ(x)xe8x2/25≤ 10
(1 +x)2. (2.38)
From (2.33), (2.36), (2.38), and the fact that (1 +x)/(1 + 4x/5)≤5/4, we have proved the
proposition.
Proposition2.4. Letx≥14andg:R→Rdefined byg(w)=(w fx(w)). If(1 +x)2αx+ (1 +x)βx<1/80, then for|u| ≤1 +x/4,
EgW(i)+u≤ 4.60
(1 +x/4)3+ 5.13δx/4(1 +x). (2.39) Proof. From Chen and Shao [3, pages 248–249], we know that
g(x−1)=√
2π1 + (x−1)2e(x−1)2/2Φ(x−1) + (x−1)1−Φ(x), (2.40) gis increasing for 0≤w < x, and
EgW(i)+u≤ 2
1 +x3+ 21−Φ(x)+g(x−1) +EgW(i)+uIx−1< W(i)+u < x. (2.41) Forx≥14, elementary calculation yields
(1 +x)3
1 +x3 ≤1.23 (2.42)