Some inequalities in functional analysis, combinatorics, and probability theory

(1)

Some inequalities in functional analysis, combinatorics, and probability theory

Chunrong Feng

^∗

Liangpan Li

^†

Jian Shen

^‡

Submitted: Aug 21, 2009; Accepted: Mar 30, 2010; Published: Apr 5, 2010 Mathematics Subject Classification: 46C05, 05A20, 60C05, 11T99

Abstract

The main purpose of this paper is to show that many inequalities in functional analysis, probability theory and combinatorics are immediate corollaries of the best approximation theorem in inner product spaces. Besides, as applications of the de Caen-Selberg inequality, the finite field Kakeya and Nikodym problems are also studied.

Keywords: inner product space, orthogonal projection, Kakeya set, Nikodym set

1 Brief Introduction

Let (H, <·,·>) be an inner product space over R throughout. Given x∈H and a finite dimensional subspace M, denote by xM the orthogonal projection of x onto M. It is geometrically evident that (we always assume ⁰₀ = 0 in this paper)

kxk² >kxMk² = max

y∈M

< x_M, y >²

kyk² = max

y∈M

< x, y >²

kyk² . (1)

Particularly, ifM = span{yi}ⁿ_i=1 for some given set of elements y1, . . . , yn, then kxk² > max

(α1,...,αn)∈Rⁿ

< x,Pn

i=1αiyi >² kPn

i=1αiyik² . (2)

∗Department of Mathematics, Shanghai Jiao Tong University, Shanghai 200240, China & Department of Mathematical Sciences, Loughborough University, Leics, LE11 3TU, UK. E-mail: [email protected].

Research was supported by the Mathematical Tianyuan Foundation of China (No. 10826090).

†Department of Mathematics, Shanghai Jiao Tong University, Shanghai 200240, China. E-mail: lil- [email protected]. Research was supported by the Mathematical Tianyuan Foundation of China (No. 10826088).

‡Department of Mathematics, Texas State University, San Marcos, TX 78666, USA. E-mail:

[email protected]. Research was supported by NSF (CNS 0835834) and Texas Higher Education Co- ordinating Board (ARP 003615-0039-2007).

(2)

The main purpose of this paper is to show that many inequalities in functional analysis, probability theory and combinatorics are immediate corollaries of (2). For the sake of completeness we determine the unique orthogonal projection xM (many authors of text- books on functional analysis only dealt the case when {yi}ⁿ_i=1 are linear independent).

Write xM =Pn

i=1βiyi for some (β1, . . . , βn)∈Rⁿ. Since the smooth function Ψ(α1, . . . , αn) .

=kx−

n

X

i=1

αiyik² =kxk²−2

n

X

i=1

αi < x, yi >+

n

X

i=1 n

X

j=1

αiαj < yi, yj >

attains its minimum d(x, M)² at (β1, . . . , βn),

∂Ψ

∂α_i(β1, . . . , βn) = 0 (i= 1,2, . . . , n).

Equivalently,







< y₁, y₁ > < y₁, y₂ > · · · < y₁, y_n>

< y2, y1 > < y2, y2 > · · · < y2, yn>

... ... . .. ...

< yn, y1 > < yn, y2 > · · · < yn, yn >











 β₁ β2

... βn







=







< x, y₁ >

< x, y2 >

...

< x, yn>







. (3)

If (γ1, . . . , γn)∈Rⁿ is another solution to (3), then

n

X

i=1

(βi−γi)yi

2 = (β1−γ1,· · · , βn−γn)(< yi, yj >)n×n







β1−γ1

...

βn−γn







= (β1−γ1,· · · , βn−γn)





 0... 0





= 0.

Consequently xM =Pn

i=1βiyi=Pn i=1γiyi.

Among many inequalities will be discussed later, we show particular interest in the de Caen-Selberg inequality [1, 2]:

n

[

i=1

Ai

>

n

X

i=1

|Ai|²

n

X

j=1

|Ai∩Aj|

, (4)

where {Ai}ⁿ_i=1 are finite sets. In Section 5 we will present some applications of the de Caen-Selberg inequality to the study of the finite field Kakeya and Nikodym problems in classical analysis.

(3)

2 Inequalities in Functional Analysis

2.1 Known inequalities

For any (α1, . . . , αn) ∈ Rⁿ, by (2) and the Cauchy-Schwarz inequality (|αiαj| 6 ^α

2i+α²_j 2 ) one obtains the Pe˘cari´c inequality [13]

kxk² >

n

X

i=1

αi < x, yi>2

n

X

i=1 n

X

j=1

α²_i|< yi, yj >|

. (5)

(The following arguments are standard [13]) Substitutingαi = ^Pⁿ^<x,yⁱ^>

k=1|<yi,yk>|into (5) yields the Selberg inequality [1]

kxk² >

n

X

i=1

< x, yi >²

n

X

j=1

|< yi, yj >|

. (6)

Substituting αi = sgn(< x, yi >) into (5) or applying the Cauchy-Schwarz inequality from (6) yields the Heilbronn inequality [10]

kxk² >

Xⁿ

i=1

|< x, yi >|2

n

X

i=1 n

X

j=1

|< yi, yj >|

. (7)

The Selberg inequality (6) is certainly stronger than the Bombieri inequality [1]

kxk² >

n

X

i=1

< x, yi >²

16maxi6n n

X

j=1

|< yi, yj >|

. (8)

If {yi}ⁿ_i=1 are orthogonal, then the Selberg inequality (6) turns out to be the classical Bessel inequality

kxk² >

n

X

i=1

< x, yi >²

< yi, yi >. (9)

Substituting αi = 1 into (2) yields the Chung-Erd˝os inequality [3]

kxk² >

Xⁿ

i=1

< x, yi >2

n

X

i=1 n

X

j=1

< yi, yj >

. (10)

(4)

In a partial summary,

(2)≻(5)≻(6)≻(7),

where (•)≻(••) means Estimate (•) is stronger than Estimate (••).

3 From Functional Analysis to Combinatorics

3.1 Immediate corollaries

In this section we always choose H = l². Let A, B be finite subsets of N and χ_A, χ_B be the corresponding indictor functions. Then

< χA, χB >=|A∩B|,

and χA, χB are orthogonal means A, B are disjoint sets. Given finite subsets {Ai}ⁿ_i=1 of N, define yi =χAi (i∈[n]) and x=χ∪iAi. Then < x, yi >=|(∪jAj)∩Ai|=|Ai|. By (2) and (3), we obtain

Theorem 3.1.

n

[

i=1

Ai

> max

(α₁,...,αn)∈Rⁿ

n

X

i=1

αi|Ai|2

n

X

i=1 n

X

j=1

αiαj|Ai∩Aj|

=

n

X

i=1 n

X

j=1

βiβj|Ai∩Aj|, (11)

where (β1, . . . , βn)∈Rⁿ is any solution to







|A1∩A1| |A1∩A2| · · · |A1∩An|

|A2∩A1| |A2∩A2| · · · |A2∩An| ... ... . .. ...

|An∩A1| |An∩A2| · · · |An∩An|











 β1

β2

... βn







=







|A1|

|A2| ...

|An|







. (12)

Note in this context the Selberg inequality (6) turns out to be the de Caen inequality (4) and the Bessel inequality (9) turns out to be a trivial equality. Also note that

sup

αi>0

n

X

i=1

αi|Ai|2

n

X

i=1 n

X

j=1

αiαj|Ai∩Aj|

= sup

αi>0

n

X

i=1

αi|Ai|2

n

X

i=1 n

X

j=1

α²_i|Ai∩Aj|

= sup

αi>0 n

X

i=1

αi|Ai|²

n

X

j=1

αj|Ai∩Aj| .

(5)

3.2 A slightly different variant

In this subsection, we provide a slightly different variant of (12).

Theorem 3.2. The following matrix equation always has a solution

|Ai∩Aj|

|A_i||A_j|

n×n





 q1

q₂ ... qn







=





 1 1... 1







; (13)

any solution to (13) satisfies

n

X

i=1

q_i = max

(α1,...,αn)∈Rⁿ

n

X

i=1

αi|Ai|2

n

X

i=1 n

X

j=1

αiαj|Ai∩Aj|

. (14)

Proof. Write P = ^|A_|Aⁱ^∩A^j^|

i||Aj|

n×n, Q = |Ai ∩Aj|

n×n and R = diag(1/|A1|, . . . ,1/|An|).

Obviously, P =RQR, Q=R⁻¹P R⁻¹. Let (β1, . . . , βn)∈Rⁿ be a solution to (12). Then

P







β1|A1| β₂|A₂|

...

β_n|A_n|







=RR⁻¹P R⁻¹





 β1

β₂ ...

β_n







=RQ





 β1

β₂ ...

β_n







=R







|A1|

|A₂| ...

|A_n|







=





 1 1 ...

1





 .

This solves the existence. Suppose (q1, q2,· · · , qn)^T is a solution to (13), that is,

RQR





 q1

q2

...

qn







=





 1 1...

1







⇔Q







q1/|A1| q2/|A2|

...

qn/|An|







=







|A1|

|A2| ...

|An|





 .

By (11), (12) and (13),

(α1,...,αmaxn)∈Rⁿ

n

X

i=1

αi|Ai|2

n

X

i=1 n

X

j=1

αiαj|Ai∩Aj|

=

n

X

i=1 n

X

j=1

qi

|Ai| · qj

|Aj| · |Ai∩Aj|

= (q1, q2,· · · , qn)P





 q1

q2

... qn







= (q1, q2,· · · , qn)





 1 1... 1







=

n

X

i=1

qi.

So we get (14). This concludes the whole proof.

(6)

3.3 A combinatorial proof

In this subsection, we provide a combinatorial proof for the inequality in (11) to help understand the equality case. To achieve the goal we need only prove

n

[

i=1

Ai

>

n

X

i=1

αi|Ai|2

n

X

i=1 n

X

j=1

αiαj|Ai∩Aj| .

holds for all integral weights αi ∈Z such that Pn

i=1αi|Ai|>0. Suppose this is the case.

Let U =∪ⁿ_i=1Ai and χi be the indicator function of Ai. Define f(x) =Pn

i=1αiχi(x) and for all k ∈Z,

U^k .

={x∈U :f(x) =k}, A^k_i .

=Ai∩U^k. Obviously, f =P

k∈Zkχ_U^k. Note

n

X

i=1

αi|A^k_i|=

n

X

i=1

αi

Z

U

χ_A_i_∩U^k =

n

X

i=1

αi

Z

U

χi·χ_U^k = Z

U

f·χ_U^k =k· |U^k|, (15) and

X

k∈Z

k|A^k_i|=X

k∈Z

k Z

U

χ_i·χ_U^k = Z

Ai

X

k∈Z

kχ_U^k = Z

Ai

n

X

j=1

α_jχ_j =

n

X

j=1

α_j|A_i∩A_j|, (16) here the integration means R

Ug =P

x∈Ug(x). By (15),

|U|=X

k∈Z

|U^k|>X

k6=0

Pn

i=1αi|X_i^k|

k .

Now we need an inequality: for all r, s >0 one has 1

s > 2 r − s

r²

⇔(1 s − 1

r)² >0 . By (15) again,Pn

i=1αi|A^k_i|and k have the same sign, and consequently for r >0, Pn

i=1αi|A^k_i| k >

₂

r

Pn

i=1αi|A^k_i| − _r^k2

Pn

i=1αi|A^k_i| if k > 0

−²_rPn

Pn

i=1αi|A^k_i| if k < 0

> 2

r

n

X

i=1

αi|A^k_i| − k r²

n

X

i=1

αi|A^k_i| if k 6= 0.

Recall that ²_rPn

Pn

i=1αi|A^k_i|= 0 when k = 0. By (16),

|U|>X

k∈Z

2 r

n

X

i=1

α_i|A^k

i| − k r2

n

X

i=1

α_i|A^k

i|

!

= 2 r

n

X

i=1

α_i|A_i| − 1 r2

n

X

i=1 n

X

j=1

α_iα_j|A_i∩A_j| .

=W(r).

(7)

Finally,

|U|>max

r>0 W(r) =W(r^∗) =

n

X

i=1

αi|Ai|2

n

X

i=1 n

X

j=1

αiαj|Ai∩Aj| ,

where r^∗ = (Pn

i=1αi|Ai|)/(Pn i=1

Pn

j=1αiαj|Ai∩Aj|). This concludes the whole proof. A byproduct of this proof is the following characterization of the equality case:

n

[

i=1

Ai

=

n

X

i=1

αi|Ai|2

n

X

i=1 n

X

j=1

αiαj|Ai∩Aj|

⇔

n

X

i=1

αiχi(x) Sn

i=1Ai

is a non-zero constant function.

4 From Functional Analysis to Probability Theory

4.1 Finitely many events

In this section we chooseHto be theL²space of the given probability space (Ω,F, P). Let E, F be two events andχE, χF be the corresponding indicator functions. It is well-known that Hilbert space theory and probability theory are intimately connected by

< χE, χF >=P(E∩F).

Note χ_E, χ_F are orthogonal means E, F are disjoint. Given events {E_i}ⁿ_i=1, define y_i = χEi (i ∈ [n]) and x = χ∪iEi. By (2) and (3), we extend the Gallot-Kounias inequality [9, 11] to its full generality in the following form.

Theorem 4.1 (Gallot-Kounias).

P(

n

[

i=1

Ei)> max

(α1,...,αn)∈Rⁿ

n

X

i=1

αiP(Ei)2 n

X

i=1 n

X

j=1

α_iα_jP(E_i∩E_j)

=

n

X

i=1 n

X

j=1

γiγjP(Ei∩Ej), (17)

where (γ1, . . . , γn)∈Rⁿ is any solution to







P(E1∩E1) P(E1∩E2) · · · P(E1∩En) P(E2∩E1) P(E2∩E2) · · · P(E2∩En)

... ... . .. ...

P(En∩E1) P(En∩E2) · · · P(En∩En)











 γ1

γ2

...

γn







=







P(E1) P(E2)

...

P(En)







. (18)

(8)

To the authors’ knowledge, it seems that the Gallot-Kounias inequality, being discov- ered 40 years ago, was almost forgotten by Mathematicians. Gallot and Kounias originally expressed their results in terms of generalized inverse of matrices, and this may prevent their results from being appreciated by others. So we restate their results in a more natural way in Theorem 4.1. Note in this context (10) turns out to be the original Chung-Erd˝os inequality [3]

P(

n

[

i=1

Ei)>

n

X

i=1

P(Ei)2 n

X

i=1 n

X

j=1

P(Ei∩Ej)

, (19)

and the Bessel inequality (9) turns out to be a trivial equality. Also note that

sup

αi>0

n

X

i=1

αiP(Ei)2 n

X

i=1 n

X

j=1

αiαjP(Ei∩Ej)

= sup

αi>0

n

X

i=1

αiP(Ei)2 n

X

i=1 n

X

j=1

α²_iP(Ei∩Ej)

= sup

αi>0 n

X

i=1

αiP(Ei)²

n

X

j=1

αjP(Ei∩Ej) .

Similar to Theorem 3.2 one can establish the following theorem.

Theorem 4.2. The following matrix equation always has a solution

P(Ei∩Ej) P(E_i)P(E_j)

n×n





 q1

q₂ ...

qn







=





 1 1 ...

1







; (20)

any solution to (20) satisfies

n

X

i=1

qi = max

(α₁,...,αn)∈Rⁿ

n

X

i=1

α_iP(E_i)2 n

X

i=1 n

X

j=1

αiαjP(Ei∩Ej)

. (21)

4.2 Borel-Cantelli lemma

Let {Ei}^∞_i=1 be infinitely many events on the probability space (Ω,F, P). The Borel- Cantelli lemma states that: (a) if P∞

i=1P(E_i) < ∞, then P(lim supE_i) = 0; (b) if P∞

i=1P(Ei) = ∞ and {Ei}^∞_i=1 are mutually independent, then P(lim supEi) = 1. Here lim supEi = ∩^∞_i=1∪^∞_k=i Ek. The Borel-Cantelli lemma played an exceptionally important role in probability theory, and many investigations were devoted to the second part of the Borel-Cantelli lemma in the attempt to weaken the independence condition on {Ei}^∞_i=1.

(9)

Towards this question, Erd˝os and R´enyi [6, 14] obtained a nice result closely related to (19): if P∞

i=1P(Ei) =∞, then

P(lim supEi)>lim sup

n→∞

n

X

k=1

P(Ek)2 n

X

i=1 n

X

j=1

P(E_i∩E_j)

. (22)

Recently, by carefully studying the effect of the denominator in the right hand of (22), the authors [8] established a weighted version of the Erd˝os-R´enyi theorem which states:

Theorem 4.3 (Feng-Li-Shen). If P∞

i=1αiP(Ei) =∞, then

P(lim supEi)>lim sup

n→∞

n

X

k=1

αkP(Ek)2 n

X

i=1 n

X

j=1

αiαjP(Ei∩Ej)

. (23)

5 Applications of the de Caen-Selberg Inequality

5.1 The finite field Kakeya set

LetF_q denote a finite field of q elements. Define a setK ⊂Fⁿ_q to be Kakeya if it contains a translate of any given line. The finite field Kakeya problem, posed by Wolff in his influential survey [17], conjectured that|K|>Cnqⁿ holds for some constantCn. Recently, using the polynomial method in algebraic extremal combinatorics, Dvir [4] completely confirmed this conjecture by proving

|K|>

n+q−1 n

. (24)

Ifn = 2, it is well-known that (24) is sharp [7] and can be established by a simple counting argument [15]. For n>3, see [16] for further improvement.

Similarly, we say a subsetE ⊂Fⁿ

q is an (n, k)-set if it contains a translate of any given k-plane. Ellenberg, Oberlin and Tao [5] proved that if 26k < n, then

|E|>qⁿ− n

2

q^n−k+1+o(q^n−k+1) (q→ ∞). (25)

Using the de Caen-Selberg inequality we can slightly improve (25) when k=n−1>2.

Theorem 5.1. Any (n, n−1)-set E ⊂Fⁿ_q (n >3) satisfies

|E|>qⁿ−q²+o(q²) (q→ ∞), where F_q denotes a finite field of q elements.

(10)

Proof. Since the total number s of (n−1)-dimensional hyperplanes passing through the origin equals the total number of lines passing through the origin,

s= qⁿ−1 q−1 .

Let {Pi}^s_i=1 be such hyperplanes. By the de Caen-Selberg inequality (4),

|E|>

s

X

i=1

|Pi|²

s

X

j=1

|Pi∩Pj|

> s·q²ⁿ⁻²

qⁿ⁻¹+ (s−1)qⁿ⁻²

= s·q²ⁿ⁻²+qⁿ(qⁿ⁻¹−qⁿ⁻²)−qⁿ(qⁿ⁻¹−qⁿ⁻²) (qⁿ⁻¹−qⁿ⁻²) +s·qⁿ⁻²

=qⁿ− qⁿ(qⁿ⁻¹−qⁿ⁻²) qⁿ⁻¹+ (s−1)qⁿ⁻²

=qⁿ−q²+o(q²) (q → ∞).

5.2 The finite field Nikodym set

Define a set B ⊂ Fⁿ

q to be Nikodym if for each z ∈ B^c there exists a line Lz passing through z such that Lz\{z} ⊂ B. Obviously, all such lines {Lz}z∈B^c are different from each other. Similar to (24) Li [12] proved (i)

|B|>

n+q−2 n

; (26)

(ii) any two-dimensional Nikodym set B ⊂F²

q satisfies

|B|> 2q²

3 +O(q) (q → ∞). (27)

Using the de Caen-Selberg inequality we can improve (27) substantially as follows, which shows some difference between the two-dimensional Kakeya sets and Nikodym sets.

Theorem 5.2. Any Nikodym set B ⊂F²

q satisfies

|B|>q²−q^3/2−q,

where F_q denotes a finite field of q elements.

Proof. Lets =|B^c|. By the de Caen-Selberg inequality (4), q²−s=|B|>

[

z∈B^c

Lz\{z}

>

s

X

i=1

(q−1)²

(q−1) +s−1 = s(q−1)² s+q−2.

(11)

Equivalently,

s²−(q+ 1)s−q²(q−2)60.

Hence

|B|=q² −s >q²− q+ 1 +p

(q+ 1)²+ 4q²(q−2)

2 >q²−q^3/2−q.

We thank a referee for many valuable suggestions leading to the clear presentation of the paper.

References

[1] E. Bombieri, A note on the large sieve. Acta Arith. 18 (1971) 401–404.

[2] D. de Caen, A lower bound on the probability of a union. Discrete Math.169 (1997) 217–220.

[3] K. L. Chung, P. Erd˝os,On the application of the Borel-Cantelli lemma. Trans. Amer.

Math. Soc. 72 (1952) 179–186.

[4] Z. Dvir, On the size of Kakeya sets in finite fields. To appear in J. Amer. Math. Soc.

[5] J. S. Ellenberg, R. Oberlin, T. Tao, The Kakeya set and maximal conjectures for algebraic varieties over finite fields. Preprint.

[6] P. Erd˝os, A. R´enyi, On Cantor’s series with convergent P

1/qn. Ann. Univ. Sci.

Budapest. E˝otv˝os Sect. Math. 2 (1959) 93–109.

[7] X. W. C. Faber, On the finite field Kakeya problem in two dimensions, J. Number Theory 124 (2007) 248–257.

[8] C. Feng, L. Li, J. Shen,On the Borel-Cantelli lemma and its generalization. Comptes Rendus Mathematique 347 (2009) 1313–1316.

[9] S. Gallot,A bound for the maximum of a number of random variables. J. Appl. Prob.

3 (1966) 556–558.

[10] H. Heilbronn,On the averages of some arithmetical functions of two variables. Math- ematika 5 (1958) 1–7.

[11] E. G. Kounias,Bounds for the probability of a union, with applications. Ann. Math.

Statist. 39 (1968) 2154–2158.

[12] L. Li,On the size of Nikodym sets in finite fields. Preprint.

[13] J. E. Pe˘cari´v, On some classical inequalities in unitary spaces. Mat. Bilten42 (1992) 63–72.

[14] A. R´enyi,Probability Theory. North-Holland Series in Applied Mathematics and Me- chanics, Vol. 10. North-Holland Publishing Co., Amsterdam-London, 1970; German version 1962, French version 1966, new Hungarian edition 1965.

(12)

[15] K. M. Rogers, The finite field Kakeya problem, Amer. Math. Monthly 108 (2000) 756–759.

[16] S. Saraf, M. Sudan,Improved lower bound on the size of Kakeya sets over finite fields.

Preprint.

[17] T. Wolff,Recent work connectecd with the Kakeya problem. Prospects in Mathematics (Princeton, NJ, 1996), Amer. Math. Soc. (1999) 129–162.