pdf Research Kengo Kato

(1)

Absolute continuity of suprema of Gaussian processes¹ Kengo Kato The purpose of this note is to provide details on some parts of Chapter 11 in [1] with “self-contained” proofs.

1. Distributions of suprema of Gaussian processes

Let (Ω, A, P) be a complete probability space. Let Xt, t ∈ T be a Gaussian process indexed by a separable semimetric space T . We assume that X_t, t ∈ T is centered (E[Xt] = 0, ∀t ∈ T ) and moreover separable, i.e., there exist null set N and countable subset S ⊂ T such that for every ω /∈ N and t ∈ T , there exists a sequence tm in S with tm → t such that Xtm(ω) → Xt(ω).² In this note, we are interested in the distribution of

sup

t∈T

X_t. (1)

Since X_t is separable, the above supremum can be replaced by sup_t∈SX_t, and hence (1) is a random variable. We further assume that

sup

t∈T

X_t< ∞ a.s., and E[Xt²0] > 0, ∃t₀ ∈ T. (2) The latter assumption is to exclude the trivial case where X_t = 0, ∀t ∈ T a.s., which is clearly not of interest. Let F be the distribution function of sup_t∈TX_t, i.e.,

F (r) = P(sup

t∈T

X_t≤ r). Define

r0 = inf{r : F (r) > 0},

that is, r0is the left endpoint of the support of the distribution of sup_t∈T Xt. The following is taken from Theorem 11.1 of [1] (not in full strength). Theorem 1. The distribution function F is absolutely continuous on (r₀, ∞), and the derivative F^′, which exists on (r0, ∞) except on an at most countable subset ∆ ⊂ (r₀, ∞), is positive on (r₀, ∞)\∆.

The function F may have a jump at r₀. Denote by q its jump size, i.e., q = F (r₀).

Note that q < 1 (because F is strictly increasing on (r0, ∞) by Theorem 1). The following corollary, which is important in statistical applications, is a direct consequence of Theorem 1.

Corollary 1. The quantile function F⁻¹ is continuous and strictly increasing on (q, 1).

1Incomplete.

2It is well known that, as long as T is separable, every stochastic process indexed by T has a separable version possibly taking values in the extended real line. See [2], p.150-154.

1

(2)

Remark 1. The q may take any value in [0, 1) even under the assumption (2). See Example 3.2 in [3].

2. Log-concavity of Gaussian measures For A, B ⊂ Rⁿ and λ ∈ R, we write

λA = {λx : x ∈ A}, A + B = {x + y : x ∈ A, y ∈ B}.

Theorem 2. Let γn be the canonical Gaussian measure on Rⁿ^{. Then for} all Borel sets A, B ⊂ Rⁿ and λ ∈ [0, 1],

γ_n(λA + (1 − λ)B) ≥ γ_n(A)^λγ_n(B)^1−λ.

Generally, a Borel probability measure µ on Rⁿis called log-concave if for all Borel sets A, B ⊂ Rⁿ and λ ∈ [0, 1],

µ(λA + (1 − λ)B) ≥ µ(A)^λµ(B)^1−λ.

Hence Theorem 2 says that the canonical Gaussian measure γ_n is log- concave. This theorem is a direct consequence of the following Pr´ekopa- Leindler inequality:

Theorem 3 (Pr´ekopa-Leindler). Let f, g and h be non-negative, integrable functions on Rⁿ, and let λ ∈ [0, 1]. If for all x, y ∈ Rⁿ^,

h(λx + (1 − λ)y) ≥ f^λ(x)g^1−λ(y), then we have

∫

Rⁿ

h ≥ (∫

Rⁿ

f )λ(∫

Rⁿ

g )1−λ

.

Proof of Theorem 2. In Theorem 3, take f = φ_n1_A, g = φ_n1_B and h = φ_n1_{λA+(1−λ)B} where φ_n(x) = (2π)^−n/2e^−|x|²^/2_{, x ∈ R}ⁿ.. Since log φ_n is concave, these f, g, h verify the hypothesis of Theorem 3, so that the desired

conclusion follows. _□

Proof of Theorem 3. We only need to consider the case where 0 < λ < 1. The proof is by induction. Suppose that n = 1. By the hypothesis of the theorem, we have for t > 0,

C_t:= {x : h(x) > t} ⊃ {x : f (λ⁻¹x) > t}+{x : g((1−λ)⁻¹x) > t} =: A_t+B_t, so that with µ denoting the Lebesgue measure on R,

∫

R

h(x)dx =

∫

R

∫ _∞

0

1(t < h(x))dtdx =

∫ _∞

0

µ(C_t)dt

≥

∫ _∞

0

µ(A_t+ B_t)dt. (3)

We shall prove the following lemma. Lemma 1. For all Borel sets A, B ⊂ R,

µ(A + B) ≥ µ(A) + µ(B).

(3)

Proof of Lemma 3. We first prove the lemma when A and B are compact. We may assume that A and B are non-empty. Since A + B ⊃ (sup A + B) ∪ (A + inf B) and (sup A + B) ∩ (A + inf B) = {sup A + inf B}, we have µ(A + B) ≥ µ(sup A + B) + µ(A + inf B) = µ(A) + µ(B).

We now prove the lemma for general Borel subsets A, B of R. By regular- ity of the Lebesgue measure, there exist sequences A_m and B_m of compact subsets of R with Am ^{⊂ A and B}m ⊂ B such that µ(A_m) ↑ µ(A) and µ(Bm) ↑ µ(B). Then

µ(A + B) ≥ µ(Am+ Bm) ≥ µ(Am) + µ(Bm).

Taking the limit, we obtain the desired conclusion. _□ We now go back to the proof of Theorem 3. By Lemma 3, we have (3) ≥

∫ ∞ 0

µ(At)dt+

∫ ∞ 0

µ(Bt)dt = λ

∫

R

f +(1−λ)

∫

R

g ≥ (∫

f )λ(∫

g )1−λ

. Suppose that the lemma holds up to some n and we would like to show that it holds for n + 1. By assumption, for x, y ∈ Rⁿ, u, v ∈ R, and λ ∈ [0, 1],

h(λx + (1 − λ)y, λu + (1 − λ)v) ≥ f^λ(x, u)g^1−λ(y, v). For a while fix u, v and λ, and let us define

h₁(x) = h(x, λu + (1 − λ)v), f₁(x) = f (x, u), g₁(x) = g(x, v). Then by the induction hypothesis,

∫

Rⁿ

h₁ ≥ (∫

Rⁿ

f₁ )λ(∫

Rⁿ

g₁ )1−λ

. (4)

Define h₂(u) =

∫

Rⁿ

h(x, u)dx, f₂(u) =

∫

Rⁿ

f (x, u)dx, g₂(u) =

∫

Rⁿ

g(x, u)dx. Then the inequality (4) implies that

h2(λu + (1 − λ)v) ≥ f₂^λ(u)g₂^1−λ(v), so that by the induction hypothesis,

∫

Rⁿ⁺¹

h =

∫

R

h₂ ≥ (∫

R

f₂ )λ(∫

R

g₂ )λ

= (∫

Rⁿ⁺¹

f )λ(∫

Rⁿ⁺¹

g )1−λ

.

This completes the proof. _□

3. Proof of Theorem 1 We begin with proving the following lemma. Lemma 2. The function log F is concave on (r₀, ∞).

(4)

Proof of Lemma 2. Let us write S = {t₁, t₂, . . . }. Then

1≤i≤nmax ^X^tⁱ ^{→ sup}_t∈T ^X^t, a.s., n → ∞. (5) Denote by Fn the distribution function of max1≤i≤nXti. For a while fix n, and denote by Γ the covariance matrix of (Xt1, . . . , Xtn)^T. For Z ∼ N (0, In), we have

(Xt1, . . . , Xtn)^{T d}= Γ^1/2Z.

Let r₁, r₂ ∈ R and λ ∈ [0, 1], and set A = {x ∈ Rⁿ^{: max}1≤i≤n^(Γ^1/2^x)i ^{≤ r}1^}

and B = {x ∈ Rⁿ^{: max}1≤i≤n^(Γ^1/2^x)i^{≤ r}2^{}. Since}

λA + (1 − λ)B ⊂ {x ∈ Rⁿ^{: max}

1≤i≤n^(Γ 1/2_x)

i ^{≤ λr}1^{+ (1 − λ)r}2^},

by Theorem 2, we conclude that

F_n(λr₁+ (1 − λ)r₂) ≥ F_n(r₁)^λF_n(r₂)^1−λ. (6) By (5), F_n(r) → F (r) as n → ∞ for every continuity point of F . Denote by D the set of jump points of F . The set D is countable. Choose and fix r₁, r₂ ∈ R\D with r1 ^{̸= r}2, and let Λ_D = {λ ∈ [0, 1] : λr₁+ (1 − λ)r₂ ∈ D}. The set Λ_D is also countable. Taking n → ∞ in (6), we have for all λ ∈ [0, 1]\Λ_D,

F (λr1+ (1 − λ)r2) ≥ F (r1)^λF (r2)^1−λ. (7) We shall verify that the above inequality holds for all λ ∈ [0, 1]. Indeed, for λ ∈ ΛD, take a sequence λm in [0, 1]\ΛD with λm → λ such that λmr1+ (1 − λ_m)r₂ ↓ λr₁+ (1 − λ)r₂. Then by the right continuity of F , we see that (7) also holds for such λ. In the similar manner, we see that (7) holds for all r₁, r₂ ∈ R. Therefore, by taking logarithm of both the sides of (7), we

obtain the desired conclusion. _□

We recall the following (well-known) fact on convex/concave functions. Lemma 3. Let f : (a, b) → R be a convex (or concave) function (a < b; a = −∞ and b = ∞ are allowed). Then f is locally absolutely continuous on (a, b), i.e., f is absolutely continuous on each compact subinterval of (a, b). Proof of Lemma 3. We only need to consider convex f . Take any compact subinterval [a₁, b₁] ⊂ (a, b), and moreover take a < a₂ < a₁ < b₁ < b₂ < b. Since f is convex, for all x, y ∈ [a1, b1] with x ̸= y,

f (a₁) − f (a₂) a₁− a₂ ^≤

f (y) − f (x) y − x ^≤

f (b₂) − f (b₁) b₂− b₁ ^, so that

|f (y) − f (x)| ≤ |y − x| × max {

f (a1) − f (a2) a₁− a₂

,

f (b2) − f (b1) b₂− b₁

} . This implies that f is Lipschitz continuous on [a₁, b₁]. Every Lipschitz continuous function on a compact interval is absolutely continuous on the interval, so that the desired conclusion follows. _□

(5)

Proof of Theorem 1. Let G = log F so that F = e^G. By Lemma 2, G is concave on (r₀, ∞). By Lemma 3, G is locally absolutely continuous on (r0, ∞), and so is F . Since F is a probability distribution function, F is absolutely continuous on (r₀, ∞).

To prove the second assertion, we first verify that G is strictly increasing. Observe that for t₀ ∈ T such that E[Xt²0] > 0 (such t₀ is assumed to exist), F (r) ≤ P(Xt0 ≤ r) < 1 for all r ∈ R. Suppose on the contrary that there exist points r₂ > r₁(> r₀) such that F (r₁) = F (r₂) =: p. Note that p < 1, and take r₃> r₂ such that F (r₃) > p. Write r₂ as a convex combination of r1 and r3: r2 = λr1+ (1 − λ)r3 for some λ ∈ (0, 1). Then

G(r₂) = log p < λG(r₁) + (1 − λ)G(r₃), which contradicts concavity of G.

By concavity of G, it is routine to see that the map t 7→ G(r + t) − G(r)

t , (0, ∞) → R,

is non-increasing, so that the right derivative G^′₊(r) exists and is positive (the latter follows from the fact that G is strictly increasing), i.e.,

G^′₊(r) = lim

t↓0

G(r + t) − G(r) t ^{> 0.}

The G^′₊(r) is finite and map r 7→ G^′₊(r) is non-increasing. Hence G^′₊is continuous except on at most countably many points, which implies that, except on at most countably many points, G is differentiable and its derivative is

positive. This completes the proof. _□

References

[1] Davydov, Y., Lifshits, M. and Smorodina, N. (1998). Local Properties of Distributions of Stochastic Functions (Transaction of Mathematical Monographs, Vol. 173). American Mathematical Society.

[2] Gikhman, I.I. and Skorohod, A.V. (1996). Introduction to the Theory of Random Functions. Dover.

[3] Hoffman-Jorgensen, J., Shepp, L.A. and Dudley, R.M. (1979). On lower tails of Gaussian seminorms. Ann. Probab. 7 319-342.