• 検索結果がありません。

FOR THE SAMPLE COVARIANCE MATRIX

N/A
N/A
Protected

Academic year: 2022

シェア "FOR THE SAMPLE COVARIANCE MATRIX"

Copied!
4
0
0

読み込み中.... (全文を見る)

全文

(1)

Elect. Comm. in Probab. 5(2000) 73–76 ELECTRONIC

COMMUNICATIONS in PROBABILITY

A WEAK LAW OF LARGE NUMBERS

FOR THE SAMPLE COVARIANCE MATRIX

STEVEN J. SEPANSKI

Department of Mathematics, Saginaw Valley State University, 7400 Bay Road University Center, MI 48710

email: [email protected] ZHIDONG PAN

Department of Mathematics, Saginaw Valley State University, 7400 Bay Road University Center, MI 48710

email: [email protected]

submitted February 15, 1999Final version accepted March 20, 2000 AMS 1991 Subject classification: Primary 60F05; secondary 62E20, 62H12

Law of large numbers, affine normalization, sample covariance, central limit theorem, domain of attraction, generalized domain of attraction, multivariate t-statistic

Abstract

In this article we consider the sample covariance matrix formed from a sequence of indepen- dent and identically distributed random vectors from the generalized domain of attraction of the multivariate normal law. We show that this sample covariance matrix, appropriately nor- malized by a nonrandom sequence of linear operators, converges in probability to the identity matrix.

1. Introduction:

LetX, X1, X2· · · be iid Rd valued random vectors withL(X) full. The condition of fullness is the multivariate analogue of nondegeneracy and will be in force throughout this article.

It means that L(X) is not concentrated on anyd−1 dimensional hyperplane. Equivalently, hX, θiis nondegenerate for every θ.Hereh, idenotes the inner product.

Throughout this article all vectors inRdare assumed to be column vectors. For any matrix,A, At denotes its transpose. Let ¯Xn = 1nPn

i=1Xi. We denote and define the sample covariance matrix byCn = n1Pn

i=1(Xi−X¯n)(Xi−X¯n)t.That Cn has a unique nonnegative symmetric square root, denoted above byCn1/2,follows from the fact thathCnθ, θi=Pn

i=1hXi−X¯n, θi2 0,so that Cn is nonnegative. Also,Cn is clearly symmetric. However, there is no guarantee thatCn is invertible with probability one.

In [3] we describe two ways to circumvent the problem of lack of invertibility ofCn.One such approach is to define

Bn=

Cn ifCn is invertible

I otherwise (1.3)

The success of this approach relies on the fact that ifL(X) is in the Generalized Domain of Attraction of the Normal Law (see (1.6) below for the definition), thenP(Cn=Bn)1.(See

73

(2)

74 Electronic Communications in Probability

[3], Lemma 5.) In light of this, we will assume without loss of generality thatCn is invertible.

L(X) is said to be in the Generalized Domain of Attraction (GDOA) of the Normal Law if there exist matricesAn and vectorsvn such that

An

Xn i=1

Xi−vn ⇒N(0, I). (1.6) One construction ofAn is such thatAn is invertible, symmetric and diagonalizable. See Hahn and Klass [2].

The main result is Theorem 1 below. This result was shown in Sepanski [5]. However, there the proof was based on a highly technical comparison of the eigenvalues and eigenvectors of Cn andAn.There the proof was essentially real valued. The purpose of this note is to give a more efficient proof that is operator theoretic and multivariate in nature. For more details, we refer the interested reader to the original article. In particular, Sepanski [5] contains a more complete list of references.

2. Results

Theorem 1: If the law of X is in the generalized domain of attraction of the multivariate

normal law, then

nAnCn1/2→I in pr.

Proof: LetPn(ω) denote the empirical measure. That is, Pn(ω)(A) = n1Pn

i=1I[Xi(ω)∈A]. Here I is the indicator function. For eachω∈Ω letX1,· · ·Xn be iid with lawPn(ω).

Sepanski [4], Theorem 2, shows that under the hypothesis of GDOA, An

Xn j=1

Xj−nµ⇒N(0, I) in pr.

Sepanski [3], Theorem 1, shows that under the hypothesis of GDOA, (nCn)−1/2

Xn j=1

Xj−nµ⇒N(0, I) in pr.

These two results, together with the multivariate Convergence of Types theorem of Billingsley [1], imply that

(nCn)−1/2=BnRnAn, (1)

where Bn →I in pr.,andRn are (random) orthogonal. The proof of Theorem 1 is thereby reduced to showing thatRn→I in pr.However, convergence in probability is equivalent to every subsequence having a further subsequence which converges almost surely. This reduces the proof to a pointwise result about the behavior of the linear operators.

WriteAn=QnDnQtnwhereQn is orthogonal andDn is diagonal with nonincreasing diagonal entries. Let Pn =QnRnQtn andKn=QnBnQtn.

kKn−Ik=kQtnBnQn−QtnQnk ≤ kBn−Ik →0

By the same token,Rn→I if and only ifPn→I.Also, (nCn)−1/2is positive and symmetric and therefore so areBnRnAnandKnPnDn.The proof of Theorem 1 is reduced to the following lemma.

(3)

A Law of Large Numbers 75

Lemma 2: LetPn be orthogonal. LetDn = diag(λn1,· · ·, λnd) be diagonal such thatλn1 λn2 ≥ · · · ≥ λnd >0. Suppose Kn →I. If KnPnDn is positive and symmetric for every n, thenPn→I.

Proof: Given a subsequence ofPn we show that there is a further subsequence along which Pn →I.Let En=λ−1n1Dn. This is a diagonal matrix of all positive entries that are bounded above by 1. Therefore, given any subsequence, there is a further subsequence along which Kn→I, Pn→P,andEn→E.Necessarily,P is orthogonal andEis diagonal with entries in [0,1].Furthermore,Ehas at least one diagonal entry that is 1 and its entries are nonincreasing.

SinceKnPnEn is symmetric, nonnegative andKn →I,we have that P E=EPt, andP E is nonnegative. Now, (P E)2 = (P E)tP E =EP−1P E =E2. Hence, sinceP E and E are both nonnegative, P E = E. If E is invertible, then P = I and we are done. Suppose E is not invertible. Write E =

E(1) 0

0 0

where E(1) is an m×m invertible diagonal matrix with m < d. Next, writeP =

P(1) P(2)

P(3) P(4)

where P(1) is anm×m matrix. Since P E =E, we

have

P(1)E(1) 0 P(3)E(1) 0

=

E(1) 0

0 0

.

From P(1)E(1) =E(1) and the invertibility of E(1), we have that P(1) = Im. Similarly, from P(3)E(1)= 0 we have thatP(3)= 0.Therefore,P =

Im P(2) 0 P(4)

.Next, multiplyingP Pt,and PtP,and equating the (1,1) entries we have that Im+P(2)P(2)t =Im. From this we conclude thatP(2)P(2)t = 0,and therefore also,P(2)= 0.We have that,

P =

I 0 0 P(4)

.

The proof continues inductively. Let K(n4), P(n4), E(n4) be the (2,2) block of Kn, Pn, En

respectively. P(n4) may not be orthogonal, but P(4) is. Apply the previous argument to

K(n4)P(n4)P(4)t

P(4)E(n4).Note thatK(n4)P(n4)P(4)t →IP(4)P(4)t =I,so that we may apply the argument withK(n4)P(n4)P(4)t as the newKnin the induction step. Since the matrices are all finite dimensional, the argument will eventually terminate.

AcknowledgementWe would like to thank the referee for suggestions on how to shorten this article. The suggestions led to a much more efficient presentation of the material.

(4)

76 Electronic Communications in Probability

REFERENCES

[1] Billingsley, P. (1966). Convergence of types in k-space. Z. Wahrsch. Verw. Gebiete. 5 175-179.

[2] Hahn, M. G. and Klass, M. J. (1980). Matrix normalization of sums of random vectors in the domain of attraction of the multivariate normal. Ann. Probab. 8262-280.

[3] Sepanski, S.J. (1993). Asymptotic normality of multivariate t and Hotelling’sT2 statistics under infinite second moments via bootstrapping. J. Mutlivariate Analysis,4941-54 [4] Sepanski, S. J. (1994). Necessary and sufficient conditions for the multivariate bootstrap

of the mean. Statistics and Probability Letters,19No. 3, 205-216

[5] Sepanski, S. J. (1996). Asymptotics for the multivariate t-statistic for random vectors in the generalized domain of attraction of the multivariate law. Statistics and Probability Letters, 30179-188.

参照

関連したドキュメント

In Section 2 we show that, under a suitable exponen- tial tail condition, the Poisson shot noise process considered in this paper satisfies the sample path large deviations

By con- structing a single cone P in the product space C[0, 1] × C[0, 1] and applying fixed point theorem in cones, we establish the existence of positive solutions for a system

McIntosh and Halford ([8]) have shown that this condition can be weakened for the case of a metric of type (1,3), in that it is suffi- cient to demand that the dimension of the

We will show in section 2 that the matrix transformation corresponding to the regular convolution method generated by an independent and identically distributed sequence of

We use these to show that a segmentation approach to the EIT inverse problem has a unique solution in a suitable space using a fixed point

I¿ Yurchuk; The quasi-reversibility method for the problem of the control of an initial condition for the heat equation with an integral boundary condition, Differential Equations

The repeated homogeneous balance method is used to construct new exact traveling wave solutions of the (2+1) dimensional Zakharov- Kuznetsov (ZK) equation, in which the

Particularly, for the bounded domain case, the low Mach number limit of weak solutions to the compressible flow of liquid crystals was proved in [13], and Yang [15] firstly studied