Asymptotic expansions of the null distributions of three test statistics in a nonnormal GMANOVA model
Hirokazu Yanagihara
(Received August 22, 2000) (Revised November 11, 2000)
Abstract. This paper deals with three test statistics for testing a linear hypothesis and estimators of regression coe½cients in the GMANOVA model which was proposed by Potthof and Roy (1964), without assuming normal error. The test statistics considered include the likelihood ratio statistic, the Lawley-Hotelling trace criterion and the Bartlett-Nanda-Pillai trace criterion, which have been proposed under normality. We obtain asymptotic expansions of the null distributions of three test statistics up to the ordernÿ1, wherenis the sample size. The results are generalizations of the formulas in Wakaki, Yanagihara and Fujikoshi (2000). In addition, asymptotic expansions of the distribution functions of several standardized statistics on regression coe½cients are derived.
1. Introduction
The GMANOVA model considered is de®ned by
Y AXX0E; 1:1
where Y y1;. . .;yn0 is an np observation matrix of response variables, A a1;. . .;an0 is an nk between-individuals design matrix of explanatory variables with full rank k, X is a pq within-individuals design matrix of explanatory variables with full rank q ap, X is a kq unknown parameter matrix and E e1;. . .;en0 is an np error matrix. It is assumed that each vector ej is i.i.d., i.e., independently and identically distributed with E ej 0 and Cov ej S. This model can be applied to analysis of growth curve data, and hence it is also called the growth curve model.
We consider to test for a general linear hypothesis
H0:CXDO; 1:2
where C is a known ck matrix with rank c ak, D is a known qd matrix with rank d aq andOis a cd matrix all of whose elements are 0.
2000 Mathematics Subject Classi®cation. primary 62H10, secondly 62E20.
Key words and phrases. Con®dence interval, Conservativeness, General multivariate linear hy- pothesis, Linear combination, MLE, Nonnormality, Robustness.
The GMANOVA model (1.1) with normal error was introduced by Pottho¨ and Roy (1964) and have been extensively studied by many authors.
The maximum likelihood estimators X^ and S^ of X and S, and the likelihood ratio test statistic were obtained by Khatri (1966) and Gleser and Olkin (1970).
Fujikoshi (1974) studied properties of some test statistics, including the LR test statistic, and gave asymptotic expansions of their non-null distributions.
Gleser and Olkin (1970) were the ®rst to derive the exact density of MLE X. Asymptotic expansions of the distributions of^ X^ and its linear trans- form have been studied by Fujikoshi (1985, 1993a) and von Rosen (1997).
Various aspects of statistical inference under normality have been also discussed in literature. For these results, see. e.g., Kariya (1985), von Rosen (1991), Fujikoshi (1993b), Kshirsagar and Smith (1995), and Srivastava and von Rosen (1999).
The above results are based on the assumption that the error vectors e1;. . .;en are independently and identically distributed as a multivariate normal distribution with means 0 and covariance matrix S. Khatri (1988) discussed robustness for test statistic under elliptical distribution. However, the non- normal case has not been investigated so much, except for the case XIp, i.e., MANOVA case. For MANOVA case, Ito (1969, 1980), Chase and Bulgren (1971) and Everitt (1979) studied robustness of certain test statistics by simulation. Wakaki, Yanagihara and Fujikoshi (2000) obtained asymptotic expansions of the null distributions of three test statistics in nonnormal multivariate linear model. These results include several expansions obtained by Kano (1995), Fujikoshi (1997b, 2001), Fujikoshi, Ohmae and Yanagihara (1999) and Yanagihara (1999), as special cases. Our main purpose is to extend the asymptotic expansion formulas in a multivariate linear model to the ones in the GMANOVA model.
The present paper is organized in the following way. In O2, we describe three test statistics. It is shown that our test statistics can be expresses in terms of a random matrix U, which is a kind of Studentized version of X. Using^ this expression, we derive perturbation expansions of our test statistics. In O3, we give an asymptotic expansion of the distribution function of U. Further, asymptotic expansions of other standardized statistics of X^ are obtained in O4. In O5, we obtain asymptotic expansions of the null distributions of three test statistics, by expanding their characteristic function. Moreover, in O6, we discuss robustness of testing under nonnormality and derive a result on conservativeness based on the asymptotic expansion formulas. Some applica- tions of the asymptotic expansions of test statistics are given in O7. In O8, numerical accuracies are studied for some con®dence interval of X and asymptotic expansions of the null distributions for some test statistics under nonnormality.
2. Test statistics and perturbation expansion
First, we summarize typical three test criteria that have been proposed under normality. LetShandSebe the variation matrices due to the hypothesis and the error, respectively, i.e.,
Sh CXD^ 0 CRC0ÿ1 CXD;^ SeD0 X0Sÿ1Xÿ1D;
where
X^ A0Aÿ1A0YSÿ1X X0Sÿ1Xÿ1;
R A0Aÿ1 A0Aÿ1A0YfSÿ1ÿSÿ1X X0Sÿ1XX0Sÿ1gY0A A0Aÿ1; and SY0 InÿPAY. Here PA is the projection matrix to the linear space R Agenerated by the column vectors of A. Then the following three criteria have been proposed, in particular, under normality.
(i) the likelihood ratio statistic:
TLR ÿfnÿkÿ pÿq s1glog jSej=jSeShj;
(ii) the Lawley-Hotelling trace criterion:
THL fnÿkÿ pÿq s2gtr ShSeÿ1;
(iii) the Bartlett-Nanda-Pillai trace criterion:
TBNP fnÿkÿ pÿq s3gtrfSh ShSeÿ1g;
where the constantssj's are the Bartlett corrections in the normal case, and they are given as follows: s1 ÿ dÿc1=2, s2 ÿ d1 and s3c. For the special case qp, note that three criteria are reduced to the ones in the usual MANOVA model. Therefore, as in the MANOVA model, it may be sug- gested to use the criteria for nonnormal models.
Under normality, the distributions of these statistics have been extensively studied. Fujikoshi (1974) obtained asymptotic expansions of the non-null dis- tributions for three test statistics. Under nonnormality it is easily seen that the null distributions of these statistics converge towcd2 as the sample sizentends to in®nity under an appropriate regularity condition on the design matrix (see Huber (1973)). Our main purpose is to obtain asymptotic expansions of the null distributions of these statistics up to the order nÿ1 under a general condition.
Note that the three test statistics are invariant under the transformations from Y0;X to Sÿ1=2Y0;X. Therefore, without loss of generality we may assume SIp by replacing X with Sÿ1=2X. In the following, we shall do that, and we regardX as Sÿ1=2X. We consider expressing the test statistics in terms of
Z A0Aÿ1=2A0E; V 1
pnXn
j1
ejej0ÿIp: 2:1
Note that nÿ1Sÿ1 can be expanded as 1
nS ÿ1
Ipÿ1 nV1
n V2Z0Z Op nÿ3=2:
Therefore, 1
n X0Sÿ1Xÿ1
X0Xÿ1=2
Iqp X1n 0Xÿ1=2X0VX X0Xÿ1=2 ÿ1
n X0Xÿ1=2X0fV IpÿPXVZ0ZgX X0Xÿ1=2
X0Xÿ1=2Op nÿ3=2:
By using these results, we de®ne modi®ed matrices S~e, X~ and R~ by the fol- lowing relations, respectively.
1 nSe ÿ1
fD0 X0Xÿ1Dgÿ1=2S~e2fD0 X0Xÿ1Dgÿ1=2; X^ A0Aÿ1=2ZXX X~ 0Xÿ1=2;
A0Aÿ1=2C0 CRC0ÿ1C A0Aÿ1=2RW~ R;~ where
W A0Aÿ1=2C0fC A0Aÿ1C0gÿ1C0 A0Aÿ1=2:
Then, we obtain W2W and get rank W tr W c. Further, the random matrices S~e, X~ and R~ can be expanded as
S~eIdÿ 1 2pnL0VL
1
2nL0 V IpÿPX3 4Q
VZ0Z
LOp nÿ3=2;
X~Ipÿ 1
p In pÿPXV
1
n IpÿPXfV IpÿPXVZ0Zg Op nÿ3=2;
R~Ikÿ 1
2nWZ IpÿPXZ0WOp nÿ3=2;
2:2
where
LX X0Xÿ1DfD0 X0Xÿ1Dgÿ1=2;
QLL0X X0Xÿ1DfD0 X0Xÿ1Dgÿ1D0 X0Xÿ1X0: Using these expressions, the three test statistics can be expanded as
TGtr U0WU 1
nfr1ÿkÿ pÿqgtr U0WU
r2trf U0WU2g Op nÿ3=2; 2:3
where
URZ~ XL~ S~e: 2:4
Here the constants r1 and r2 are de®ned as follows;
i TLR :r1s1; r2 ÿ1=2;
ii THL :r1s2; r20;
iii TBNP:r1s3; r2 ÿ1:
In our derivation, ®rst we derive an asymptotic expansion of the distribution of U. Then, using the result, we obtain an asymptotic expansion of the null distribution of TG.
3. Edgeworth expansion of U
In this section, we obtain an asymptotic expansion of the distribution function of U up to the ordernÿ1. Without loss of generality, we assume that SIp as in a previous section. So, we regard X as Sÿ1=2X in the following expressions. Lete;e1;. . .;en be a sequence ofi.i.d.random vectors with E e 0 and Cov e Ip. We write a moment of e as
mi1...imEei1. . .eim;
where ej denotes the jth element of e. Similarly, the corresponding cumulant of e is expressed as ki1...im. Further, we use the following real matrix notation for arguments of some characteristic functions.
T tab:kd matrix;
T1 t 1ab:kp matrix;
T2 1dabt 2ab=2:pp matrix;
where dab is the Kronecker delta, i.e., daa1 and dab0 for a0b.
In order to get a valid expansion for the distribution function of U up the order nÿ1, we make some assumptions for the between-individuals design matrix A and the distribution of e. Let ln be the smallest eigenvalue ofA0A,
and Mnmaxfkajk: j1;. . .ng, where k k denotes the Euclidean norm.
We make the following assumptions.
B1. lim sup
n!y
1 n
Xn
j1
kajk4<y, B2. lim inf
n!y
ln
n >0,
B3. For some constant d>0, MnO n1=2ÿd, B4. E kek8<y,
B5. The CrameÂr's condition for e and ee0; lim sup
ktkkT2k!yjEexpfit0eitr T2ee0gj<1;
wheret is a p1 real vector. Here, we de®ne the norm of a matrix T2 as kT2k Pp
a1
Pp
b1f 1dabt 2abg2=41=2:
From (2.2) and (2.4), the random matrix U can be expanded as UU0 1
pnU11
nU2Op nÿ3=2; 3:1
where
U0ZL;
U1 ÿ1
2ZfQ2 IpÿPXgVL;
U21
8Z2fQ2 IpÿPXgVfQ2 IpÿPXg QVQVL
1
2ZfQ2 IpÿPXgZ0ZLÿ1
2WZ IpÿPXZ0WZL:
Using (3.1), the characteristic functionCU TofU can be expanded under the assumptions B1, B2, B3 and B4 as
CU T Eexpfitr T0Ug
Eexpfitr T0U0g 1
pnEitr T0U1expfitr T0U0g
1
nE itr T0U2 i2
2ftr T0U1g2
expfitr T0U0g
o nÿ1
CU 0 T 1
pnCU 1 T 1
nC 2U T o nÿ1:
Now we need to evaluate each term in the expansion ofCU T. Here we note that, rank L d, which can be essentially done in the same way as in Wakaki, Yanagihara and Fujikoshi (2000). The method is based on the use of dif- ferentials for C T1;T2, which is de®ned by
C T1;T2 Eexpfitr T10Znÿ1=2T2Vg:
Therefore, letting T1TL0 we have CU 0 T C T1;O
exp (i2
2 tr T10T1 i3 6
pn Xk
a0b0c0
Xp
abc
t 1a0at 1b0bt 1c0cwa0b0c0kabc
i4 24n
Xk
a0b0c0d0
Xp
abcd
t 1a0at 1b0bt 1c0ct 1d0dwa0b0c0d0kabcd o nÿ1 )
;
where
wa1...aj 1 n
Xn
i1
Yj
l1
wial;
pn
A0Aÿ1=2ai wi1;. . .;wik0: 3:2
Further,
CU 1 T ÿi
2EtrfT0Z Q2 IpÿPXVLgexpfitr T0ZLg
ÿi
2EtrfT10Z Q2 IpÿPXVgexpfitr T10Zg
ÿi
2 iÿ2Xk
a0
Xp
abc
t 1a0c qab2rab q2
qt 1a0aqt 2bc C T1;T2jT2O; 3:3
where qab and rab the a;bth elements of Q and IpÿPX resrpectively, and Pk
a1...aj Pk
a11. . .Pk
aj1. Note that q2
qt 1a0aqt 2bc C T1;T2jT2O
"
i2wa0kabci4t 1a0a
Xk
b0
Xp
d
t 1b0dwb0kbcd
1
pn (
i3Xp
d
t 1a0d mabcdÿdaddbc i5 2t 1a0a
Xk
b0
Xp
de
t 1b0dt 1b0e mbcdeÿdbcdde
i5 2
Xk
b0c0d0
Xp
def
t 1b0et 1c0ft 1d0dwa0b0c0wd0kaefkbcd )#
C T1;O o nÿ1=2:
Moreover, it holds that IpÿPXL0, tr IpÿPX pÿq, IpÿPX2 IpÿPX, L0LId and Q2Q, in other words
Xp
b
rablbj0; Xp
a
raa pÿq; Xp
c
racrbcrab; Xp
a lailajdij; Xp
c qacqbcqab;
where lab is the a;bth element of L. By substituting these equations into (3.3) and replacing t 1a0a with Pd
j1ta0jlaj, we can evaluate CU 0 T and CU 1 T.
Similarly, we can evaluate CU 2 T. Therefore, we can obtain an expansion of CU T, whose formal inversion yields a valid expansion of the distribution function of U as in the following Theorem 3.1.
Some additional notations on cumulants need to be de®ned before describing Theorem 3.1. The quantity K laq1q1, which depends on the third order cumulants and the elements of L and Q is de®ned as
K laq1q1 Xp
a0b0c0
la0aqb0c0ka0b0c0: 3:4
In this expression, the order of indices inK corresponds to the one of indices in ka0b0c0. So, the l accompanying with index a0 appears to the ®rst order of indices in K. Similarly, the second and third order of indices in K are the q with indices b0 and c0, respectively. Further, the same number in indices expresses as the same element of symmetric matrix. Along the same line as (3.4), we de®ne
K lar1r2 lbr1r2 Xp
a0b0c0d0e0f0
la0ald0brb0e0rc0f0ka0b0c0kd0e0f0: Other constants are de®ned similarly.
Theorem 3.1. Suppose that the design matrix A and the error matrix Ein (1.1) satisfy the assumptions B1,B2,B3,B4 and B5. Let uvec U, then the distribution function of U can be expanded as
P vec Uax
x1
ÿy. . . xkd
ÿyfkd u 1 1
pnR1 u 1 nR2 u
duo nÿ1;
where
R1 u ÿ1 2
Xk
a0
Xd
ab
wa0 K laq1q12K lar1r1Ha0a u
1 6
Xk
a0b0c0
Xd
abc
wa0b0c0ÿ3wa0db0c0K lalblcHa0a;b0b;c0c;d0d u; 3:5
R2 u 1 8
Xk
a0b0
Xd
abcd
wa0wb0 K laq1q1 lbq2q23K lalbq1 q1q2q2
4K laq1q2 lbq1q24K lar1r1 lbr2r28K lalbr1 r1r2r2
12K lar1r2 lbr1r24K lalbr1 q1q1r112K laq1r1 lbq1r1
4K lalbq1 q1r1r14K laq1q1 lbr1r1Ha0a;b0b u
1 2
Xk
a0
Xd
a
f k3pÿ2q1 oa0a0 pÿqgHa0a;a0a u
1 24
Xk
a0b0c0d0
Xd
abcd
wa0b0c0d0ÿ3da0b0dc0d0K lalblcld
ÿ2wa0b0c0wd0fK lalblc ldq1q13K laldq1 lblcq1
2K lalblc ldr1r16K laldr1 lblcr1g 3wa0wb0dc0d0fK laq1q1 lblcld
K lalbq1 lcldq12K lalcq1 lbldq14K lar1r1 lblcld
4K lalbr1 lcldr18K lalcr1 lbldr1g
6da0b0dc0d0dacdbcHa0a;a0b;c0c;d0d u
1 72
Xk
a0b0c0d0e0f0
Xd
abcbdef
wa0b0c0wd0e0f0ÿ6wa0b0c0wd0de0f09wa0db0c0wd0de0f0
K lalblc ldlelfHa0a;b0b;c0c;d0d;e0e;f0f u: 3:6
Here fkd u is the probability density function of Nkd 0;Ikd given by fkd u 2pÿkd=2exp ÿu0u=2, and Ha10a1;...;aj0aj uis the multivariate Hermite polynomial.
In Theorem 3.1, the multivariate Hermite polynomial is de®ned by Ha10a1;...;aj0aj u ÿ1j qj
qua10a1. . .quaj0aj
fkd u;
where ua0a is the a0;ath element of U. For example Ha0a u ua0a;
Ha0a;b0b u ua0aub0bÿdabda0b0;
Ha0a;b0b;c0c u ua0aub0buc0cÿX
3
ua0adbcdb0c0;
Ha0a;b0b;c0c;d0d u ua0aub0buc0cud0dÿX
6
ua0aub0bdcddc0d0X
3
dabdcdda0b0dc0d0; Ha0a;b0b;c0c;d0d;e0e;f0f u ua0aub0buc0cud0due0euf0f ÿX
15
ua0aub0buc0cud0ddefde0f0
X
45
ua0aub0bdcddefdc0d0de0f0X
45
dabdcddefda0b0dc0d0de0f0: Here P
j means the sum of all j possible combinations of the sets ai0 and ai, for example
X
3
da0b0dc0d0dabdcd da0b0dc0d0dabdcdda0c0db0d0dacdbdda0d0db0c0daddbc: It may be noted that we can demonstrate the validity of the expansion by the argument similar to the one as in Wakaki, Yanagihara and Fujikoshi (2000), which is based on the same manner as in Bhattacharya and Ghosh (1978). Moreover, the moment condition B4 will be replaced with E kek4<
y as in Hall (1987).
4. Asymptotic expansions of the distribution functions of X^ and its linear combination
4.1. Two types of standardizations
In this section we consider asymptotic expansions of the distribution functions for X^ and its linear combination, where X^ is the maximum like- lihood estimator of X under normality. Related to the construction of con®- dence intervals ofXand its linear combination, we consider following two types of standardizations.
(1) standardized X:^
US A0A1=2 X^ÿX X0Sÿ1X1=2; (2) Studentized X:^
UT
pn
A0A1=2 X^ÿX X0Sÿ1X1=2; (3) standardized linear combination of X:^
USL a0 X^ÿXb=t;
(4) Studentized linear combination of X:^ UTLa0 X^ÿXb=^t;
where a and b are k1 and q1 ®xed vectors, respectively, and positive values t and ^t are de®ned by
t2a0 A0Aÿ1ab0 X0Sÿ1Xÿ1b; ^t21
na0 A0Aÿ1ab0 X0Sÿ1Xÿ1b:
Note that these standardizations have been proposed under normality.
However, we shall see that such standardizations do work asymptotically under nonnormality. Under normality, Fujikoshi (1987, 1993a) and von Rosen (1997) derived asymptotic expansions of the distributions of these statistics.
Further, its error bounds were discussed in Fujikoshi (1987, 1993a).
In this section, without loss of generality, we assume that SIp as in previous sections. So, X is regarded asSÿ1=2X. Therefore, t2 is rewritten as
t2 a0 A0Aÿ1ab0 X0Xÿ1b:
4.2. Asymptotic expansion of US
Let MX X0Xÿ1=2 whose a;bth elements of M is denoted as mab. From (2.1), US can be expanded as
US ZMÿ 1
pnZ IpÿPXVM
1
nZ IpÿPXfV IpÿPX Z0ZgMOp nÿ3=2: 4:1
From (4.1) we can expand the characteristic function CUS T3 of US as CUS T3 Eexpfitr T30ZMg
ÿ i
pnEtrfT30Z IpÿPXVMgexpfitr T30ZMg
i
nEtrfT30Z IpÿPXfV IpÿPX Z0ZgMgexpfitr T30ZMg
i2
2nEftr T30Z IpÿPXVMg2expfitr T30ZMg o nÿ1;
where T3 is a kq real matrix. Letting T1T3M0, we can see that the characteristic function can be evaluated by the same method as in Section 3.
In this case, using the relations M0MIq, MM0PX and IpÿPXM 0, we can obtain an expansion of CUS T3, whose inversion yields an asymptotic expansion of the distribution function of US as in Theorem 4.1.
Theorem 4.1. Suppose that the design matrix A and the error matrix Ein (1.1)satisfy the assumptions B1,B2,B3,B4and B5. Let uvec US, then the distribution function of US can be expanded as
P vec USax
x1
ÿy. . . xkq
ÿyfkq u 1 1
pnRS;1 u 1 nRS;2 u
duo nÿ1;
where
RS;1 u ÿXk
a0
Xq
a
wa0K mar1r1Ha0a u
1 6
Xk
a0b0c0
Xq
abc
wa0b0c0K mambmcHa0a;b0b;c0c u; 4:2
RS;2 u 1 2
Xk
a0b0
Xq
ab
pÿqda0b0dabÿda0b0K mambr1r1
wa0wb0fK mar1r1 mbr2r22K mambr1 r1r2r2
3K mar1r2 mbr1r2gHa0a;b0b u
1 24
Xk
a0b0c0d0
Xq
abcd
12wa0wb0dc0d0K mamcr1 mbmdr1
ÿ4wa0b0c0wd0fK mambmc mdr1r13K mambr1 mcmdr1g
wa0b0c0d0K mambmcmdHa0a;b0b;c0c;d0d u
1 72
Xk
a0b0c0d0e0f0
Xq
abcdef
wa0b0c0wd0e0f0
K mambmc mdmemfHa0a;b0b;c0c;d0d;e0e;f0f u: 4:3
Specially, when e is distributed as Np 0;S, RS;1 u 0; RS;2 u 1
2 pÿqXk
a0
Xq
a
Ha0a;a0a u:
Therefore,
P vec USax
x1
ÿy. . . xkq
ÿyfkq u 1 1
2n pÿqXk
a0
Xq
a
Ha0a;a0a u
" #
duo nÿ1:
This result coincides with the formula in Fujikoshi (1987).
4.3. Asymptotic expansion of UT
Let li be the eigenvalues of X0X and Ldiag l1;. . .;lq. Further, let H be an orthogonal matrix of orderq such that X0X HLH0. Then, using a perturbation formula (see, Okamoto and Fujikoshi (1976)), we have
n
p X0Sÿ1X1=2 X0X1=2p1nHS 1H01
nHS 2H0Op nÿ3=2; 4:4
where the i;jth elements of S 1 and S 2 are de®ned by S 1ij Bij
li p
lj
p ; S 2ij ÿ S 12ij
li p
lj p :
Here ij is denoted as the i;jth element of the matrix in the parenthesis and the matrix B is de®ned by
B ÿH0X0 Vÿp V1n 2Z0Z
XH:
Substituting (4.4) into UT, we can represent as UT US 1
pnUS X0Xÿ1=2HS 1H0
1
nUS X0Xÿ1=2HS 2H0Op nÿ3=2: 4:5
From (4.5), the characteristic function CUT T3 of UT can be expanded as CUT T3 CUS T3
i
pnEtrfT30 X0Xÿ1=2HS 1H0gexpfitr T30USg
i
nEtrfT30 X0Xÿ1=2HS 2H0gexpfitr T30USg
i2
2nEftrfT30 X0Xÿ1=2HS 1H0gg2expfitr T30USg o nÿ1:
By computing CUT T3 and inverting the resultant expansion, we can obtain an asymptotic expansion of UT as in Theorem 4.2.
Theorem 4.2. Suppose that the design matrix A and the error matrix Ein (1.1) satisfy the assumptions B1, B2, B3, B4 and B5. Let uvec UT,
hij 1 li p
lj
p ; nijhij li;
and hab, h 1ab and h 2ab denote the a;bth elements of H, XH and X0X1=2H respectively. Then the distribution function of UT can be expanded as