NULL DISTRIBUTION OF MULTIPLE CORRELATION COEFFICIENT UNDER MIXTURE NORMAL MODEL


http://ijmms.hindawi.com


HYDAR ALI and DAYA K. NAGAR Received 14 April 2001

The multiple correlation coefficient is used in a large variety of statistical tests and regression problems. In this article, we derive the null distribution of the square of the sample multiple correlation coefficient, $R^2$, when a sample is drawn from a mixture of two multivariate Gaussian populations. The moments of $1-R^2$ and the inverse Mellin transform are used to derive the density of $R^2$.

2000 Mathematics Subject Classification: 62H10, 62H15.

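1. Introduction. Suppose that $x$ $(p\times 1)$, $\mu$ $(p\times 1)$, and $\Sigma$ $(p\times p)>0$ are partitioned as
\[
x=\begin{pmatrix}x_1\\ x^{(2)}\end{pmatrix},\qquad
\mu=\begin{pmatrix}\mu_1\\ \mu^{(2)}\end{pmatrix},\qquad
\Sigma=\begin{pmatrix}\sigma_{11}&\sigma_{21}'\\ \sigma_{21}&\Sigma_{22}\end{pmatrix},
\]
where $x^{(2)}=(x_2,\dots,x_p)'$ and $\mu^{(2)}=(\mu_2,\dots,\mu_p)'$ are $(p-1)\times 1$ and $\Sigma_{22}$ is $(p-1)\times(p-1)$, so that $\operatorname{Var}(x_1)=\sigma_{11}$, $\operatorname{Cov}(x^{(2)})=\Sigma_{22}$, and $\sigma_{21}$ is the $(p-1)\times 1$ vector of covariances between $x_1$ and $x_2,\dots,x_p$. The multiple correlation coefficient between $x_1$ and $x^{(2)}$, denoted by $\bar R_{1\cdot 2\cdots p}$, is defined as
\[
\bar R_{1\cdot 2\cdots p}=\left(\frac{\sigma_{21}'\Sigma_{22}^{-1}\sigma_{21}}{\sigma_{11}}\right)^{1/2}.
\tag{1.1}
\]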

Let $A$ be the sample sum of squares and products matrix formed from $N$ independent observations on $x$. Partition $A$ as
\[
A=\begin{pmatrix}a_{11}&a_{21}'\\ a_{21}&A_{22}\end{pmatrix},
\]
where $A_{22}$ is $(p-1)\times(p-1)$. The sample multiple correlation coefficient between $x_1$ and $x^{(2)}$ is defined by
\[
R=\left(\frac{a_{21}'A_{22}^{-1}a_{21}}{a_{11}}\right)^{1/2}.
\tag{1.2}
\]

It is well known that, when the underlying population is normal, the random matrix $A$ has a Wishart distribution with $n=N-1$ degrees of freedom and parameter matrix $\Sigma$.

Further, $\bar R_{1\cdot 2\cdots p}=0$ if and only if $x_1$ is independent of $x^{(2)}=(x_2,\dots,x_p)'$. Furthermore, when the population multiple correlation coefficient $\bar R_{1\cdot 2\cdots p}$ is zero, the distribution of $R^2$ is beta with parameters $(1/2)(p-1)$ and $(1/2)(N-p)$.
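As a small illustration of the normal-theory null result above, the following Python sketch evaluates the beta density of $R^2$ with parameters $(1/2)(p-1)$ and $(1/2)(N-p)$ and its mean. The function names are hypothetical and chosen only for this example.

```python
import math


def null_r2_density(r2, p, N):
    """Beta((p-1)/2, (N-p)/2) density of R^2 at r2, for 0 < r2 < 1."""
    a, b = 0.5 * (p - 1), 0.5 * (N - p)
    log_norm = math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
    return math.exp(log_norm + (a - 1) * math.log(r2) + (b - 1) * math.log(1 - r2))


def null_r2_mean(p, N):
    """Mean of a Beta(a, b) variable is a / (a + b), here (p - 1) / (N - 1)."""
    return (p - 1) / (N - 1)
```

For instance, `null_r2_mean(3, 11)` gives the expected value of $R^2$ for $p=3$ and $N=11$ when the population coefficient is zero.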

In practice, it is often the case that the random variables are not normally distributed. How does such a departure from normality affect the conventional inference procedures? Specifically, what are the sampling distributions of some commonly used statistics? To provide some answers to these questions, Srivastava and Awan [9] and Tan [11] derived the distribution of the sample sum of squares and products matrix when sampling from a mixture of two multivariate normal distributions. The normal mixture is defined as follows:

\[
f(x)=\epsilon N_p\bigl(\mu_1,\Sigma;x\bigr)+(1-\epsilon)N_p\bigl(\mu_2,\Sigma;x\bigr),\qquad x\in\mathbb{R}^p,
\tag{1.3}
\]
where
\[
N_p(\mu,\Sigma;x)=(2\pi)^{-(1/2)p}\det(\Sigma)^{-1/2}\exp\Bigl\{-\tfrac12(x-\mu)'\Sigma^{-1}(x-\mu)\Bigr\},\qquad x\in\mathbb{R}^p,\ \mu\in\mathbb{R}^p,\ \Sigma>0,
\tag{1.4}
\]
and $1-\epsilon$ is known as the degree of contamination. This model is very common in medical, biological, and agricultural experiments (Titterington et al. [12]). For results on the distribution theory and robustness studies of certain test statistics when sampling from a mixture normal model, see Srivastava [8], Srivastava and Awan [9, 10], Kabe and Gupta [5], Amey and Gupta [2], and Nagar and Castañeda [7].
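To make the mixture model (1.3) concrete, here is a minimal Python sketch that draws observations from it. It writes `eps` for the mixing proportion of the first component and, purely for simplicity, assumes an identity covariance matrix; the function names are hypothetical.

```python
import random


def sample_mixture(n_obs, eps, mu1, mu2, seed=0):
    """Draw n_obs vectors from eps * N(mu1, I) + (1 - eps) * N(mu2, I).

    Assumption for this sketch: Sigma is the identity matrix.
    """
    rng = random.Random(seed)
    draws = []
    for _ in range(n_obs):
        # Pick the component first, then draw independent unit-variance normals.
        mean = mu1 if rng.random() < eps else mu2
        draws.append([rng.gauss(m, 1.0) for m in mean])
    return draws


def column_means(rows):
    """Coordinate-wise sample mean of a list of equal-length vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]
```

The sample mean of such draws should be close to $\epsilon\mu_1+(1-\epsilon)\mu_2$, which is a quick check that the component labels are assigned with the intended probabilities.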

Srivastava [8], using certain transformations, derived the null distribution of the multiple correlation coefficient when sampling from a mixture of two multivariate normal distributions (see also Gupta and Kabe [3]). Amey [1] integrated the joint density of $a_{11}$, $a_{21}$, and $A_{22}$ suitably to derive the density of $R^2$ and studied its robustness.

In this article, we derive the null distribution of $R^2$ when sampling from a mixture of two multivariate normal distributions. First, we derive the $h$th null moment of $1-R^2$. Then, by using the inverse Mellin transform, the density of $1-R^2$ is obtained, from which the density of $R^2$ is deduced.

Note that $R^2$ is a function of the elements of the sample sum of squares and products matrix $A$. Therefore, in our derivation, we use the distribution of $A$ when sampling from the above model. Srivastava and Awan [9] and Tan [11] have shown that the density of $A$, when sampling from (1.3), is a binomial sum of linear noncentral Wishart densities:

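\[
f(A)=\sum_{\gamma=0}^{N}\binom{N}{\gamma}\epsilon^{\gamma}(1-\epsilon)^{N-\gamma}\,W_p\bigl(n,\Sigma,c_\gamma^2\Sigma^{-1}\nu\nu';A\bigr),
\tag{1.5}
\]
where $n=N-1$, $c_\gamma^2=\gamma(N-\gamma)/N$, and $\nu=(\mu_1-\mu_2)$. Here $W_p(n,\Sigma,c_\gamma^2\Sigma^{-1}\nu\nu';A)$ represents the noncentral Wishart density with $n$ degrees of freedom and noncentrality parameter matrix $c_\gamma^2\Sigma^{-1}\nu\nu'$ defined by
\[
K_p(n,\Sigma,\nu)\operatorname{etr}\Bigl(-\tfrac12\Sigma^{-1}A\Bigr)\det(A)^{(1/2)(n-p-1)}\,{}_0F_1^{(p)}\Bigl(\tfrac12 n;\tfrac14 c_\gamma^2\Sigma^{-1}A\Sigma^{-1}\nu\nu'\Bigr),
\tag{1.6}
\]
where
\[
K_p(n,\Sigma,\nu)=\Bigl\{2^{(1/2)pn}\,\Gamma_p\bigl(\tfrac12 n\bigr)\det(\Sigma)^{(1/2)n}\Bigr\}^{-1}\operatorname{etr}\Bigl(-\tfrac12 c_\gamma^2\Sigma^{-1}\nu\nu'\Bigr)
\tag{1.7}
\]
and $\Gamma_m(a)=\pi^{(1/4)m(m-1)}\prod_{j=1}^{m}\Gamma\bigl(a-(1/2)(j-1)\bigr)$.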
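The constant $c_\gamma^2$ in (1.5) can be motivated by a short conditional argument. The following is only a sketch, not the derivation of [9] or [11]: it assumes that exactly $\gamma$ of the $N$ observations come from the first component, writes $m_i$ for the mean of the $i$th observation (equal to $\mu_1$ or $\mu_2$, the component mean vectors in (1.3)), and $\bar m$ for the average of the $m_i$.
\[
\begin{aligned}
\bar m&=\frac{\gamma\mu_1+(N-\gamma)\mu_2}{N},\qquad
\mu_1-\bar m=\frac{N-\gamma}{N}\,\nu,\qquad
\mu_2-\bar m=-\frac{\gamma}{N}\,\nu,\\
\sum_{i=1}^{N}(m_i-\bar m)(m_i-\bar m)'
&=\gamma\Bigl(\frac{N-\gamma}{N}\Bigr)^{2}\nu\nu'+(N-\gamma)\Bigl(\frac{\gamma}{N}\Bigr)^{2}\nu\nu'
=\frac{\gamma(N-\gamma)}{N}\,\nu\nu'=c_\gamma^2\,\nu\nu'.
\end{aligned}
\]
Premultiplying by $\Sigma^{-1}$ gives the noncentrality matrix appearing in (1.5). The number of observations from the first component is binomial with parameters $N$ and the mixing proportion, which accounts for the binomial weights.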

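2. Null moments of $1-R^2$. In this section, we derive moments of $1-R^2$ when $\bar R_{1\cdot 2\cdots p}=0$ (or equivalently $\sigma_{21}=0$). Let
\[
\Sigma_0=\begin{pmatrix}\sigma_{11}&0\\ 0&\Sigma_{22}\end{pmatrix}
\]
and $U=1-R^2$. Since $a_{11}$ is scalar,
\[
U=1-R^2=1-\frac{a_{21}'A_{22}^{-1}a_{21}}{a_{11}}=\frac{\det(A)}{a_{11}\det(A_{22})}.
\tag{2.1}
\]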
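The identity (2.1) is easy to check numerically. The sketch below computes $R^2$ from the quadratic form in (1.2) and $U$ from the determinant ratio for a small positive definite matrix. It uses plain Python lists; the helper names (`det`, `solve`, `r_squared`, `u_from_determinants`) are hypothetical and intended only for small $p$.

```python
def det(m):
    """Determinant by Laplace expansion along the first row (small matrices only)."""
    if len(m) == 1:
        return m[0][0]
    total = 0.0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total


def solve(m, b):
    """Solve m x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(m)
    aug = [[float(v) for v in m[i]] + [float(b[i])] for i in range(n)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[piv] = aug[piv], aug[col]
        for r in range(n):
            if r != col:
                f = aug[r][col] / aug[col][col]
                aug[r] = [x - f * y for x, y in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]


def r_squared(a):
    """R^2 = a21' A22^{-1} a21 / a11, with the first variable as x1."""
    a11 = a[0][0]
    a21 = [row[0] for row in a[1:]]
    a22 = [row[1:] for row in a[1:]]
    w = solve(a22, a21)  # w = A22^{-1} a21
    return sum(x * y for x, y in zip(a21, w)) / a11


def u_from_determinants(a):
    """U = det(A) / (a11 det(A22)), the right-hand side of (2.1)."""
    a22 = [row[1:] for row in a[1:]]
    return det(a) / (a[0][0] * det(a22))
```

With `A = [[4, 1, 2], [1, 3, 0], [2, 0, 5]]`, the two routes agree: `1 - r_squared(A)` and `u_from_determinants(A)` both equal $43/60$.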

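The $h$th null moment of $U$ is given by
\[
E\bigl(U^h\bigr)=\sum_{\gamma=0}^{N}\binom{N}{\gamma}\epsilon^{\gamma}(1-\epsilon)^{N-\gamma}E_\gamma\bigl(U^h\bigr),
\tag{2.2}
\]
where
\[
\begin{aligned}
E_\gamma\bigl(U^h\bigr)={}&K_p\bigl(n,\Sigma_0,\nu\bigr)\int_{A>0}\operatorname{etr}\Bigl(-\tfrac12\Sigma_0^{-1}A\Bigr)a_{11}^{-h}\det\bigl(A_{22}\bigr)^{-h}\\
&\times\det(A)^{(1/2)(n-p-1)+h}\,{}_0F_1^{(p)}\Bigl(\tfrac12 n;\tfrac14 c_\gamma^2\Sigma_0^{-1}A\Sigma_0^{-1}\nu\nu'\Bigr)\,dA.
\end{aligned}
\tag{2.3}
\]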
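The outer sum in (2.2) is a binomial average of the component moments $E_\gamma(U^h)$. A minimal Python sketch of that bookkeeping follows; `eps` stands for the mixing proportion in (1.3), and `component_moment` is a hypothetical callable supplied by the user that returns $E_\gamma(U^h)$ for a given $\gamma$.

```python
import math


def mixture_weights(N, eps):
    """Binomial weights C(N, g) * eps**g * (1 - eps)**(N - g) for g = 0..N."""
    return [math.comb(N, g) * eps**g * (1 - eps) ** (N - g) for g in range(N + 1)]


def c_gamma_sq(g, N):
    """Noncentrality scale c_gamma^2 = g (N - g) / N from (1.5)."""
    return g * (N - g) / N


def mixture_moment(N, eps, component_moment):
    """Weighted sum over gamma of component_moment(gamma), as in (2.2)."""
    weights = mixture_weights(N, eps)
    return sum(w * component_moment(g) for g, w in enumerate(weights))
```

Note that $c_\gamma^2=0$ for $\gamma=0$ and $\gamma=N$, so those two terms are central Wishart contributions.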

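Replacing $a_{11}^{-h}$ and $\det(A_{22})^{-h}$ by their integral representations, namely
\[
\begin{aligned}
a_{11}^{-h}&=\frac{1}{2^{h}\Gamma(h)}\int_0^\infty\exp\Bigl(-\tfrac12 a_{11}y_1\Bigr)y_1^{h-1}\,dy_1,\qquad \operatorname{Re}(h)>0,\\
\det\bigl(A_{22}\bigr)^{-h}&=\frac{1}{2^{(p-1)h}\Gamma_{p-1}(h)}\int_{Y_{22}>0}\operatorname{etr}\Bigl(-\tfrac12 A_{22}Y_{22}\Bigr)\det\bigl(Y_{22}\bigr)^{h-(1/2)(p-1+1)}\,dY_{22},\qquad \operatorname{Re}(h)>\tfrac12(p-2),
\end{aligned}
\tag{2.4}
\]
respectively, in (2.3) and integrating $A$, the moment expression is rewritten as
\[
\begin{aligned}
E_\gamma\bigl(U^h\bigr)={}&\frac{2^{(1/2)np}K_p\bigl(n,\Sigma_0,\nu\bigr)\,\Gamma_p\bigl((1/2)n+h\bigr)}{\Gamma(h)\,\Gamma_{p-1}(h)}\\
&\times\int_0^\infty y_1^{h-1}\int_{Y_{22}>0}\det\bigl(Y_{22}\bigr)^{h-(1/2)p}\det\bigl(\Sigma_0^{-1}+Y\bigr)^{-(1/2)n-h}\\
&\times{}_1F_1^{(p)}\Bigl(\tfrac12 n+h;\tfrac12 n;\tfrac12 c_\gamma^2\Sigma_0^{-1}\bigl(\Sigma_0^{-1}+Y\bigr)^{-1}\Sigma_0^{-1}\nu\nu'\Bigr)\,dy_1\,dY_{22},
\end{aligned}
\tag{2.5}
\]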

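where $Y=\begin{pmatrix}y_1&0\\ 0&Y_{22}\end{pmatrix}$ and ${}_1F_1^{(p)}$ is the confluent hypergeometric function of matrix argument (Gupta and Nagar [4]). Since $\operatorname{rank}\bigl(\Sigma_0^{-1}(\Sigma_0^{-1}+Y)^{-1}\Sigma_0^{-1}\nu\nu'\bigr)=1$, the only nonzero characteristic root of the matrix $\Sigma_0^{-1}(\Sigma_0^{-1}+Y)^{-1}\Sigma_0^{-1}\nu\nu'$ is $\operatorname{tr}\bigl((\Sigma_0^{-1}+Y)^{-1}\Sigma_0^{-1}\nu\nu'\Sigma_0^{-1}\bigr)$ and therefore,
\[
{}_1F_1^{(p)}\Bigl(\tfrac12 n+h;\tfrac12 n;\tfrac12 c_\gamma^2\Sigma_0^{-1}\bigl(\Sigma_0^{-1}+Y\bigr)^{-1}\Sigma_0^{-1}\nu\nu'\Bigr)
={}_1F_1\Bigl(\tfrac12 n+h;\tfrac12 n;\tfrac12 c_\gamma^2\operatorname{tr}\Bigl(\bigl(\Sigma_0^{-1}+Y\bigr)^{-1}\Sigma_0^{-1}\nu\nu'\Sigma_0^{-1}\Bigr)\Bigr),
\tag{2.6}
\]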

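where ${}_1F_1$ is the confluent hypergeometric function of scalar argument (see [6]). Substituting (2.6) in (2.5) and expanding ${}_1F_1$ in series form, the moment expression simplifies to
\[
\begin{aligned}
E_\gamma\bigl(U^h\bigr)={}&\frac{2^{(1/2)np}K_p\bigl(n,\Sigma_0,\nu\bigr)\,\Gamma_p\bigl((1/2)n+h\bigr)}{\Gamma(h)\,\Gamma_{p-1}(h)}\sum_{t=0}^{\infty}\Bigl(\frac{c_\gamma^2}{2}\Bigr)^{t}\frac{\bigl((1/2)n+h\bigr)_t}{\bigl((1/2)n\bigr)_t\,t!}\\
&\times\int_0^\infty y_1^{h-1}\int_{Y_{22}>0}\det\bigl(Y_{22}\bigr)^{h-(1/2)p}\det\bigl(\Sigma_0^{-1}+Y\bigr)^{-(1/2)n-h}\\
&\times\Bigl[\nu'\Sigma_0^{-1}\bigl(\Sigma_0^{-1}+Y\bigr)^{-1}\Sigma_0^{-1}\nu\Bigr]^{t}\,dy_1\,dY_{22},
\end{aligned}
\tag{2.7}
\]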

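where $(a)_r=a(a+1)\cdots(a+r-1)$ and $(a)_0=1$. Noting that $\Sigma_0$ is a block diagonal matrix and partitioning $\nu=(\nu_1,\nu_2')'$ conformably with $x$, we obtain
\[
\begin{aligned}
\Bigl[\nu'\Sigma_0^{-1}\bigl(\Sigma_0^{-1}+Y\bigr)^{-1}\Sigma_0^{-1}\nu\Bigr]^{t}
&=\Bigl[\nu_1^2\sigma_{11}^{-1}\bigl(1+\sigma_{11}y_1\bigr)^{-1}+\nu_2'\Sigma_{22}^{-1}\bigl(\Sigma_{22}^{-1}+Y_{22}\bigr)^{-1}\Sigma_{22}^{-1}\nu_2\Bigr]^{t}\\
&=\sum_{k+\ell=t}\frac{t!}{k!\,\ell!}\Bigl[\nu_1^2\sigma_{11}^{-1}\bigl(1+\sigma_{11}y_1\bigr)^{-1}\Bigr]^{k}\Bigl[\nu_2'\Sigma_{22}^{-1}\bigl(\Sigma_{22}^{-1}+Y_{22}\bigr)^{-1}\Sigma_{22}^{-1}\nu_2\Bigr]^{\ell},\\
\det\bigl(\Sigma_0^{-1}+Y\bigr)&=\sigma_{11}^{-1}\bigl(1+\sigma_{11}y_1\bigr)\det\bigl(\Sigma_{22}\bigr)^{-1}\det\bigl(I_{p-1}+\Sigma_{22}Y_{22}\bigr).
\end{aligned}
\tag{2.8}
\]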

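Now substituting (2.8) in (2.7), we have
\[
\begin{aligned}
E_\gamma\bigl(U^h\bigr)={}&\frac{2^{(1/2)np}K_p\bigl(n,\Sigma_0,\nu\bigr)\,\Gamma_p\bigl((1/2)n+h\bigr)\det\bigl(\Sigma_0\bigr)^{(1/2)n+h}}{\Gamma(h)\,\Gamma_{p-1}(h)}\\
&\times\sum_{t=0}^{\infty}\Bigl(\frac{c_\gamma^2}{2}\Bigr)^{t}\frac{\bigl((1/2)n+h\bigr)_t}{\bigl((1/2)n\bigr)_t}\sum_{k+\ell=t}\frac{1}{k!\,\ell!}\Bigl(\frac{\nu_1^2}{\sigma_{11}}\Bigr)^{k}\\
&\times\int_0^\infty y_1^{h-1}\bigl(1+\sigma_{11}y_1\bigr)^{-((1/2)n+h+k)}\,dy_1\\
&\times\int_{Y_{22}>0}\det\bigl(Y_{22}\bigr)^{h-(1/2)p}\det\bigl(I_{p-1}+\Sigma_{22}Y_{22}\bigr)^{-(1/2)n-h}\\
&\times\Bigl[\nu_2'\Sigma_{22}^{-1}\bigl(\Sigma_{22}^{-1}+Y_{22}\bigr)^{-1}\Sigma_{22}^{-1}\nu_2\Bigr]^{\ell}\,dY_{22}.
\end{aligned}
\tag{2.9}
\]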

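Substituting $Z=\bigl(I_{p-1}+\Sigma_{22}^{1/2}Y_{22}\Sigma_{22}^{1/2}\bigr)^{-1}$, the integral involving $Y_{22}$ is evaluated as
\[
\begin{aligned}
&\int_{Y_{22}>0}\det\bigl(Y_{22}\bigr)^{h-(1/2)p}\det\bigl(I_{p-1}+\Sigma_{22}Y_{22}\bigr)^{-(1/2)n-h}\Bigl[\nu_2'\Sigma_{22}^{-1}\bigl(\Sigma_{22}^{-1}+Y_{22}\bigr)^{-1}\Sigma_{22}^{-1}\nu_2\Bigr]^{\ell}\,dY_{22}\\
&\quad=\det\bigl(\Sigma_{22}\bigr)^{-h}\int_{0<Z<I_{p-1}}\det(Z)^{(1/2)(n-p)}\det\bigl(I_{p-1}-Z\bigr)^{h-(1/2)p}\Bigl[\nu_2'\Sigma_{22}^{-1/2}Z\Sigma_{22}^{-1/2}\nu_2\Bigr]^{\ell}\,dZ\\
&\quad=\det\bigl(\Sigma_{22}\bigr)^{-h}\,\frac{\partial^{\ell}}{\partial\eta^{\ell}}\bigg|_{\eta=0}\int_{0<Z<I_{p-1}}\det(Z)^{(1/2)(n-p)}\det\bigl(I_{p-1}-Z\bigr)^{h-(1/2)p}\operatorname{etr}\Bigl(\eta\,\nu_2'\Sigma_{22}^{-1/2}Z\Sigma_{22}^{-1/2}\nu_2\Bigr)\,dZ\\
&\quad=\det\bigl(\Sigma_{22}\bigr)^{-h}\,\frac{\Gamma_{p-1}\bigl((1/2)n\bigr)\Gamma_{p-1}(h)}{\Gamma_{p-1}\bigl((1/2)n+h\bigr)}\,\frac{\partial^{\ell}}{\partial\eta^{\ell}}\bigg|_{\eta=0}{}_1F_1^{(p-1)}\Bigl(\tfrac12 n;\tfrac12 n+h;\eta\,\Sigma_{22}^{-1}\nu_2\nu_2'\Bigr)\\
&\quad=\det\bigl(\Sigma_{22}\bigr)^{-h}\,\frac{\Gamma_{p-1}\bigl((1/2)n\bigr)\Gamma_{p-1}(h)}{\Gamma_{p-1}\bigl((1/2)n+h\bigr)}\,\frac{\bigl((1/2)n\bigr)_{\ell}}{\bigl((1/2)n+h\bigr)_{\ell}}\bigl(\nu_2'\Sigma_{22}^{-1}\nu_2\bigr)^{\ell},
\end{aligned}
\tag{2.10}
\]
where ${}_1F_1^{(p-1)}$ is the confluent hypergeometric function of matrix argument (see [4]).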
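The last equality in (2.10) uses only the power series of the confluent hypergeometric function. As a brief sketch, assume the matrix argument has rank one, so that the function reduces to its scalar form as in (2.6), and write $x=\nu_2'\Sigma_{22}^{-1}\nu_2$, $a=(1/2)n$, $c=(1/2)n+h$, and $\ell$ for the order of differentiation (the power of the quadratic form in (2.9)). Then
\[
{}_1F_1(a;c;\eta x)=\sum_{j=0}^{\infty}\frac{(a)_j}{(c)_j}\,\frac{(\eta x)^{j}}{j!}
\quad\Longrightarrow\quad
\frac{\partial^{\ell}}{\partial\eta^{\ell}}\,{}_1F_1(a;c;\eta x)\bigg|_{\eta=0}=\frac{(a)_{\ell}}{(c)_{\ell}}\,x^{\ell},
\]
since differentiating $\ell$ times and setting $\eta=0$ retains only the $j=\ell$ term, whose factor $\ell!$ cancels the $1/\ell!$ in the series.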

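Collecting terms containing $y_1$ and integrating, we obtain
\[
\int_0^\infty y_1^{h-1}\bigl(1+\sigma_{11}y_1\bigr)^{-((1/2)n+h+k)}\,dy_1
=\sigma_{11}^{-h}\,\frac{\Gamma\bigl((1/2)n\bigr)\Gamma(h)}{\Gamma\bigl((1/2)n+h\bigr)}\,\frac{\bigl((1/2)n\bigr)_k}{\bigl((1/2)n+h\bigr)_k}.
\tag{2.11}
\]
Substituting (2.10), (2.11), and (1.7) in (2.9) and simplifying the resulting expression using results on the gamma function, we get
\[
\begin{aligned}
E_\gamma\bigl(U^h\bigr)={}&\exp\Bigl(-\tfrac12 c_\gamma^2\nu'\Sigma_0^{-1}\nu\Bigr)\frac{\Gamma\bigl((1/2)n\bigr)}{\Gamma\bigl((1/2)(n-p+1)\bigr)}\sum_{t=0}^{\infty}\sum_{k+\ell=t}\Bigl(\frac{c_\gamma^2}{2}\Bigr)^{t}\frac{\bigl((1/2)n\bigr)_k\bigl((1/2)n\bigr)_{\ell}}{\bigl((1/2)n\bigr)_t}\\
&\times\frac{\bigl(\nu_1^2/\sigma_{11}\bigr)^{k}\bigl(\nu_2'\Sigma_{22}^{-1}\nu_2\bigr)^{\ell}}{k!\,\ell!}\,
\frac{\Gamma\bigl((1/2)(n-p+1)+h\bigr)\Gamma\bigl((1/2)n+t+h\bigr)}{\Gamma\bigl((1/2)n+k+h\bigr)\Gamma\bigl((1/2)n+\ell+h\bigr)}\\
={}&\exp\Bigl(-\tfrac12 c_\gamma^2\nu'\Sigma_0^{-1}\nu\Bigr)\frac{\Gamma\bigl((1/2)n\bigr)\Gamma\bigl((1/2)(n-p+1)+h\bigr)}{\Gamma\bigl((1/2)n+h\bigr)\Gamma\bigl((1/2)(n-p+1)\bigr)}\sum_{k=0}^{\infty}\Bigl(\frac{c_\gamma^2}{2}\Bigr)^{k}\\
&\times\frac{\bigl(\nu_1^2/\sigma_{11}\bigr)^{k}}{k!}\,{}_2F_2\Bigl(\tfrac12 n+h+k,\tfrac12 n;\tfrac12 n+k,\tfrac12 n+h;\tfrac12 c_\gamma^2\nu_2'\Sigma_{22}^{-1}\nu_2\Bigr),
\end{aligned}
\tag{2.12}
\]
where ${}_2F_2$ is the generalized hypergeometric function of scalar argument (see [6]).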
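The double series in the first form of (2.12) can be evaluated numerically by truncation. The Python sketch below is one way to do so under stated assumptions: the truncation level `terms` is arbitrary, the argument names are hypothetical (`c2` for $c_\gamma^2$, `x` for $\nu_1^2/\sigma_{11}$, `y` for $\nu_2'\Sigma_{22}^{-1}\nu_2$, and `l` for the second summation index), and log-gamma values are used to keep intermediate quantities moderate.

```python
import math


def poch_log(a, k):
    """Logarithm of the Pochhammer symbol (a)_k for a > 0."""
    return math.lgamma(a + k) - math.lgamma(a)


def component_moment(h, n, p, c2, x, y, terms=80):
    """Truncated double series for E_gamma(U^h), first form of (2.12).

    c2 = c_gamma^2, x = nu_1^2 / sigma_11, y = nu_2' Sigma_22^{-1} nu_2.
    Each index k, l runs over 0..terms-1 (an arbitrary truncation).
    """
    a, m = 0.5 * n, 0.5 * (n - p + 1)
    total = 0.0
    for k in range(terms):
        for l in range(terms):
            t = k + l
            log_coef = (
                poch_log(a, k) + poch_log(a, l) - poch_log(a, t)
                - math.lgamma(k + 1) - math.lgamma(l + 1)
                + math.lgamma(m + h) + math.lgamma(a + t + h)
                - math.lgamma(a + k + h) - math.lgamma(a + l + h)
            )
            total += math.exp(log_coef) * (0.5 * c2) ** t * x**k * y**l
    front = math.exp(-0.5 * c2 * (x + y) + math.lgamma(a) - math.lgamma(m))
    return front * total
```

Two sanity checks follow directly from (2.12): at $h=0$ the series sums to one, and when $\nu=0$ (so `x = y = 0`) the moment reduces to $\Gamma((1/2)n)\Gamma((1/2)(n-p+1)+h)/\{\Gamma((1/2)n+h)\Gamma((1/2)(n-p+1))\}$, the $h$th moment of a beta variable.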

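3. Distribution of $R^2$ under mixture normal model. The density function $f(u)$ of $U=1-R^2$ is obtained by taking the inverse Mellin transform of $E(U^h)$ as
\[
f(u)=\sum_{\gamma=0}^{N}\binom{N}{\gamma}\epsilon^{\gamma}(1-\epsilon)^{N-\gamma}f_\gamma(u)
\tag{3.1}
\]
with
\[
f_\gamma(u)=(2\pi\iota)^{-1}\int_{C}E_\gamma\bigl(U^h\bigr)u^{-h-1}\,dh,\qquad 0<u<1,
\tag{3.2}
\]
where $\iota=\sqrt{-1}$ and $C$ is a suitable contour. Substituting (2.12) in (3.2), we obtain
\[
\begin{aligned}
f_\gamma(u)={}&\exp\Bigl(-\tfrac12 c_\gamma^2\nu'\Sigma_0^{-1}\nu\Bigr)\frac{\Gamma\bigl((1/2)n\bigr)}{\Gamma\bigl((1/2)(p-1)\bigr)\Gamma\bigl((1/2)(n-p+1)\bigr)}\sum_{t=0}^{\infty}\sum_{k+\ell=t}\Bigl(\frac{c_\gamma^2}{2}\Bigr)^{t}\\
&\times\frac{\bigl((1/2)n\bigr)_k\bigl((1/2)n\bigr)_{\ell}}{\bigl((1/2)n\bigr)_t}\,\frac{\bigl(\nu_1^2/\sigma_{11}\bigr)^{k}\bigl(\nu_2'\Sigma_{22}^{-1}\nu_2\bigr)^{\ell}}{k!\,\ell!}\,u^{(1/2)n+k+\ell-1}(1-u)^{(1/2)(p-3)}\\
&\times{}_2F_1\Bigl(\tfrac12(p-1)+k,\tfrac12(p-1)+\ell;\tfrac12(p-1);1-u\Bigr),\qquad 0<u<1,
\end{aligned}
\tag{3.3}
\]
where ${}_2F_1$ is the Gauss hypergeometric function (see [6]). To obtain (3.3) we have used the result
\[
\begin{aligned}
&\int_0^1 u^{(1/2)n+h+k+\ell-1}(1-u)^{(1/2)(p-3)}\,{}_2F_1\Bigl(\tfrac12(p-1)+k,\tfrac12(p-1)+\ell;\tfrac12(p-1);1-u\Bigr)\,du\\
&\quad=\frac{\Gamma\bigl((1/2)(p-1)\bigr)\Gamma\bigl((1/2)(n-p+1)+h\bigr)\Gamma\bigl((1/2)n+t+h\bigr)}{\Gamma\bigl((1/2)n+k+h\bigr)\Gamma\bigl((1/2)n+\ell+h\bigr)}.
\end{aligned}
\tag{3.4}
\]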

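The density of $R^2=1-U$ is now derived from the density of $U$ as
\[
g\bigl(R^2\bigr)=\sum_{\gamma=0}^{N}\binom{N}{\gamma}\epsilon^{\gamma}(1-\epsilon)^{N-\gamma}g_\gamma\bigl(R^2\bigr),
\tag{3.5}
\]
where
\[
\begin{aligned}
g_\gamma\bigl(R^2\bigr)={}&\exp\Bigl(-\tfrac12 c_\gamma^2\nu'\Sigma_0^{-1}\nu\Bigr)\frac{\Gamma\bigl((1/2)n\bigr)}{\Gamma\bigl((1/2)(p-1)\bigr)\Gamma\bigl((1/2)(n-p+1)\bigr)}\sum_{t=0}^{\infty}\sum_{k+\ell=t}\Bigl(\frac{c_\gamma^2}{2}\Bigr)^{t}\\
&\times\frac{\bigl((1/2)n\bigr)_k\bigl((1/2)n\bigr)_{\ell}}{\bigl((1/2)n\bigr)_t}\,\frac{\bigl(\nu_1^2/\sigma_{11}\bigr)^{k}\bigl(\nu_2'\Sigma_{22}^{-1}\nu_2\bigr)^{\ell}}{k!\,\ell!}\\
&\times\bigl(1-R^2\bigr)^{(1/2)n+k+\ell-1}\bigl(R^2\bigr)^{(1/2)(p-3)}\\
&\times{}_2F_1\Bigl(\tfrac12(p-1)+k,\tfrac12(p-1)+\ell;\tfrac12(p-1);R^2\Bigr),\qquad 0<R^2<1.
\end{aligned}
\tag{3.6}
\]
By using the result ${}_2F_1(a,b;c;z)=(1-z)^{c-a-b}\,{}_2F_1(c-a,c-b;c;z)$, the above density can be rewritten as
\[
\begin{aligned}
g_\gamma\bigl(R^2\bigr)={}&\exp\Bigl(-\tfrac12 c_\gamma^2\nu'\Sigma_0^{-1}\nu\Bigr)\frac{\Gamma\bigl((1/2)n\bigr)}{\Gamma\bigl((1/2)(p-1)\bigr)\Gamma\bigl((1/2)(n-p+1)\bigr)}\\
&\times\bigl(R^2\bigr)^{(1/2)(p-3)}\bigl(1-R^2\bigr)^{(1/2)(n-p-1)}\sum_{t=0}^{\infty}\sum_{k+\ell=t}\Bigl(\frac{c_\gamma^2}{2}\Bigr)^{t}\frac{\bigl((1/2)n\bigr)_k\bigl((1/2)n\bigr)_{\ell}}{\bigl((1/2)n\bigr)_t}\\
&\times\frac{\bigl(\nu_1^2/\sigma_{11}\bigr)^{k}\bigl(\nu_2'\Sigma_{22}^{-1}\nu_2\bigr)^{\ell}}{k!\,\ell!}\,{}_2F_1\Bigl(-k,-\ell;\tfrac12(p-1);R^2\Bigr),\qquad 0<R^2<1.
\end{aligned}
\tag{3.7}
\]
It is interesting to note that if $\nu=0$, then the density $g(R^2)$ reduces to
\[
g\bigl(R^2\bigr)=\frac{\Gamma\bigl((1/2)n\bigr)}{\Gamma\bigl((1/2)(p-1)\bigr)\Gamma\bigl((1/2)(n-p+1)\bigr)}\bigl(R^2\bigr)^{(1/2)(p-3)}\bigl(1-R^2\bigr)^{(1/2)(n-p-1)},\qquad 0<R^2<1,
\tag{3.8}
\]
that is, the beta density with parameters $(1/2)(p-1)$ and $(1/2)(N-p)$ obtained under normality.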
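Because the Gauss function in (3.7) has non-positive integer upper parameters, it is a finite sum, which makes the component density straightforward to evaluate. The following Python sketch truncates the double series at an arbitrary level `terms`; the names are hypothetical (`c2` for $c_\gamma^2$, `x` for $\nu_1^2/\sigma_{11}$, `y` for $\nu_2'\Sigma_{22}^{-1}\nu_2$, `l` for the second summation index), and it is an illustration under these assumptions rather than a tuned implementation.

```python
import math


def hyp2f1_terminating(k, l, c, z):
    """2F1(-k, -l; c; z) for non-negative integers k, l (a finite sum)."""
    total, term = 1.0, 1.0
    for j in range(min(k, l)):
        # Ratio of consecutive series terms for upper parameters -k and -l.
        term *= (j - k) * (j - l) / ((c + j) * (j + 1)) * z
        total += term
    return total


def component_density(r2, n, p, c2, x, y, terms=40):
    """Truncated series for g_gamma(R^2) in (3.7), for 0 < r2 < 1."""
    a, g, m = 0.5 * n, 0.5 * (p - 1), 0.5 * (n - p + 1)
    series = 0.0
    for k in range(terms):
        for l in range(terms):
            # log of (a)_k (a)_l / ((a)_{k+l} k! l!)
            log_coef = (
                math.lgamma(a + k) + math.lgamma(a + l)
                - math.lgamma(a) - math.lgamma(a + k + l)
                - math.lgamma(k + 1) - math.lgamma(l + 1)
            )
            weight = math.exp(log_coef) * (0.5 * c2) ** (k + l) * x**k * y**l
            series += weight * hyp2f1_terminating(k, l, g, r2)
    log_beta = math.lgamma(a) - math.lgamma(g) - math.lgamma(m)
    kernel = math.exp(log_beta) * r2 ** (g - 1) * (1 - r2) ** (m - 1)
    return math.exp(-0.5 * c2 * (x + y)) * kernel * series
```

With `x = y = 0` only the leading term survives and the function returns the beta density (3.8). The mixture density (3.5) is then the binomial average of `component_density` over $\gamma=0,\dots,N$ with `c2` set to $\gamma(N-\gamma)/N$.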

References

[1] A. K. A. Amey, Robustness of the multiple correlation coefficient when sampling from a mixture of two multivariate normal populations, Comm. Statist. Simulation Comput. 19 (1990), no. 4, 1443–1457.

[2] A. K. A. Amey and A. K. Gupta, Testing sphericity under a mixture model, Austral. J. Statist. 34 (1992), 451–460.

[3] A. K. Gupta and D. G. Kabe, On some noncentral distribution problems for the mixture of two normal populations, Metrika 38 (1991), 1–10.

[4] A. K. Gupta and D. K. Nagar, Matrix Variate Distributions, Chapman & Hall/CRC, Florida, 2000.

[5] D. G. Kabe and A. K. Gupta, Hotelling's $T^2$-distribution for a mixture of two normal populations, South African Statist. J. 24 (1990), 87–92.

[6] Y. L. Luke, The Special Functions and Their Approximations, Vol. I, Academic Press, New York, 1969.

[7] D. K. Nagar and M. E. Castañeda, Distribution of correlation coefficient under mixture normal model, to appear in Metrika, 2002.

[8] M. S. Srivastava, On the distribution of Hotelling's $T^2$ and multiple correlation $R^2$ when sampling from a mixture of two normals, Comm. Statist. A Theory Methods 12 (1983), no. 13, 1481–1497.

[9] M. S. Srivastava and H. M. Awan, On the robustness of Hotelling's $T^2$-test and distribution of linear and quadratic forms in sampling from a mixture of two multivariate normal populations, Comm. Statist. A Theory Methods 11 (1982), no. 1, 81–107.

[10] M. S. Srivastava and H. M. Awan, On the robustness of the correlation coefficient in sampling from a mixture of two bivariate normals, Comm. Statist. A Theory Methods 13 (1984), 371–382.

[11] W. Y. Tan, On the distribution of the sample covariance matrix from a mixture of normal densities, South African Statist. J. 12 (1978), 47–55.

[12] D. M. Titterington, A. F. M. Smith, and U. E. Makov, Statistical Analysis of Finite Mixture Distributions, John Wiley, Chichester, 1985.

Hydar Ali: Department of Mathematics and Computer Science, the University of the West Indies, St. Augustine, Trinidad and Tobago

Daya K. Nagar: Departamento de Matemáticas, Universidad de Antioquia, Medellín, A. A.1226, Colombia
