Photocopying permittedbylicenseonly the Gordon and Breach Science Publishers imprint.
Printed in Malaysia.
Inequalities on the Singular Values of an Off-Diagonal Block of
a Hermitian Matrix*
CHI-KWONGLI and ROY MATHIAS
Departmentof Mathematics,College ofWilliam& Mary, Williamsburg,VA23187,USA
(Received26 November 1997,"Revised4 February1998)
Amajorization relating the singularvalues ofanoff-diagonal block ofaHermitian matrix anditseigenvaluesis obtained. This basicmajorization inequality impliesvariousnewand existingresults.
Keywords." Hermitianmatrix; Eigenvalue; Singular value; Majorization AMS SubjectClassification: 15A42; 15A60
1
INTRODUCTION
Let
A(H) >_ _> A,(H)
denote the eigenvalues ofan nxnHermitian matrix H. For an mxn complex matrixX,
letri(X)- V/Ai(X*X)
denote theithsingular value for 1,...,k,where k min{m,
n},
and letor(X) (Crl(A),..., r(A))
be thevectorof singular values ofX.In [2],
the following resultwasobtained as ageneralization ofaresultin[6].
THEOREM Suppose Hisan n xn positive
definite
matrix. Thenfor
anynx kmatrixXsuch that
X*X I,
tr(X*HX (X*HX) 2) <
- .-(A(H)-A,_j.+(H)) 2.
Thisresearchwassupported bygrants from theNSF.
Corresponding author. E-mail:[email protected].
137
In fact,thisresultwasconjectured byJ.Durbin, and provedin
[1]
and[3],
independently.Thisresultisimportantinthecontext ofstudying the relativeperformance of the least squaresestimatorand the bestlinear unbiased estimator in a linearmodel[1].
Observe thatifUis aunitary matrixof the formIX
Y], thenX*HY
is ak nmatrix, andrn
tr(X*HZX (X*HX) 2) rj(X*HY) 2,
j=l
wherem min{k,
n-k}.
Inthe following,weobtain amajorization result thatwillallowone to deduce a whole family of inequalities including Theorem 1. In Refs.
[1-3],
theproofof Theorem wasdone using partialdifferentia- tion to locate the optimal matrix that yields the upper bound oftr(X*H2X (X*HX)2).
Inourcase,we usedifferentapproachestogive twoproofs forourresult Theorem 2 thatconnect ourproblem to other subjects.We need some moredefinitions to state ourresult. Given two real
(row
orcolumn)
vectorsx, yERn,
wesay thatx isweaklymajorizedbyy, denotedbyx<w
y, ifthesumof the klargestentries ofx is not largerthan that of y for each k 1, n.Ifinaddition thesumof theentries of each of thevectorsisthesamethenwesay thatx ismajorized by y.
THEOREM 2 LetHbeann n Hermitianmatrix. Then
for
anyunitarymatrix U
of
theform [X
Y], whereXisn kmatrix, wehavecr(X*HY) -<w 1/2(A1 (H) An(H),..., Am(H)
An_m+l(n)),
wherem-min{k,
n-k}.
Consequently,for
anySchurconvex increasingfunction f:
Rm---+I,
wehavef(a(X*HY)) _< f(1/2(A, (H)- An(H),...,Am(H)- An-m+l(O)) ).
Note that if we take
f(x)- _,jm=, x2i
in Theorem 2, we obtainTheorem 1. In fact, there are many other interesting Schur convex functions
(see
[5, Chapter 1]fordetails). Forinstance,f(x) )P--1 IxJl
pwith p
>
and the kth elementary symmetric function Ek(Xl,... ,Xm) with_<
k<_
m aresuch examples.2
PROOFS
We first give a proof of Theorem 2 using the theory of majorization
(see [5]
for the general background) and a reduction ofthe problem tothe 2 2case.First proof
of
Theorem 2Assume,
without loss of generality, that U Iand that k< (n-k).
WriteHi1
B)
H B*
H2:2
where
Hl
isk k and922
is(n-k) (n-k).
Let B- WEVbe a singular value decomposition ofB,where Vand Ware unitary.
Thenthe eigenvalues of thematrix
H- 0 H
0 V*
are the same asthose ofH. The 2 2 principal submatrix ofHlying in rowsand columns and k
+
is[i,
k+ i] ( cri(B) -lii hk+i,k+i
_ffi(B) ),
i-1,...,k.One easily checks that Theorem 2 is truewhen n-2 andk- 1.
As
a result, if[i,
k+ i] RT ( #iO T]iO ) Ri’
where#i_/iand
RRi-
12, thenffi(B) (i- 1]i)/2.
Let R be the n n unitary matrix obtained from
In
by replacingIn[i,k+i] by R for all 1,...,k. Then
(R nR)i
i and(R*R)k+i,k+i-
]i. Since the vector of diagonal entries ofR*gR is majorizedbythevectorof eigenvalues ofR*R
(e.g.,see[5,Chapter 9, B.1]),
for any 1,...,k,wehaveZ
#i--Z(g*g)ii <_ Z Ai(H)
i=1 i=1 i=1
and
Z
T]iZ (R nR)k+i,
k+iZ )n-i+l(n)"
i=1 i=1 i=1
Itfollowsthat
Z o’i(B)<_ Z(#i-
gli)/2<_ Z(,i(H)- ,n-i+l(H))/2.
i=1 i=1 i=1
Next,
we give a proof of Theorem 2 using the theory of the C-numerical range (e.g., see[4]
and its references for the general background).Secondproof
of
Theorem 2By
the singular value decomposition,one canfindunitarymatrices Vand Wof appropriate sizessuchthat(VX*HYW)jj- crj(X*HY),
j- 1,...,m.Thus for any positive integer with
_< _<
m,if weletCt -]=1
E+j,j,where
{El
1,El2,...,Enn}
denotes the standardbasis forn nmatrices, thenZ crj(X*HY) <_
max{I 7(RX*HYS)jI" R,
Sunitary}
j=l j=l
_<
max{I Z(Z*HZ)+j, jI"
Zunitary}
j=l
max(ltr(Z*HZC,)l"
Z unitary}max{Itr(HZC,Z*)l"
Zunitary},which canbe viewedastheH-numerical radius
rH(Ct)
ofCt
(e.g.,see[4]
for thegeneral background).
Moreover,
sinceCt=( 0, 00)
isinthe so-calledshiftblock form and
(Ct + C)/2
has eigenvalues"i/2,...,1/2,0,...,0,-1/2,...,-1/2,
wehave(e.g.,see [4,
(5.1)
and(5.2)])
rn(Ct) max{[tr(HZ(Ct + C’[)Z*)/2 I"
Zunitary}
Aj((Ct +
j=l
Z(/j(H) /n-j+l (H))/2.
j=l
Remarks
Suppose
thatA1 _> _> An
aregiven real numbers and that kis apositive integer such that<
k<
n.Letm min{k,n-k}.
Onecan construct 2 2 matricesHi
with eigenvalues ,i,/n_m+i, and off- diagonalentriesequalto(i-)n_m+ i)/2.Applyingasuitablepermuta- tionsimilaritytothematrixH
H1
0""Hm
diag(Am+l,...,An-m)
willyieldamatrix
where Bisk
(n-k)
such thatif
<
i=j<
k,BO
0 otherwise.Clearly,/)
has eigenvalues A1,...,An.
Thus, we see that ourresult inTheorem 2 is best possible.
In the context of statistics one is interested in real symmetric matrices. SinceTheorem 2is trueforHermitian matrices it is
afortiori
truefor real symmetricmatrices. Itcannotbe improvedinthecase of real symmetric matrices either because the matrix constructed in the example aboveis areal symmetricmatrix.
Acknowledgement
We thank Professor G. Styan for drawingourattention to
[1,2],
and ProfessorZ.Jiafor sendingus acopy of thepaper.References
[1] P. Bloomfield and G.S. Watson, The inefficiency of least squares, Biometrika62 (1975),121-128.
[2] Z. Jia, An extension of Styan’s inequality (Chinese), Gongcheng Shuxue Xuebao (J.EngineeringMathematics)13(1996),122-126.
[3] M. Knott,Onthe minimumefficiency ofleastsquares,Biometrika 62(1975),129-132.
[4] C.K. Li, C-numericalrangesand C-numericalradii,Linearand MultilinearAlgebra 37(1994),51-82.
[5] A.W. Marshall and I. Olkin, Inequalities: The Theory ofMajorization and its Applications,AcademicPress,1979.
[6] G.P.H. Styan,Onsomeinequalitiesassociated withordinaryleastsquares and the Kantorovichinequality,ActaUniv.Tampere,Ser.A.153(1983),158-166.
Note
addedinproofX. Zhan has another proof of our main theorem, which is also obtained independently by R. Bhatia, F.C. Silva, P. Assouad and