• 検索結果がありません。

Estimation of High Dimensional Precision Matrix using Random Matrix Theory (Statistical Inference on Divergence Measures and Its Related Topics)

N/A
N/A
Protected

Academic year: 2021

シェア "Estimation of High Dimensional Precision Matrix using Random Matrix Theory (Statistical Inference on Divergence Measures and Its Related Topics)"

Copied!
9
0
0

読み込み中.... (全文を見る)

全文

(1)

Estimation of

High

Dimensional

Precision

Matrix

using

Random Matrix

Theory

Tsubasa Ito

Graduate School of

Economics, University

of

Tokyo

1

Introduction

About the problem of estimating the high-dimensional covariance matrix, it is well

known that

we

cannot invert thestandardsamplecovariancematrix$S_{p}$when$p>N$, and

even

if$N>p$ but$p/N$ is relatively large it performs poorly. When

we

have

no

advance

information about the structure of the population covariance matrix $\Sigma_{p\rangle}$ shrinking $S_{p}$

to

some

stable statistics improves the performance. There

are

httle research

on

the

direct estimation of the precision matrix and

seems

to be room for improvement

over

theestimators, $U_{p}A_{p}U_{p}^{T},$ $\alpha(S_{p}+\gamma I_{p})^{-1},$ $\alpha S_{p}^{-1}+\beta I_{p}$ proposed in recent years. Then

we

propose $\alpha(S_{p}+\gamma I_{p})^{-1}+\beta I_{p}.$

2

Preliminaries

We begin by stating the basic assumptions which

are

common

in estimation of the

high-dimensionalcovariance matrixbased

on

therandommatrix theory. Throughout the

paper, and denote thespaces of real andcomplexnumbers,respectively. Also, $+$

denotes

the half-plane of complex numbers with strictly positive imaginary part. The real and

imaginary partsof$z\in are$ denotedby$\mathfrak{R}(z)$ and $\Im(z)$, respectively.

(A1) $p/Narrow y\in(O, 1)\cup(1, +\infty)$ as$p,$$Narrow+\infty.$

(A2) $\Sigma_{p}$ is

a

non-random$p$-dimensional positive definite matrix. $X_{f}=(x_{p,1}, \ldots,x_{p,N})^{\prime r}$

is

an

$Nxp$randommatrix, where$x_{p,1}$,

..

.

,$x_{\varphi,N}$

are

mutuallyi.i.d

as

$E[x_{p,j}]=0$ and

$Cov(x_{p,j})=I_{p}.$ $Y_{p}=(y_{p,1}, \ldots,y_{p,N})^{T}$, where$y_{p,j}=\Sigma_{p}^{1/2}x_{p,j}.$

(A3) $t_{p}=(l_{p,1}, \cdots,t_{p,p})^{r}r$ is a system of eigenvalues of $\Sigma_{p}$, sorted in decreasing order.

The empirical spectraldistribution (ESD) of$\Sigma_{p}$is defined by

(2)

$H_{p}(t)$ convergesto limit $H(t)$ at all pointsofcontinuityof $H.$

(A4) $Supp(H)\}$ the support of $H$, is the union of a finite number of closed intervals,

bounded away from

zero

and infinity.

Let $S_{p}=N^{-1}Y_{p}^{T}Y_{p}.$ $P_{p}=(l_{p}, {}_{1}P_{p,p})^{T}$ and $(u_{1}, \ldots, u_{p})$

are

a system of eigenvalues

sorted in decreasing order andeigenvectorsof$S_{p}$The empiricalspectraldistribution (ESD)

of$S_{p}$ is defined by

$F_{p}(t) \equiv\frac{1}{p}\sum_{i=1}^{p}I_{[l_{p,t},+\infty)}, \forall t\in \mathbb{R}.$

For

a

nondecreasing

function

$G$

on

the real line, the stieltjes transform $m_{G}$ of $G$ is

defined by

$m_{G}(z) \equiv\int\frac{1}{x-z}dG(x) , \forall z\in \mathbb{C}^{+},$

where $\mathbb{C}^{+}$

denotes the half-plane of complex numbers with strictly positive imaginary

part.

The stieltjestransformhas the well-knowninversionformula

$G \{[a, b]\}=\frac{1}{\pi}\lim_{\etaarrow 0+}\int_{a}^{b}\Im(m_{G}(\xi+i\eta)\rangle d\xi,$

if$G$iscontinuous at$a$ and$b$

.

Stieltjes transform of $F_{p}$ is

$m_{F_{p}}(z)= \int\frac{1}{\lambda-z}dF_{p}(\lambda)=\frac{\lambda}{p}\sum_{i=1}^{p}\frac{1}{\ell_{i}-z}=\frac{1}{p}tr(S_{p}-zI_{p})^{-1}$

Under $(A1)-(A4)$ and assumption that entries of$X_{p}$areindependentwith

common mean

and variance and forany $\eta>0$,

as

$p/Narrow y$

$\frac{1}{\eta^{2}Np}\sum_{jk}E[|x_{jk}^{(p\rangle}|^{2}I(|x_{jk}^{(p)}|>\eta N^{1/2})]arrow 0,$

there exists

a

distribution function $F$ (limiting spectral distribution (LSD)) suchthat

$F_{p}(x)arrow F(x) , \forall x\in \mathbb{R}\backslash \{0\}.$

$F$ iseverywherecontinuous except at zero, and that the mass of$F$ at

zero

is

$F( O)=\max\{1-y^{-1}, H(O)\}.$

Under the

same

assumptions,$m\equiv m_{F}(z)$is theuniquesolutiontotheequation (Silverstein (1995))

(3)

3

Estimation of the precision matrix

We consider the followinglossfunction $L_{p}(\Sigma_{p}^{-1}, \Omega_{r})\equiv\frac{1}{p}tr(\Omega_{p}\Sigma_{p}-I_{p})(\Omega_{p}\Sigma_{p}-I_{p})^{T}$

In-stead ofminimizing$R(\Sigma_{p}^{-1}, \Omega_{p})\equiv E[L_{p}(\Sigma_{p}^{-1}, \Omega_{p} we$minimize$the$limit$of L_{p}(\Sigma_{p}^{-1}, \Omega_{p})$ obtained from RMT. We consider rotation-equivariant estimator.

$\Omega_{p}=U_{p}A_{p}U_{p}^{T}$ where$A_{p}\equiv Diag(a_{1}, \cdots, a_{p})$

finite-sampleoptimal$a_{i}$ is

$a_{i}^{*}= \frac{u_{\T\Sigma_{p}u_{i}}{u_{1p^{1l}}^{T_{\Sigma 2}}}$

Ledoit and Wolf (2012) consider the hmit of$\tilde{a}_{i}=4^{T}\Sigma_{p}u_{i}$under$\tilde{L}_{p}(\Sigma_{p}^{-1}, \Omega_{p})=\frac{1}{p}tr(\Sigma_{p}^{-1}-$

$\Omega_{p})^{2}.$ $\delta(\ell_{i})$, the limit of$u_{i}^{T}\Sigma_{p}u_{i}$ is, (Ledoit and Peche (2011))

$\delta(l_{i})=\{\begin{array}{ll}\frac{\ell_{\triangleleft}}{|_{\frac{1-y-y\ell_{i}mp1}{(y-1)mp(0)}}(\ell_{i})|^{2}} if\ell_{i}>0if\ell_{i}=0mdy>10 otherwise\end{array}$

$\phi(\ell_{i})$, the limit of$u_{i}^{T}\Sigma_{p}^{2}u_{i}$ is

$\phi(\ell_{i})=\{\begin{array}{ll}\frac{\ell^{2},\{1-y^{2}-2y^{2}\ell.\Re[m_{F}(\ell_{i})]-y^{2}\ell_{i}^{2}|m_{F}(l_{:})|\}}{\overline{},y-\overline{1}^{\frac{+1}{m_{E_{\sim}}})-\frac{1}{ym_{L}(0)})}A\frac{1\}_{-\infty^{tdH(t)y\ell_{i}}}^{1-y-y\ell.m_{F}}\infty}{(0)|1-y-y\ell.mp(\ell_{:})|(\int_{-\infty}^{\infty}tdH(t}(x)\rangle^{2}|^{2}} if\ell_{i}=0andy>1ifl_{i}>00 otherwise\end{array}$

$\underline{F}$isLSD of$\frac{1}{N}Y_{p}Y_{p}^{T}=\frac{1}{N}X_{p}\Sigma_{p}X_{p}^{T}$and$m_{\underline{F}}(z)$isthesolution of$m=-[z-y \int\frac{t}{1+tm}dH(l)]^{-1}.$

By replacing$m_{F}(P_{i})$ and$m_{F}(O)$withtheir estimator$\hat{m}_{F}(l_{i})$ and$\hat{m}_{\underline{F}}(0)$,weobtain$\hat{\Omega}_{p}^{LW}=$

$U_{p}\hat{A}_{p}U_{p}^{T}\hat{a}_{i}^{*}=\hat{\delta}(\ell_{i})/\hat{\phi}(P_{i})$

.

We

use a

package QuEST

on

Matlab introduced in Ledoit

and Wolf to estimate$\hat{m}_{F}(\ell_{i})$

.

In this algorithm,

we

obtain$\hat{t}_{p}$, theconsistent estimatorof

eigenvalues of$\Sigma_{p}$and solve

$m= \frac{1}{p}\sum_{i=1}^{p}\frac{1}{\hat{t}_{i,p}(1-(p/N)-(p/N)\ell_{i}m)-\ell_{i}}$

When$N,p$arerelatively small,the approximations of$u_{i}^{T}\Sigma_{p}u_{i},$ $u_{i}^{T}\Sigma_{p}^{2}u_{i}$by$\hat{\delta}(P_{i})$, $\hat{\phi}(\ell_{i})$

be-comebad, and$\hat{\Omega}_{p}^{LW}$performs poorly. Weproposethefollowingestimatorof theprecision matrix.

(4)

Inthe

case

of$N>p$, consider the following hierarchical bayes model.

$V(=NS_{p})|\Sigma_{p}\sim \mathcal{W}_{p}(N, \Sigma_{p})$

$2_{p}^{-1}|\eta\sim(1-\eta)\mathcal{W}_{p}(k, \Lambda_{1})+\eta\delta_{\Lambda 0}(\Sigma_{p}^{-1})$

$\eta\sim Ber(\theta)$

Denote pdf of V andprior distribution of$\Sigma_{p}^{-1}$ by

V $|\Sigma_{p}^{-1}\sim f(V|\Sigma_{p}^{-1})$

$\Sigma_{p}^{-1}|\eta\sim(1-\eta)\pi(\Sigma_{p}^{-1}|\Lambda_{1})+\eta\delta_{A0}(\Sigma_{p}^{-1})$

Thejoint distribution of $(V, \Sigma_{p}^{-1})$ and marginal distributionof V

are

$f(V, \Sigma_{p}^{-1})=f(V|\Sigma_{p}^{-1})\{\langle 1-\theta)\pi(\Sigma_{p}^{-1}|A_{1}\rangle+\theta\delta_{\Lambda_{0}}(\Sigma_{p}^{-1}\rangle\}$

$f( V\rangle=(1-\theta)\int f(V|\Sigma_{p}^{-1})\pi(\Sigma_{p}^{-1}|\Lambda_{1})d\Sigma_{p}^{-1}+\theta f(V|\Lambda_{0}\rangle.$

$\Omega_{p}^{Bayeo}=E[\Sigma_{p}^{-1}|V]$ is

$\Omega_{p}^{Bayes}= \int\Sigma_{p}^{-1}f(V, \Sigma_{p}^{-1})d\Sigma_{p}^{-1}/f(V)$

$= \frac{(1-\theta)[\Sigma_{p}^{-1}f(V|\Sigma_{\overline{v}}^{1})\pi(\Sigma_{\mathcal{D}}^{-1}|\Lambda_{1})d\Sigma_{p}^{-1}+\theta\Lambda_{0}f(V|\Lambda_{0})}{(1-\theta)\int f く V|*^{-1})\pi(l_{p}^{-1}|A_{1})d\Sigma_{p}^{-1}+\theta;(V|A_{0})}$

$= (1-w_{0}) \frac{\int\Sigma_{l}^{-1}f\langle V|2_{p}^{-1})\pi(\mathfrak{B}_{\overline{p}}^{1}|\Lambda_{1}\rangle d\Sigma_{\overline{p}}^{1}}{\int f\langleV|\mathfrak{B}_{\overline{p}^{1}})\pi\langle\Sigma_{\overline{p}}^{1}|A_{1}\rangle d\mathfrak{B}_{p}^{-1}}+w_{0}\Lambda_{0},$

where

$w_{0}= \frac{\theta f(V|\Lambda_{0})}{(1-\theta\rangle\int f(V|\Sigma_{p}^{-1})\pi(\Sigma_{p}^{-1}|\Lambda_{1})d\Sigma_{p}^{-1}+\theta f(V|\Lambda_{0})}.$

Let $v_{0}=(N+k)/N,$

$\int\Sigma_{p}^{-1}f(V|\Sigma_{p}^{-1})\pi(\Sigma_{p}^{-1}|\Lambda_{1})d\Sigma_{p}^{-1}$

$= (N+k)(V+\Lambda_{1})^{-\lambda}$

$\int f(V|\Sigma_{p}^{-1})\pi(\Sigma_{p}^{-1}|\Lambda_{1}\rangle d\Sigma_{p}^{-1}$

$= v_{0}(S_{p}+N^{-1}\Lambda_{1})^{-1},$

then,

we

get

$\Omega_{p}^{Bayes}=(1-w_{0})v_{0}(S_{p}+N^{-1}\Lambda_{1})^{-1}+w_{0}A_{0}.$

where $v_{0}>1,$ $0<w_{0}<1$

.

Letting $\Lambda_{1}=N\gamma I_{p},$ $\Lambda_{0}=(1/\overline{P})I_{x},$ $\overline{\ell}=\sum_{i=1}^{p}P_{i}/p=tr[S_{p}]/p,$

$\alpha=v_{0}(1-w_{0})_{\}}\beta=w_{0}/\overline{p}_{\rangle}$

we

obtain

(5)

We estimate$\alpha,$ $\beta,$ $\gamma$ to satisfy$v_{0}>1,$ $0<w_{0}<1.$ Under $L_{p}( \Sigma_{p}^{-1}, \Omega_{p})\cong\frac{1}{p}tr(\Omega_{p}\Sigma_{p}-I_{p})(\Omega_{p}\Sigma_{p}-I_{p})^{T},$

$\alpha^{*}(\gamma)= \frac{tr[(S_{p}+\gamma I_{p})^{-1}\Sigma_{p}]tr[\Sigma_{p}^{2}]-tr[(S_{p}+\gamma I_{p})^{-1}\Sigma^{2},]tr[\Sigma_{p}]}{tr[(S_{p}+\gamma I_{p})^{-2}\Sigma_{p}2]tr[\Sigma_{p}^{2}]-\{tr[(S_{p}+\gamma I_{p})^{-12}\Sigma_{p}]\}^{2}}$

$\beta^{*}(\gamma)= \frac{tr[(S_{p}+\gamma I_{p})^{-2}\Sigma_{n}^{2}]tr[\Sigma_{p}]-tr[(S_{p}+\gamma I_{p})^{-1}\Sigma_{p}]tr[(S_{p}+\gamma I_{p})^{-1}\Sigma_{p}^{2}]}{tr[(S_{p}+\gamma I_{p})^{-22}\Sigma_{p}]tr[\Sigma_{p}^{2}]-\{tr[(S_{p}+\gamma I_{p})^{-1}\Sigma_{p}2]\}^{l}}$

$L_{p}^{*}(\gamma)= L_{p}(\Sigma_{p}^{-1}, \Omega_{p}^{LR}(\alpha^{*}(\gamma), \beta^{*}(\gamma),\gamma))$

$= \frac{1}{p}[tr[(S_{p}+\gamma I_{p})^{-2}\Sigma_{p}^{2}]tr[\Sigma_{p}^{2}]-\{tr[(S_{p}+\gamma I_{p})^{-1}\Sigma_{p}^{2}]\}^{2}]^{-1}$

$\cross[-\{tr[(S_{p}+\gamma I_{p})^{-1}\Sigma_{p}]\}^{2}tr[\Sigma_{p}^{2}]$

$-tr[(S_{p}+\gamma I_{p})^{-2}\Sigma_{p}^{2}](tr[\Sigma_{p}])^{2}$

$+2tr[(S_{p}+\gamma I_{p})^{-1}\Sigma_{p}]tr[(S_{p}+\gamma I_{p})^{-1}\Sigma_{p}^{2}]tr[\Sigma_{p}]]+1$

Wang, et.al(2014) shows, for $\gamma>0$

$\frac{1}{p}tr[(S_{p}+\gamma I_{p})^{-1}\Sigma_{p}] a.s. \frac{1-\gamma m_{F}(-\gamma)}{1-y(1-\gamma m_{F}(-\gamma))}$

Wang, et.al(2014) shows this by considering the limit of $F^{\Sigma_{p}^{-1/2}(S_{p}+\gamma I_{p})\Sigma_{p}^{-1/2}}$

Fromslide

11,

we

know

$\frac{1}{p}tr[(S_{p}+\gamma I_{p})^{-1}\Sigma_{p}^{2}] a.s.\frac{-\gamma+\gamma^{2}m_{F}(-\gamma)}{(1-y(1-\gamma m_{F}(-\gamma)\rangle)^{2}}$

$+ \frac{\int tdH(t)}{1-y(1-\gamma m_{F}(-\gamma))}.$

Since$p^{-1}tr[(S_{p}+\gamma I_{p})^{-2}\Sigma_{p}^{2}]=-(d/d\gamma)p^{-1}tr[(S_{p}+\gamma I_{p})^{-1}\Sigma_{p}^{2}],$

$\frac{1}{p}tr[(S_{p}+\gamma I_{p})^{-2}\Sigma_{p}^{2}] arrow \frac{d}{d\gamma}\{\frac{-\gamma+\gamma^{2}m_{F}(-\gamma)}{(1-y(1-\gamma m_{F}(-\gamma)))^{2}}$

$+ \frac{\int tdH(t)}{1-y(1-\gamma m_{F}(-\gamma))}\}$

We estimate $m_{F}(-\gamma)$ と $m_{F}’(-\gamma)$ by $p^{-1}tr[(S_{p}+\gamma I_{p})^{-1}],$ $p^{-1}tr[(S_{p}+\gamma I_{p})^{-2}]$

.

Consis-tent estimator of$p^{-1} tr(\Sigma_{p})arrow\int tdH(t)$ is $p^{-1}tr(S_{p})$

.

For $p^{-1} tr(\Sigma_{p}^{2})arrow\int t^{2}dH(t)$, $\hat{a}_{2}=$

($N-1\rangle N^{-1}(N-2)^{-1}(N-3\rangle^{-1}p^{-1}[(N-1)(N-2)tr(S_{p})^{2}+\{tr(8_{p})\}^{2}-NQ]$, where, $Q=(N-1)^{-1} \sum_{i=1}^{N}\{(y_{i}-$ $\overline{y})^{T}(y_{i}-\overline{y})\}^{2}$ is a consistent estimator which proposed by Himeno and Yamada $(2014\rangle.$

We look at two estimators: the ridge and the linear shrinkageestimators and check the

optimalvalues of the parametersin theseestimators with respect to

our

loss fumction.

[1] Ridge estimator. (Wang, et.al (2014))

(6)

$\alpha^{r\iota dge*}(\gamma\rangle=\frac{tr[(S,+\gamma I_{p})^{-1}\Sigma_{p}]}{tr((S_{p}+\gamma I_{p}\rangle^{-12}\Sigma_{p}(S_{p}+\gamma I_{p})^{-1})}$, which leads to thereduced loss function

$L_{p}(\Sigma_{p}^{-1}, \Omega_{p}^{ridge}(\alpha^{ridge*}(\gamma),\gamma))$

$=1- \frac{1}{p}\frac{\{[(S_{p}+\gamma I_{p})^{-1}\Sigma_{p}]\}^{2}}{t\iota[(S_{p}+\gamma I_{p})^{-1}\Sigma_{p}a(S_{p}+\gamma I_{p})^{-1}]}.$

[2] Linear shrinkageestimator. (Bodnar, et.al (2014))

The linear shrinkageestimator isoftheform $\Omega_{p}^{linear}=\{$

$\alpha S_{p}^{-1}+\beta I_{p}$ if$N>p$

In the

$\alpha S_{p}^{+}+\beta I_{p}$ if$N<p.$

case

of$N>p,$

$L_{p}(\Sigma_{p}^{-1}, \Omega_{p}^{linear})$

$= \frac{1}{p}\{\alpha^{2}tr[S_{p}^{-2}\Sigma_{p}^{2}]+2\alpha\beta tr[S_{p}^{-1}\Sigma_{p}^{2}]$

$+\beta^{2}tr[\Sigma_{p}^{2}]-2\alpha tr[S_{p}^{-1}\Sigma_{p}]-2\beta tr[\Sigma_{p}]\}+1$

Inthe case of$N<p$, Bodnar, (2014) cannot provide estimators forgeneral $\Sigma_{p}$, because

the limit of$p^{-1}tr[S_{p}^{+}\Sigma_{p}^{-1}]$ is needed, whichcannot be obtained withoutassuming a

struc-ture such as $X_{p}=\sigma^{2}I_{p}$. Without assuming such a structure, however, we

can

obtain

estimators ofthe optimal $a$ and $\beta$ in

our

situation. The loss functionis

$L_{p}(\Sigma_{p}^{-1}, \Omega_{p}^{linear})$

$= \frac{1}{p}\{\alpha^{2}t::[(S_{p}^{+})^{2}\Sigma_{p}^{2}]+2a\beta tr[S_{p}^{+}\Sigma_{p}^{2}]+\beta^{2}tr[\Sigma_{p}^{2}]$

$-2\alpha tr[S_{p}^{+}\Sigma_{p}]-2\beta tr[\Sigma_{p}]\}+1$

so that

we

need the limit of$p^{-1}tr[(S_{p}^{+}\rangle^{2}\Sigma_{p}^{2}$], $p^{-1}tr[S_{p}^{+}\Sigma_{p}^{2}]$ and$p^{-1}tr[S_{p}^{+}\Sigma_{p}]$

.

By Theorem

3.3 in Bodnar, (2014),

one

gets

$\lim_{N,parrow\infty}p^{-1}tr[(S_{p}^{+}\rangle^{2}\Sigma_{p}^{2}]=\lim_{N,parrow\infty}p^{-1}\sum_{i=1}^{N}\frac{\phi(\ell_{i})}{\ell_{i}^{2}}=\int\frac{\phi(x)}{x^{2}}d\underline{F}(x)$

$\varliminf_{N_{4}\infty}p^{-1}tr[S_{p}^{+}\Sigma_{p}^{2}]=Nparrow 1;_{m_{\infty}p^{-1}}\sum_{i=1}^{N}\frac{\phi(\ell_{i})}{\ell_{i}}=\int\frac{\phi(x)}{x}d\underline{F}(x\rangle$

$\lim_{N,parrow\infty}p^{-1}tr[S_{p}^{+}\Sigma_{p}]=\frac{1}{y-1}$

$p^{-1} \sum_{i=1\ell}^{N}\hat{\mathfrak{W}}_{i}^{t_{t}}$ is the estimator of$p^{-1}tr[(S_{p}^{+})^{2}\Sigma_{p}^{2}].$

4

Numerical Results

We comI

are

estimators with $\alpha(S_{p}+\gamma I_{p})^{-1}($Wang, $et.al (2014)$), $\alpha S_{p}^{-1}+\beta I_{p}($Bodnar,

(7)

(D1) $x_{ij}$i.i.d $\sim N(O, 1)$, $i=1,$$\cdots,$$N,$ $j=1,$$\cdots,$ $p$

(D2) $x_{ij}=\sqrt{(m-2)}/mz_{i_{J}’},$ $z_{ij}i.i.d\sim t_{m},$ $i=1,$$\cdots,$$N,$ $j=1,$$\cdots,$$p,$ $m=10$

L.S.$D$of$\Sigma_{p}$ is based

on

Beta distribution

$H_{(a,b)}(x)= \frac{\Gamma(a+b)}{\Gamma(a)\Gamma(b)}\int_{0}^{x}t^{a-1}(1-t)^{b-1}dt, x\in[O, 1 ],$

and the population eigenvalues

are

generated by

$1+9H_{(a,b)}^{-1}( \frac{i}{p}-\frac{1}{2p}) , i=1, p.$

Risk is evaluated by the averaging the empirical losses from 1000 times simulation.

ae

1: EmpiricalRisksof$\Omega_{p}^{orade},$$\Omega_{p}^{LW},$ $\Omega_{p}^{LR},$ $\Omega_{p}^{ridge}$ and$\Omega_{p}^{l1near}$with$N=50,$ $(a, b)=(1,1)$

$p$ oracle LW LR ridge linear

30 0.1538 0.1710 0.1665 0.1681 0.1830 70 0.1703 0.1782 0.1770 0.1813 0.8705 Normal S50 0.1769 0.1854 0.1856 0.1901 0.8081 (a,b)$=(1,1)$ 250 0.1791 0.1933 0.1902 0.1951 0.9056 500 0.1800 0.2342 0.1981 0.2209 0.9757 700 0.1799 0.3789 0.2116 0.2561 0.9877 30 0.1544 0.1704 0.1670 0.1689 0.1878 70 0.1702 0.1786 0.1787 0.1823 0.8612 $t_{10}$ 150 0.1769 0.1849 0.1869 0.1896 0.8110 (a,b)$=(1,1)$ 250 0.1790 0.1911 0.1889 0.1932 0.9062 500 0.1801 0.2189 0.2029 0.2117 0.9757 700 0.1799 0.2632 0.2174 0.2464 0.9876

We conduct Quadratic Discriminant Analysis using the microarray data where

expres-sion levels for 2000 genes

were

measured

on

22 normal and

on

40 colon tumor tissues.

Discriminat rule is

$\frac{N_{1}}{N_{1}+1}(x-\overline{x}_{1})^{T}\Omega_{p}^{(1)}(x-\overline{x}_{1})-\frac{N_{2}}{N_{2}+1}(x-\overline{x}_{2})^{T}\Omega_{p}^{(2\rangle}(x-\overline{x}_{2})<0\Rightarrow x\in\Pi_{1}$

where $\Omega^{LR}\Omega^{LW}\Omega^{r}u_{ge}\Omega^{linear}\Omega^{MP}\Omega^{diag}$

are

used. Correct classification rates

are

$p$ ’ $p$ ’ $p$ $\rangle$

$r$ , $p$ , $p$

evaluatedbyleave-one-out cross-validation.

References

[1] Bodnar,T., Gupta, A.K., andParolya, N. (2015), Optimal linear shrinkageestimator

for largedimensionalprecisionmatrix. J. Multivariate $\mathcal{A}$

(8)

$\ovalbox{\tt\small REJECT} 2$

: EmpiricalRisksof$\Omega_{p}^{oracle},$$\Omega_{r}^{LW},$$\Omega_{p}^{LR},$ $\Omega_{p}^{ridge}$and$\Omega_{p}^{iinear}$with$N=50$underNormal Distribution

$\overline{\frac{(a,b)porac1eLWLRr.idge1i.near}{300.12160.13S40.13270134301437}}$ 70 0.1335 0.1415 0.1416 0.1472 0.8494 (1.5,1.5) 150 0.1388 $(\rangle.1468$ 0,1467 0.1534 0.8043 250 0.1405 0.1551 0.1487 0.1580 0.9087 500 0.1415 0.1887 0.1631 0.1813 0.9766 30 0.2122 0.2288 0.2253 0.2304 0.2542 70 0.2359 0.2441 0.2436 0.2463 0.8932 (0.5,0.5) 1W 0.2444 0.2534 0.2SS5 0.2559 0.8279 250 0.2468 0.2627 0.2547 0.2619 0.9008 500 0.2480 0.2982 0.2629 0.286S 0.9738 $\ovalbox{\tt\small REJECT}$

3: EmpiricalRisksof$\Omega_{p}^{oracle},$ $\Omega_{p}^{LW},$$\Omega_{p}^{LR},$$\Omega_{p}^{ridge}$ and$\Omega_{p}^{linear}$withN$=50$underNormal Distribution

$\overline{\frac{(a,b)porac1eLWLRr.idge1inear}{300.04960.05850.0570006930_{\backslash }059S}}$ 70 (J.0536 eD595 0.0604 0.0754 0.8357 (5,5) 150 0.0557 0.0624 0.0612 0.0757 0.7958 250 0.0563 0.0688 0.0653 0.0769 0.9089 500 0.0567 0.1072 0.0843 0.1015 0,9784 30 0.1123 0.1277 0.1257 0.1268 0.1421 70 0.1248 0.1340 0.1338 0.1376 0.8357 (2,5) 160 0.1323 0.1407 0.1416 0.1445 0,8042 250 0.1350 0.1487 0.1443 0.1507 0.9100 500 0.1364 0.1843 0.1589 0,1760 0.9769

ee

4: CorrectClassificationRatesintheColon Cancer Dataset

$\overline{\frac{pLWLRridgeinearMP\ ag}{10067.7\% 87.1\% 71.o^{o}/3.9^{o/0_{o}}\circ 38.7/85.5^{o}/_{0}}}$

$2S0$ 65.2% 87.1% 83.9% 87.1% 38.7% 83.9%

500 61.3% 87.1% 72.6% 83.9% 41.9% 87.1% 900 66.1% 87.1% 61.3% 87.1% OS.6% 87.1%

(9)

[2] Ledoit, O., andPeche,

S.

(2011), Eigenvectorsof

some

largesamplecovariance matrix

ensembles. Prob. Theory Relat. Fields, 152, 233-264.

[3] Ledoit, O., and Peche, S. (2015), Spectram estimation: A unified framework for

covarianc matrix estimation and

PCA

in large dimensions. J. Multivariate Analysis,

88, 365-411.

[4] Silverstein, J. W., (1995), Strong convergence of the empirical distribution of the

eigenvalues of large-dimensional random matrices. J. Multivariate Anal., 54,

295-309.

[5] Wang, $C_{\rangle}$ Pan, G., Tong, T., and Zhu, L. (2014), Shrinkage estimation of large

dimensional precisionmatrix using random matrixtheory. StatisticaSinica, 25, $993arrow$ 1008.

Graduate School of Economics, University of Tokyo

7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033 JAPAN

$E$-mail address: [email protected]

$\ovalbox{\tt\small REJECT}$

R

参照

関連したドキュメント

In Section 4 we present conditions upon the size of the uncertainties appearing in a flexible system of linear equations that guarantee that an admissible solution is produced

The issue of classifying non-affine R-matrices, solutions of DQYBE, when the (weak) Hecke condition is dropped, already appears in the literature [21], but in the very particular

Since our aim in this article is to prove the strong Feller property and give a gradient estimate of the semigroup, we don’t need the smooth conditions for all the coefficients or

Next, we prove bounds for the dimensions of p-adic MLV-spaces in Section 3, assuming results in Section 4, and make a conjecture about a special element in the motivic Galois group

Maria Cecilia Zanardi, São Paulo State University (UNESP), Guaratinguetá, 12516-410 São Paulo,

[Mag3] , Painlev´ e-type differential equations for the recurrence coefficients of semi- classical orthogonal polynomials, J. Zaslavsky , Asymptotic expansions of ratios of

We study the classical invariant theory of the B´ ezoutiant R(A, B) of a pair of binary forms A, B.. We also describe a ‘generic reduc- tion formula’ which recovers B from R(A, B)

The exponentiated gamma EG distribution and Fisher information matrices for complete, Type I, and Type II censored observations are obtained.. Asymptotic variances of the