• 検索結果がありません。

AlanLee OntheSemiparametricEfficiencyoftheScott-WildEstimatorunderChoice-BasedandTwo-PhaseSampling ResearchArticle

N/A
N/A
Protected

Academic year: 2022

シェア "AlanLee OntheSemiparametricEfficiencyoftheScott-WildEstimatorunderChoice-BasedandTwo-PhaseSampling ResearchArticle"

Copied!
23
0
0

読み込み中.... (全文を見る)

全文

(1)

Volume 2007, Article ID 86180,23pages doi:10.1155/2007/86180

Research Article

On the Semiparametric Efficiency of the Scott-Wild Estimator under Choice-Based and Two-Phase Sampling

Alan Lee

Received 30 April 2007; Accepted 8 August 2007 Recommended by Paul Cowpertwait

Using a projection approach, we obtain an asymptotic information bound for estimates of parameters in general regression models under choice-based and two-phase outcome- dependent sampling. The asymptotic variances of the semiparametric estimates of Scott and Wild (1997, 2001) are compared to these bounds and the estimates are found to be fully efficient.

Copyright © 2007 Alan Lee. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and repro- duction in any medium, provided the original work is properly cited.

1. Introduction

Suppose that for each number of subjects, we measure a response yand a vector of co- variatesx, in order to estimate the parametersβof a regression model which describes the conditional distribution ofygivenx. If we have sampled directly from the conditional distribution, or even the joint distribution, we can estimateβwithout knowledge of the distribution of the covariates.

In the case of a discrete response, which takes one ofJvaluesy1,. . .,yJ, say, we often estimateβusing a case-control sample, where we sample from the conditional distribu- tion ofXgivenY =yj. This is particularly advantageous if some of the values yj occur with low probability. In case-control sampling, the likelihood involves the distribution of the covariates, which may be quite complex, and direct parametric modelling of this distribution may be too difficult. To get around this problem, the covariate distribution can be treated nonparametrically. In a series of papers (Scott and Wild [1,2] Wild [3]) Scott and Wild have developed an estimation technique which yields a semiparametric estimate ofβ. They dealt with the unknown distribution of the covariates by profiling it out of the likelihood, and derived a set of estimating equations whose solution is the semiparametric estimator ofβ.

(2)

This technique also works well for more general sampling schemes, for example, for two-phase outcome-dependent stratified sampling. Here, the sample space is partitioned intoSdisjoint strata which are defined completely by the values of the response and possi- bly some of the covariates. In the first phase of sampling, a prospective sample of sizeNis taken from the joint distribution ofxandy, but only the stratum to which the individual belongs is observed. In the second phase, fors=1,. . .,S, a sample of sizen(s)1 is selected from then(s)0 individuals in stratumswhich were selected in the first phase, and the rest of the covariates are measured. Such a sampling scheme can reduce the cost of studies by confining the measurement of expensive variables to the most informative subjects. It is also an efficient design for elucidating the relationship between a rare disease and a rare exposure, in the presence of confounders.

Another generalized scheme that falls within the Scott-Wild framework is that of case- augmented sampling, where a prospective sample is augmented by a further sample of controls. In the prospective sample, we may observe both disease state and covariates, or covariates alone. Such schemes are discussed in Lee et al. [4].

In this paper, we introduce a general method for demonstrating that the Scott-Wild procedures are fully efficient. We use a (slightly extended) version of the theory of semi- parametric efficiency due to Bickel et al. [5] to derive an “information bound” for the asymptotic variance of the estimates. We then compute the asymptotic variances of the Scott-Wild estimators, and demonstrate their efficiency by showing that the asymptotic variance coincides with the information bound in each case.

The efficiency of these estimators has been studied by several authors, who have also addressed this question using semiparametric efficiency theory. This theory assumes an i.i.d. sample, and so various ingenious devices have been used to apply it to the case of choice-based sampling. For example, Breslow et al. [6] consider case-control sampling, that the data are generated by Bernoulli sampling, where either a case or a control is selected by a randomisation device with known selection probabilities, and the covariates of the resulting case or control are measured. The randomisation at the first stage means that the i.i.d. theory can be applied.

The efficiency of regression models under an approximation to the two-phase sam- pling scheme has been considered by Breslow et al. [7] using missing value theory. In this approach, a single prospective sample is taken. For some individuals, the response and the covariates are both observed. For the rest, only the response is measured and the covari- ates are regarded as missing values. The efficiency bound is obtained using the missing value theory of Robins et al. [8].

In this paper, we adopt a more direct approach. First, we sketch an extension of Bickel- Klaassen-Ritov-Wellner theory to cover the case of sampling from several populations, which we require in the rest of the paper. Such extensions have also been studied by McNeney and Wellner [9], and Bickel and Kwon [10]. Then information bounds for the regression parameters are derived assuming that separate prospective samples are taken from the case and control populations.

The minor modifications to the standard theory required for the multisample effi- ciency bounds are sketched inSection 2. This theory is then applied to case-control sam- pling and an information bound derived in Section 3. We also derive the asymptotic

(3)

variance of the Scott-Wild estimator and show that it coincides with the information bound.

InSection 4, we deal with the two-phase sampling scheme. We argue that a sampling scheme, equivalent to the two-phase scheme described above is to regard the data as aris- ing from separate independent sampling fromS+ 1 populations. This allows the appli- cation of the theory sketched inSection 2. We derive a bound and again show that the asymptotic variance of the Scott-Wild estimator coincides with the bound. Finally, math- ematical details are given inSection 5.

In the context of data that are independently and identically distributed, Newey [11]

characterises the information bound in terms of a population version of a profile likeli- hood, rather than a projection. A parallel approach to calculating the information bound for the case-control and two-phase problems, using Newey’s “profile” characterisation, is contained in Lee and Hirose [12].

2. Multisamples, information bounds, and semiparametric efficiency

In this section, we give a brief account of the theory of semiparametric efficiency when the data are not independently and identically distributed, but rather consist of separate independent samples from different populations.

Suppose we haveJ populations. From each population, we independently select sep- arate i.i.d. samples so that for j=1,. . .,J, we have a sample{xi j, i=1,. . .,nj} from a distribution with density pj, say. We call the combined sample a multisample. We will consider asymptotics wherenj/nwj, andn=n1+···+nJ.

Suppose thatpjis a member of the family of densities ᏼ=

pj(x,β,η),βᏮ,η, (2.1)

whereᏮis a subset of᏾kandᏺis an infinite-dimensional set. We denote the true values ofβandηbyβ0andη0, and pj(x,β00) bypj0. Consider asymptotically linear estimates ofβof the form

n ββ0

= 1

n J j=1

nj

i=1

φjxi j+op(1), (2.2) whereEjφj(X)=0,Ej denoting expectation with respect to pj0. The functionsφj are called the influence functions of the estimate and its asymptotic variance is

J j=1

wjEj

φjφTj . (2.3)

The semiparametric information bound is a matrix B that is a lower bound for the asymptotic variance of all asymptotically linear estimates ofβ. We have

Avarβ=

j

wjEjφjφTjB, (2.4) where theφjare the influence functions ofβ.

(4)

The efficiency bound is found as follows. LetTbe a subset of᏾pso thatᏼT= {pj(x,β, η(t)),βᏮ,tT}is ap-dimensional submodel ofᏼ. We also suppose that ifη0is the true value ofη, thenη(t0)=η0for somet0T. Thus, the submodel includes the true model, havingβ=β0andη=η0.

Consider the vector-valued score functions

l˙j,η=logpjx,β,η(t)

∂t , (2.5)

whose elements are assumed to be members ofL2(Pj0), where Pj0 is the measure cor- responding to pj(x,β00). Consider also the spaceL2k(Pj0), the space of all᏾k-valued square-integrable functions with respect toPj0, and the Cartesian productᏴof these spaces, equipped with the norm defined by

f1,. . .,fJ2= J j=1

wj fj2dPj0. (2.6)

The subspace ofᏴgenerated by the score functions (˙l1,η,. . ., ˙lJ) is the set of all vector- valued functions of the form (A˙l1,η,. . ., A˙lJ,η), where A ranges over allk by pmatrices.

Thus, to each finite-dimensional sub-family ofᏼ, there correspond a score function and subspace ofᏴgenerated by the score function. The closure inᏴof the span(over all such subfamilies) of all these subspaces is called the nuisance tangent space and denoted byη.

Consider also the score functions

l˙β,j=logpj(x,β,η)

∂β . (2.7)

The projection ˙l in Ᏼof ˙lβ=lβ,1,. . .,˙lβ,J) onto the orthogonal complement of ᐀η is called the efficient score, and its elements (which are members ofL2,k(Pj0)) are denoted by ˙lj. The matrix B (the efficiency bound) is given by

B1= J j=1

wjEj

l˙jl˙jT. (2.8)

The functions B ˙lj are called the efficient influence functions, and any multisample asymp- totically linear estimate ofβhaving these influence functions is asymptotically efficient.

3. The efficiency of the Scott-Wild estimator in case-control studies

In this section, we apply the theory sketched inSection 2to regression models, where the data are obtained by case-control sampling. Suppose that we have a responseY(assumed as discrete with possible values y1,. . .,yJ) and a vectorXof covariates, and we want to model the conditional distribution ofYgivenXusing a regression function

fj(x,β)=PY=yj|X=x, (3.1)

(5)

say, whereβis ak-vector of parameters. If the distribution of the covariatesXis specified by a densityg, then the joint distribution ofXandY is

fj(x,β)g(x) (3.2)

and the conditional distribution ofxgivenY=yjis pj(x,β,η)= fj(x,β)g(x)

πj , (3.3)

where

πj=

fj(x,β)g(x)dx. (3.4)

In case-control sampling, the data are not sampled from the joint distribution, but rather from the conditional distributions ofXgivenY=yj. We are thus in the situation ofSection 2withgplaying the role ofηand

pj(x,β,g)= fj(x,β)g(x)

πj . (3.5)

3.1. The information bound in case-control studies. To apply the theory ofSection 2, we must identify the nuisance tangent space᐀ηand calculate the projection of ˙lβon this space. Direct calculation shows that

l˙β,j=logfj(x,β)

∂β j

logfj(x,β)

∂β

, (3.6)

where Ᏹj denotes expectation with respect to the true density pj0, given by pj0(x)= pj(x,β0,g0), whereβ0 andg0 are the true values ofβandg. Here, and in what follows, all derivatives are evaluated at the true values of parameters.

Also, for any finite-dimensional family{g(x,t)}of densities withg(x,t0)=g0(x), we have

l˙η,j=logg(x,t)

∂t j

logg(x,t)

∂t

. (3.7)

It follows by the arguments of Bickel et al. [5, page 52] that the nuisance tangent space is of the form

η=

h1[h],. . .,hJ[h]:hL2,kG0

, (3.8)

wheredG0=g0dx, andL2,k(G0) is the space of allk-dimensional functions f satisfying the conditionf2dG0(x)<.

(6)

The efficient score, the projection of ˙lβ on the orthogonal complement of᐀η, is de- scribed in our first theorem. In the theorem, we use the notationsπj0=

fj(x,β0)dG0(x), f(x)=

J j=1

wj

πj fj(x), l˙β,j=l˙β,j1,. . ., ˙lβ,jk

T

, φl(x)=

J j=1

wj

πj0

l˙β,jlfj

x,β0

.

(3.9)

Then we have the following result.

Theorem 3.1. LetAbe the operatorL2(G0)L2(G0) defined by (Ah)(x)= f(x)h(x)

J j=1

wj

πj fj(x) fj

πj,h

2, (3.10)

where (·,·)2is the inner product inL2(G0). Then the efficient score hasj,lelement l˙β,jlhl +Ej

hl , (3.11)

wherehl is any solution inL2(G0) of the operator equation

Ahl =φl. (3.12)

A proof is given inSection 5.1.

It remains to identify a solution to (3.12). DefinePj(x)=(wjj0)fj(x,β0)/ f(x) and vj j =

PjPj fdG0. Let V=(vj j), W=diag(w1,. . .,wJ), and M=WV. Note that the row and column sums of M are zero since

wj J j=1

PjPj fdG0=wjwj

πj

fjdG0=0. (3.13) Using these definitions and (3.10), we get

Ahl=hlf J j=1

hl, fj

πj

2

Pjf (3.14)

so thatAhl=φlif and only if

hl= φl f+

J j=1

hl, fj

πj

2Pj. (3.15)

This suggests thathl will be of the form hl = φl

f+ J j=1

cjPj (3.16)

(7)

for some constantsc1,. . .,cJ. In order thathl satisfy (3.12), we must have cj

J j=1

cj

Pj, fj

πj

2

wj1φl,Pj

2=0, j=1,. . .,J. (3.17) Now,

Pj, fj

πj

2

=πj1

Pj fjdG0=wj1

PjPjfdG0=

W1Vj j (3.18) so that (3.17) will be satisfied if the vectorc=(c1,. . .,cJ)T satisfies

Mc=d(l), (3.19)

wheredl=(d1l,. . .,dJl)Twithdjl=l,Pj)2. Thus, we require thatc=Md(l), where M is a generalised inverse of M.

Our next result gives the information bound.

Theorem 3.2. Let D=(d1,. . .,dk) andφ=1,. . .,φk)T. The inverse of the information bound B is given by

B1= J j=1

wjj

l˙β,jl˙Tβ,j φφT

f dG0DTMD. (3.20) SeeSection 5.2for a proof.

3.2. Efficiency of the Scott-Wild estimator in case-control studies. Suppose that we haveJdisease states (typicallyJ=2, with disease-state case and control), and we choose nj individuals at random from disease population j, j=1,. . .,J, observing covariates x1,j,. . .,xnj,jfor the individuals sampled from population j. Also suppose that we have a regression function fj(x,β), j=1,. . .,J, giving the conditional probability that an indi- vidual with covariatesxhas disease state j. The unconditional densityg of the covariates is unspecified. The true values ofβandg are denoted byβ0andg0, and the true proba- bility of being in disease state jisπj0=

f(x,β0)g0(x)dx.

Under the case-control sampling scheme, the log-likelihood (Scott and Wild [2]) is J

j=1 nj

i=1

logfj xi j+

J j=1

nj

i=1

loggxi j

J j=1

njlogπj. (3.21) Scott and Wild show that the nonparametric MLE ofβis the “beta” part of the solution of the estimating equation

J j=1

nj

i=1

logPjxi j,β,ρ

∂θ =0, (3.22)

(8)

whereθ=(β,ρ),ρ=1,. . .,ρJ1),

Pj(x,β,ρ)= eρjfj(x,β) J1

l=1eρlfl(x,β) +fJ(x,β), j=1,. . .,J1, PJ (x,β,ρ)= fJ(x,β)

J1

l=1eρlfl(x,β) +fJ(x,β).

(3.23)

A Taylor series argument shows that the solution of (3.22) is an asyptotically linear esti- mate.

Thus, to estimateβ, we are treating the functionl(θ)=J

j=1

nj

i=1logPj(xi j,β,ρ) as though it were a log-likelihood. Moreover, Scott and Wild indicate that we can obtain a consistent estimate of the standard error by using the second derivative2l(θ)/∂θ∂θT, which they call the “pseudo-information matrix.”

Now letn=n1+···+nJ, let thenj’s converge to infinity withnj/nwj, j=1,. . .,J, and letρ0=01,. . .,ρ0,J1)T, where exp(ρ0j)=(wj0j)/(wJ0J). It follows from the law of large numbers and the results of Scott and Wild that the asymptotic variance of the estimate ofβis theββblock of the inverse of the matrix

I= − J j=1

wjj

2logPjxi j,β,ρ

∂θ∂θT

, (3.24)

where all derivatives are evaluated at (β00). Using the partitioned matrix inverse for- mula, theββblock of (I)1is

IββIβρIρρ1Iρβ1, (3.25) where Iis partitioned as

I=

Iββ Iβρ Iρβ Iρρ

. (3.26)

To prove the efficiency of the estimator, we show that the information bound (3.20) co- incides with the asymptotic variance (3.25). To prove this, the following representation of the matrix Iwill be useful. Let S be theJ×kmatrix withj,lelementSjl=(∂logfj(x,β)/

∂βl)|β=β0andjth rowSj, and let E be theJ×kmatrix withj,lelementᏱj[Sjl]. Also note thatPj(x)=Pj(x,β0,ρ0) and writeP=(P1,. . .,PS)T. Then we have the following theo- rem.

Theorem 3.3. (1) Iββ=J

j=1wjj[SjSTj]

STPPTSfdG0. (2) Let U=WE

PPTSfdG0. Then Iρβconsists of the firstJ1 rows of U.

(3) Iρρconsists of the firstJ1 rows and columns of M=WV.

A proof is given inSection 5.3.

Now we show that the information bound coincides with the asymptotic variance.

Using the definitionφl(x)=J

j=1(wjj0lβ,jlfj(x,β0), we can write φ=(SE)TP f,

(9)

and substituting this and the relationship ˙lβ=SE into (3.20), we get B1=

J j=1

wjEj

SjSTjETWE

(SE)TPPT(SE)fdG0(x)DTMD. (3.27) Moreover,

D=

TdG0(x)=

PPT(SE)fdG0(x)=WEUVE=MEU. (3.28) Substituting this into (3.27) and using the relationships described inTheorem 3.3, we get B1=IββUTMUET(IMM)UUT(IMM)E. (3.29) ByTheorem 3.3, the matrix

Iρρ1 0 0T 0

(3.30) is a generalised inverse of M, so UTMU=IβρIρρ1Iρβ. Also,

IMMU=

IMM(MED)=

IMMME

IMMMC=0 (3.31) by the properties of a generalised inverse. Thus, B1=IββIβρIρρ1Iρβ and the Scott- Wild estimate is efficient.

4. Efficiency of the Scott-Wild estimator under two-stage sampling

In this section, we use the same techniques to show that the Scott-Wild nonparametric MLE is also efficient under two-stage sampling.

4.1. Two stage sampling. In this sampling scheme, the population is divided intoSstrata, where stratum membership is completely determined by an individual’s responseyand possibly some of the covariatesx—typically those that are cheap to measure. In the first sampling stage, a random sample of sizen0is taken from the population, and the stratum to which the sampled individuals belong is recorded. For theith individual, letZis=1 if the individual is in stratums, and zero otherwise. Thenn(s)0 =ni=11Zisis the number of individuals in stratums. In the second sampling stage, for each stratums, a simple random sample of sizen(s)1 is taken from then(s)0 individuals in the stratum. Letxis, i= 1,. . .,n(s)1 andyis,i=1,. . .,n(s)1 be the covariates and responses for those individuals. Note thatn(s)1 depends onn(s)0 and must be regarded as random sincen(s)0 n(s)1 fors=1,. . .,S.

We assume that the distribution ofn(s)1 depends only onn(s)0 , and that, conditional on the n(s)0 ’s, then(s)1 ’s are independent.

As inSection 3, let f(y|x,β) be the conditional density ofygivenx, which depends on a finite number of parametersβ, which are the parameters of interest. Letgdenote the density of the covariates. We will regardgas an infinite-dimensional nuisance parameter.

(10)

The conditional density of (x,y), conditional on being in stratums, using Bayes theorem, is

Is(x,y)f(y|x,β)g(x)

Is(x,y)f(y|x,β)g(x)dx d y, (4.1)

whereIs(x,y) is the stratum indicator, having value 1 if an individual having covariates xand response yis in stratums, and zero otherwise. The unconditional probability of being in stratumsin the first phase is

Qs=

Is(x,y)f(y|x,β)g(x)dx d y. (4.2)

Introduce the functionQs(x,β)=

Is(x,y)f(y|x,β)d y. Then,

Qs=

Qs(x,β)g(x)dx. (4.3)

Under two-phase sampling, the log-likelihood (Wild [3], Scott and Wild [2]) is

S s=1

n(s)1

i=1

logfyis|xis+ S s=1

n(s)1

i=1

loggxis+ S s=1

mslogQs, (4.4)

wherems=n(s)0 n(s)1 . Scott and Wild show that the semiparametric MLEβ(i.e., the

“β” part of the maximiser (β,g) of (4.4)) is equal to the “β” part of the solution of the estimating equations

∂β =0,

∂ρ =0. (4.5)

The function is given by

(β,ρ)= S s=1

n(s)1

i=1

logfyis|xis S s=1

n(s)1

i=1

log

r

μr(ρ)Qr

xis

+ S s=1

mslogQs(ρ), (4.6)

(11)

whereQ1(ρ),. . .,QS(ρ) are probabilities defined bySs=1Qs(ρ)=1 and logQs/QS=ρs,s= 1,. . .,S, andμs(ρ)=c(n0ms/Qs(ρ)). Theμs’s depend on the quantitycand thems’s, and for fixed values of these quantities, they are completely determined by theS1 quantities ρs. Note that the estimating equations (4.5) are invariant under choice of c. It will be convenient to takecasN1, whereN=n0+n1, wheren1=S

s=1n(s)1 .

In order to apply the theory ofSection 2to two-phase sampling, we will prove that the asymptotics under two-phase sampling are the same as those under the following multi-sample sampling scheme.

(1) As in the first scheme, take a random sample of n0individuals and record the stratum in which they fall. This amounts to taking an i.i.d. sample{(Zi1,. . .,ZiS), i=1,. . .,n0}of sizen0from MULT(1,Q1,. . .,QS).

(2) Fors=1,. . .,S, take independent i.i.d. samples{(xis,yis),i=1,. . .,n(s)1 }of sizen(s)1 from the conditional distribution of (x,y) givens, having densityps(x,y,β,g)= Is(x,y)f(y|x,β)g(x)/Qs.

We note that the likelihood under this modified sampling scheme is the same as before, and we show inTheorem 4.1that the asymptotic distribution of the parameter estimates is also the same. It follows that if an estimate is efficient under the multisampling scheme, it must also be efficient under two-phase sampling.

Theorem 4.1. LetN=n0+n1, wheren1=S

s=1n(s)1 , and suppose thatN(n0/Nw0)p 0 andN(n(s)1 /Nws)p 0,s=1,. . .,S.

Letθbe the solution of the estimating equation (4.5), and letθ0be the solution to the equation

w0ψ0

Z1,. . .,Zs,θ + S s=1

s

ψs(x,y,θ) =0, (4.7)

wheresdenotes expectation with respect tops,

ψ0

Z1,. . .,Zs=

∂θ S s=1

ZslogQs, ψs(x,y,θ)=

∂θ

logf(y|x,β)log

s

μsQs(x,β)

logQs

, s=1,. . .,S.

(4.8)

ThenN(θθ0) is asymptoticallyN(0, (I)1V(I)1) under both sampling schemes, where V=S

s=0wsEs[(ψsEss])(ψsEss])T] and I= −S

s=0wsEs[∂ψs/∂θ].

A proof is given inSection 5.4.

4.2. The information bound. Now we derive the information bound for two-stage sam- pling. By the arguments ofSection 4.1, the information bound for two-phase sampling is the same as that for the case of independent sampling from theS+ 1 densitiesps(x,y,β,g),

(12)

where

ps(x,y,β,g)=Is(x,y)fy|x,βg(x)

Qs , s=1,. . .,S, p0(x,y,β,g)=Q1Z1···QJZJ,

(4.9)

whereZs=Is(x,y) is thesth stratum indicator.

First, we identify the form of the nuisance tangent space (NTS) for this problem. As in Section 3, we see that the score functions for this problem are

l˙0=logp0(x,y,β,g)

∂β =

S s=1

Zss[᏿], l˙s=logps(x,y,β,g)

∂β =s[᏿], s=1,. . .,S,

(4.10)

where᏿=logf(y|x,β)/∂βandᏱsdenotes expectation with respect to the true den- sityps(x,y,β0,g0). Similarly, ifg(x,t) is a finite-dimensional subfamily of densities, then

logps(x,y,β,g(x,t))/∂t=hs[h],s=1,. . .,S, and

logp0

x,y,β,g(x,t)

∂t =

S s=1

Zss[h], (4.11)

where h=logg(x,t)/∂t. Arguing as in Section 3, we see that the NTS consists of all elements of the form

T(h)= S

s=1

Zs

s[h]Ᏹ[h],h1[h],. . .,hs[h]

, (4.12)

whereᏱdenotes expectation with respect toG0.

As before, the efficient score is ˙l=l˙T(h), whereh is the element ofL2k(G0) which minimisesl˙T(h)2. An explicit expression for this squared distance is

k j=1

w0

S s=1

Zs

s

j s

hj +Ᏹhj 2 +

S s=1

wss

js

j hj+Ᏹs

hj 2! , (4.13) wherehjand᏿jare the jth elements ofhand᏿, respectively. To obtain the projection, we must choosehjto minimise the term in the braces in (4.13). Some algebra shows that this term may be written as

hj,Ahj

22hjj

2+ S s=1

w0Qs0ws

s

j 2

+ S s=1

wss

2j , (4.14)

(13)

where Qs0=

Q(x,β0)g0(x)dx is the true value ofQs, (·,·)2 is the inner product on L2(G0), andAis a selfadjoint nonnegative definite operator onL2(G0) defined by

Ah=Q h S r=1

S s=1

δrsγrs

h(x)Qr x,β0

g0(x)dx

Qr0 Ps

! ,

Q(x)= S s=1

ws Qs0Qs

x,β0 ,

Ps(x)=

ws/Qs0 Qj

x,β0 Q(x) ,

γrs=

w0Qr1Qr

wr , r=s,

w0QrQs

wr , r=s, φj(x)=

S s=1

ws

Qs0Qsx,β0

logQs(x,β)

∂βj |β=β0 S s=1

S r=1

Q(x)Pr(x)δrsγrss

j

.

(4.15) As inSection 3, (4.14) is minimised whenhj=hj, wherehj is a solution ofAhj=φj, which must be of the form

hj = φj f+

S r=1

cr jPr (4.16)

for constantscr jwhich satisfy the equation cr j

S s=1

S t=1

δrsγrs ws vstct j=

S s=1

δrsγrs

ws ds j, (4.17)

wherevrs=

PrPsQdG0andds j=(Psj)2. WritingΓ=rs), C=(cr j), D=(dr j), W= diag(w1,. . .,wS), and V=(vrs), (4.17) can be expressed in matrix terms as

MC=D, (4.18)

where M=W(IΓ)1V. These results allow us to find the efficient score and hence the information bound, which is described in the following theorem.

Theorem 4.2. The information bound B is given by B1=

S s=1

wss

᏿᏿T + S s=1

w0Qs0ws

s[᏿]Ᏹs[᏿]T φφT

Q dG0(x)DTMD.

(4.19) The proof is similar to that ofTheorem 3.2and hence omitted.

参照

関連したドキュメント

In this paper, we extend the results of [14, 20] to general minimization-based noise level- free parameter choice rules and general spectral filter-based regularization operators..

Using the semigroup approach for stochastic evolution equations in Banach spaces we obtain existence and uniqueness of solutions with sample paths in the space of continuous

Furthermore, the upper semicontinuity of the global attractor for a singularly perturbed phase-field model is proved in [12] (see also [11] for a logarithmic nonlinearity) for two

In this work, we present an asymptotic analysis of a coupled sys- tem of two advection-diffusion-reaction equations with Danckwerts boundary conditions, which models the

In section 3, we state some results on the large time behavior of subsonic solutions (for classical solutions in section 3.1 and for subsonic shock solutions in

The aim of this work is to prove the uniform boundedness and the existence of global solutions for Gierer-Meinhardt model of three substance described by reaction-diffusion

For certain singular limits of phase field models, u is approximately constant near the phase interface, and an asymptotic analysis can be conducted to de- termine the wave profile

We show that the Chern{Connes character induces a natural transformation from the six term exact sequence in (lower) algebraic K { Theory to the periodic cyclic homology exact