• 検索結果がありません。

We derive the maximum likelihood estimators of p.

N/A
N/A
Protected

Academic year: 2021

シェア "We derive the maximum likelihood estimators of p."

Copied!
3
0
0

読み込み中.... (全文を見る)

全文

(1)

Example 1.17b: Suppose that X 1 , X 2 , · · · , X n are mutually independently and identically distributed as Bernoulli ran- dom variables with parameter p.

We derive the maximum likelihood estimators of p.

The joint density (or the likelihood function) of X 1 , X 2 , · · · , X n is:

f (x 1 , x 2 , · · · , x n ; p) =

n

Y

i=1

f (x i ; p) =

n

Y

i=1

p x

i

(1 − p) 1

x

i

= p

Pni=1

x

i

(1 − p) n

Pni=1

x

i

= l(p).

331

The log-likelihood function is given by:

log l(p) = (

n

X

i=1

x i ) log(p) + (n −

n

X

i=1

x i ) log(1 − p).

For maximization of the likelihood function, di ff erentiating the log-likelihood function log l(p) with respect to p , the first derivatives should be equal to zero, i.e.,

d log l(p) d p = 1

p X n

i=1

x i − 1 1 − p (n −

X n

i=1

x i )

= n p x − n

1 − p (1 − x) = 0

Let ˆ p be the solution which satisfies the above equation.

332

We obtain the maximum likelihood estimates as follows:

ˆ

p = x = 1 n

n

X

i=1

x i ,

Replacing x i by X i for i = 1, 2, · · · , n, the maximum likeli- hood estimator of p is given by ˆ p = X = 1

n

n

X

i=1

X i .

333

˔ We check whether ˆ p is unbiased.

E( ˆ p) = E(X) = E( 1 n

n

X

i=1

X i ) = 1 n

n

X

i=1

E(X i ) = p

Remember that E(X i ) =

1

X

x

i=0

x i p x

i

(1 − p) 1

x

i

= p, where x i takes 0 or 1.

Thus, ˆ p is an unbiased estimator of p.

334

˔ Next, we check whether ˆ p is e ffi cient.

From Cramer-Rao inequality, V( ˆ p) ≥ − 1

nE d 2 log f (X; p) dp 2

.

f (X; p) = p X (1 − p) 1

X

log f (X; p) = X log(p) + (1 − X) log(1 − p) d log f (X; p)

dp = X

p − 1 − X 1 − p d 2 log f (X; p)

dp 2 = − X

p 2 − 1 − X (1 − p) 2 335

We need to check whether the equality holds.

V( ˆ p) = V( 1 n

n

X

i=1

X i ) = 1 n 2 V(

n

X

i=1

X i ) = 1 n 2

n

X

i=1

V(X i )

= 1 n 2

n

X

i=1

p(1 − p) = p(1 − p) n ,

Note as follows:

V(X i ) = E((X i − p) 2 ) =

1

X

x

i=0

(x i − p) 2 p x

i

(1 − p) 1

x

i

= p(1 − p).

336

(2)

The Cramer-Rao lower bound is:

− 1

nE d 2 log f (X; p) d p 2

= − 1

nE

− X

p 2 − 1 − X (1 − p) 2

= − 1

n

− E(X)

p 2 − 1 − E(X) (1 − p) 2

= 1 n 1

p + 1 1 − p

= p(1 − p) n ,

which is equal to V( ˆ p).

Thus, ˆ p is an e ffi cient estimator of p.

337

˔ We check whether ˆ p is consistent.

From Chebyshev’s inequality, P( | p ˆ − p | ≥ ) ≤ E(( ˆ p − p) 2 )

2 = p(1 − p) n 2 .

As n −→ ∞ , P( | p ˆ − p | ≥ ) −→ 0.

That is, ˆ p converges in probability to p.

Thus, ˆ p is a consistent estimator of p.

338

Properties of Maximum Likelihood Estimator: For small sample (খඪຊ), the MLE has the following properties.

• MLE is not necessarily unbiased in general, but we often have the case where we can construct the unbiased estimator by an appropriate transformation.

For instance, the MLE of σ 2 , S

∗∗

2 , is not unbiased.

However, n

n − 1 S

∗∗

2 = S 2 is an unbiased estimator of σ 2 .

• If the e ffi cient estimator exists, the maximum likelihood estimator is e ffi cient.

339

E ffi cient estimator ⇐⇒ The variance of the estimator is equal to the Cramer-Rao lower bound.

For large sample (େඪຊ), as n −→ ∞ , the maximum likelihood estimator of θ, ˆ θ n , has the following property:

√ n(ˆ θ n − θ) −→ N(0, σ 2 (θ)), (23) where

σ 2 (θ) = 1 E ∂ log f (X; θ)

∂θ

2 ! = − 1 E ∂ 2 log f (X; θ)

∂θ 2

! .

340

(23) indicates that the MLE has consistency, asymptotic un- biasedness (઴ۙෆภੑ), asymptotic e ffi ciency (઴ۙ༗ޮ

ੑ) and asymptotic normality (઴ۙਖ਼نੑ).

Asymptotic normality of the MLE comes from the central limit theorem discussed in Section 6.3.

Even though the underlying distribution is not normal, i.e., even though f (x; θ) is not normal, the MLE is asymptotically normally distributed.

341

Note that the properties of n −→ ∞ are called the asymptotic properties, which include consistency, asymptotic normality and so on.

By normalizing, as n −→ ∞ , we obtain as follows:

√ n(ˆ θ n − θ) σ(θ) =

θ ˆ n − θ σ(θ)/ √

n −→ N(0, 1).

√ n(ˆ θ n − θ) has the distribution, which does not depend on n.

√ n(ˆ θ n − θ) = O(1) is written, where O() is a function n.

That is, ˆ θ n − θ = n

1/2 × O(1) = O(n

1/2 ).

342

(3)

As another representation, when n is large, we can approxi- mate the distribution of ˆ θ n as follows:

θ ˆ n ∼ N θ, σ 2 (θ)

n .

This implies that when n −→ ∞ , ˆ θ n approaches the lower bound of Cramer-Rao inequality: σ 2 (θ)

n . This property is called an asymptotic e ffi ciency.

343

Moreover, replacing θ in variance σ 2 (θ) by ˆ θ n , when n −→

∞ , we have the following property:

θ ˆ n − θ σ(ˆ θ n )/ √

n −→ N(0, 1). (24) Practically, when n is large, we approximately use:

θ ˆ n ∼ N

θ, σ 2 (ˆ θ n ) n

. (25)

344

Proof of (23): By the central limit theorem (11) on p.254,

√ 1 n

n

X

i=1

∂ log f (X i ; θ)

∂θ −→ N

0, 1 σ 2 (θ)

, (26)

where σ 2 (θ) is defined in (14), i.e., V ∂ log f (X i ; θ)

∂θ

= 1 σ 2 (θ) . Note that E ∂ log f (X i ; θ)

∂θ

= 0.

Apply the central limit theorem, taking ∂ log f (X i ; θ)

∂θ as the

ith random variable.

345

By the Taylor series expansion around ˆ θ n = θ, 0 = 1

√ n

n

X

i=1

∂ log f (X i ; ˆ θ n )

∂θ

= 1

√ n

n

X

i=1

∂ log f (X i ; θ)

∂θ + 1

√ n

n

X

i=1

2 log f (X i ; θ)

∂θ 2 (ˆ θ n − θ) + 1

2!

√ 1 n

n

X

i=1

3 log f (X i ; θ)

∂θ 3 (ˆ θ n − θ) 2 + · · ·

346

参照

関連したドキュメント

Next, we prove bounds for the dimensions of p-adic MLV-spaces in Section 3, assuming results in Section 4, and make a conjecture about a special element in the motivic Galois group

Maria Cecilia Zanardi, São Paulo State University (UNESP), Guaratinguetá, 12516-410 São Paulo,

We prove tight- ness of the recentered maximum of the Gaussian fields and provide exponentially decaying bounds on the right and left tails.. Display (1.1) implies that the

“Breuil-M´ezard conjecture and modularity lifting for potentially semistable deformations after

To derive a weak formulation of (1.1)–(1.8), we first assume that the functions v, p, θ and c are a classical solution of our problem. 33]) and substitute the Neumann boundary

Sickel.; Sobolev spaces of fractional order, Nemytskij operators and nonlinear partial differential equations, 1996, New York. Svetlin

After briefly summarizing basic notation, we present the convergence analysis of the modified Levenberg-Marquardt method in Section 2: Section 2.1 is devoted to its well-posedness

When s = 1/2, Cabr´ e and Tan [6] established the existence of positive solutions for equations having nonlinearities with the subcritical growth, their regularity, the