Econometrics I: Solutions of Homework #13
Hiroki Kato * July 15, 2020
Contents
1 Solutions: Question 1 1
1.1 (1) . . . . 1 1.2 (2) . . . . 2 1.3 (3) . . . . 3
2 Solutions: Question 2 4
2.1 (1) . . . . 4 2.2 (2) . . . . 5 2.3 (3) . . . . 6
1 Solutions: Question 1
1.1 (1)
OLS estimator of β is given by
β ˆ = (X
′X)
−1X
′y = β + (X
′X)
−1X
′u = β + (
1 T
∑
Tt=1
X
tX
t′)
−1(
1 T
∑
Tt=1
X
tu
t)
, (1)
*
email:
[email protected]. Room 503. If you find any errors in handouts and materials,please contact me via email.
where X
tis k × 1 vector whose elements are explanatory variables for observetion t. Taking the probability limit to both sides yields
plim
T→∞
β ˆ = β + lim
T→∞
( 1 T
∑
Tt=1
X
tX
t′)
−1plim
T→∞
( 1 T
∑
Tt=1
X
tu
t)
(2)
First, we assume the following stationarity condition:
lim
T→∞
1 T
∑
Tt=1
X
tX
t′= M
xx. (3)
Then, by g(X
n) →
pg(X) if X
n→
pX, we have
plim
T→∞
( 1 T
∑
Tt=1
X
tX
t′)
−1= M
xx−1. (4)
We further assume zero covariance between X and u, that is,
plim
T→∞
( 1 T
∑
Tt=1
X
tu
t)
= 0. (5)
By these two assumptions, we finally obtain plim
T→∞
β ˆ = β + M
xx−1· 0 = β. (6)
We can show the consistency of OLSE.
1.2 (2)
Greenberg and Webster (1983) states the central limit theorem as follows:
Z
1, . . . , Z
nare mutually independent. Z
iis distributed with mean µ and variance Σ
ifor
i = 1, . . . , n. Then, we have the follwing result:
Using this theorem, we derive the asymptotic distribution of
√1nX
′u. Define Z
i= X
tu
t. By the assumption of u
t, µ = E(Z
i) = 0 and the variance of Z
iis given by
Σ
i= V (X
tu
t) = E[(X
tu
t)(X
tu
t)
′] = σ
2X
tX
t′. (7) Then, we obtain Σ as follows:
Σ = lim
T→∞
( 1 n
∑
Tt=1
σ
2X
tX
t′)
= σ
2lim
T→∞
( 1 n
∑
Tt=1
X
tX
t′)
= σ
2M
xx. (8)
By the central limit theorm, we have
√ 1
n X
′u = 1
√ n
∑
Tt=1
(X
tu
t− 0) →
dN (0, σ
2M
xx). (9)
1.3 (3)
We rewrite the equation (1) as follows:
√ T ( ˆ β − β) = (
1 T
∑
Tt=1
X
tX
t′)
−1(
√ 1 T
∑
Tt=1
X
tu
t)
. (10)
We use the following proerty: z
n= H
ny
n→
dN (Hµ, HΩH
′) where H
nis an r × k matrix with plim
n→∞H
n= H and y
nis a × 1 vector with y
n→
dN (µ, Ω). Because the equation (10) is seen as the form z
n, √
T ( ˆ β − β) has a limiting normal distribution with zero mean and variance given by σ
2M
xx−1M
xxM
xx−1= σ
2M
xx−1. (11) Thus, we can write
√ T ( ˆ β − β) →
dN (0, σ
2M
xx−1). (12)
2 Solutions: Question 2
2.1 (1)
The loglikelihood function is
log L
T(θ) = log
∏
Ti=1
f (X
i; θ) =
∑
Ti=1
log f (X
i; θ). (13)
The maximum likelihood estimator is defined by θ ˆ ∈ arg max
θ
1 T
∑
Ti=1
log f (X
i; θ). (14)
Assuming the domain of X
idoes not depend on the parameter θ, we define the following expec
tation of log f (X
i; θ) as
log L(θ) = E [log f (X
i; θ)] =
∫
(log f(X
i; θ))f (X
i| θ
0)dx, (15) where θ
0is a unknown true parameter. The function log L(θ) is maximized at θ = θ
0, that is, log L(θ) ≤ log L(θ
0) for any θ bacuase
E[log f (X
i; θ) − log f(X
i; θ
0)]
=E [
log f (X
i; θ) f(X
i; θ
0)
]
≤ E
[ f (X
i; θ) f(X
i; θ
0) − 1
]
=
∫ ( f(X
i; θ) f(X
;θ
0) − 1
)
f(X
i; θ
0)dx
=
∫
f(X
i; θ)dx −
∫
f(X
i; θ
0)dx = 0.
Note that log x ≤ x − 1.
By the weak law of large numbers, for any θ, T
−1log L
T(θ) → log L(θ). By defenition, θ ˆ is the
2.2 (2)
Applying the central limit theorem with unequal variance yields
√ T ( 1
T
∂ log L(θ)
∂θ − µ )
= 1
√ T
∑
Ti=1
( ∂ log f(X
i; θ)
∂θ − µ )
→
dN (0, Σ), (16)
where
∂θ∂log f(X
i; θ) is distributed with mean µ and variance Σ
i, and Σ = lim
T→∞(T
−1∑
T i=1Σ
i).
From (16), we have
T
lim
→∞E [ 1
T
∂ log L(θ)
∂θ ]
= µ, (17)
T
lim
→∞T · Var [ 1
T
∂ log L(θ)
∂θ ]
= Σ. (18)
We need the expectation and variance of
∂θ∂log L(θ). Because L(θ) is a joint distribution, ∫
L(θ)dx = 1. Taking the firstorder derivative with respect to θ on both sides yields
∫ ∂L(θ)
∂θ dx = 0
∫ ∂ log L(θ)
∂θ L(θ)dx = 0 E
[ ∂ log L(θ)
∂θ ]
= 0.
Thus, we obtain
T
lim
→∞E [ 1
T
∂ log L(θ)
∂θ ]
= lim
T→∞
1 T E
[ ∂ log L(θ)
∂θ ]
= 0 = µ. (19)
To obtain the variance, taking the secondorder derivative of ∫
L(θ)dx = 1 with respect to θ,
∫ ∂
2log L(θ)
∂θ∂θ
′L(θ)dx +
∫ ∂ log L(θ)
∂θ
∂ log L(θ)
∂θ
′L(θ)dx = 0
− E
[ ∂
2log L(θ)
∂θ∂θ
′]
= E
[ ∂ log L(θ)
∂θ
∂ log L(θ)
∂θ
′]
Thus, we obtain the variance as follows:
Var
[ ∂ log L(θ)
∂θ ]
=E
[ ∂ log L(θ)
∂θ
∂ log L(θ)
∂θ
′]
− E
[ ∂ log L(θ)
∂θ ]
2=E
[ ∂ log L(θ)
∂θ
∂ log L(θ)
∂θ
′]
= − E
[ ∂
2log L(θ)
∂θ∂θ
′]
=I(θ), (20)
where I(θ) is the information matrix. This leads to
T
lim
→∞T · Var [ 1
T
∂ log L(θ)
∂θ ]
= lim
T→∞
1 T Var
[ 1 T
∂ log L(θ)
∂θ ]
= lim
T→∞
1
T I (θ) = Σ. (21) Hence, the asymptotic distribution is
√ 1 T
∑
Ti=1