We have proposed a spatial generalized autoregressive conditional heteroskedas-ticity (S-GARCH) model as extension of a spatial autoregressive conditional het-eroskedasticity (S-ARCH) model by Sato and Matsuda (2017). By re-expressing S-GARCH as spatial autoregressive moving average (SARMA) models, we em-ploy spatial econometrics methodology to estimate the parameters by the two step procedure, and establish rigorous asymptotic results. Applications to land price data in Tokyo demonstrate that S-GARCH models detect several interest-ing features of spatial volatilities caused by the Great East Japan Earthquake in 2011.
Finally let us introduce possible extensions S-GARCH models. We employed the first-order contiguity relations to construct a spatial weight matrix, which is the simplest choice. It is desired to check what kind of spatial weight matrix can improve the fitting of S-GARCH models. Spatio-temporal extension of the S-GARCH models are surely our next target that can provide much better ways for land price data analysis than the year by year fitting of S-GARCH models in this paper.
Table3.1:Theempiricalmeansandrootmeansquarederrors(RMSE)oftheestimators. normalchi(3)lognormal n=100n=400n=100n=400n=100n=400 ϕBiasRMSEBiasRMSEBiasRMSEBiasRMSEBiasRMSEBiasRMSE 0.90.0290.0820.0070.0290.0320.0780.0090.0300.0310.0800.0090.029 0.05-0.0390.069-0.0090.026-0.0400.066-0.0100.027-0.0380.068-0.0110.026 0.50.0390.3780.0060.1050.0150.310-0.0030.100-0.0370.310-0.0180.101 1.00.0210.1880.0090.0890.0230.1760.0040.0820.0200.1730.0040.077 0.45-0.0600.238-0.0150.098-0.0650.243-0.0150.103-0.0530.224-0.0160.097 0.45-0.0010.1550.0020.0720.0020.1590.0020.0750.0020.1510.0030.073 0.5-0.0140.292-0.0020.092-0.0540.313-0.0070.113-0.2770.595-0.0860.255 1.00.0340.2320.0120.1130.0440.2290.0180.1090.0410.2160.0070.103 0.05-0.0270.139-0.0110.080-0.0230.141-0.0110.079-0.0290.132-0.0130.079 0.9-0.0110.1080.0020.069-0.0170.1150.0020.068-0.0060.1080.0030.068 0.5-0.4310.829-0.1000.240-0.6271.089-0.1140.295-1.0091.660-0.3130.630 1.00.0130.2360.0070.1140.0220.2280.0060.1090.0070.2150.0040.105 Note:ϕ=(λ,ρ,α,β)′
Table 3.2: Estimated values and standard errors ofλ,ρ, αand β in S-ARCH and S-GARCH models, which are applied year by year to the residuals by fitting SAR models to land priced data.
S-ARCH S-GARCH
2010 2011 2012 2013 2014 2010 2011 2012 2013 2014
ˆλ 0.772 0.845 0.874 0.893 0.601
se(λ) 0.206 0.139 0.128 0.100 0.415
ˆ
ρ 0.240 0.244 0.274 0.279 0.184 0.110 0.076 0.059 0.060 0.104
se(ρ) 0.083 0.081 0.082 0.083 0.084 0.077 0.055 0.048 0.045 0.086 ˆ
α 0.569 -0.518 -0.606 -0.193 -0.804 0.162 -0.121 -0.130 -0.021 -0.412 βˆ -0.022 0.212 0.232 0.109 0.225 -0.001 0.052 0.049 0.025 0.120 AIC 1538.7 1481.7 1549.8 1573.8 1537.7 1536.6 1475.3 1547.9 1570.4 1537.9
Figure 3.1: The identified volatilities in 2010 and 2011. The great earth quake occurred in 2011.
Figure 3.2: A comparison between identified volatilities by ARCH and S-GARCH models.
A. Hessian, average Hessian and symmetric ma-trix Ω
ψ,nThe Hessian matrixHn(ψ)≡ ∂ψ∂ψ∂2 ′ logLn(ψ) has the elements:
Hββ′ = −1
σ2Xn′Rn′−1(λ)R−n1(λ)Xn, Hβσ2 = −1
σ4Xn′Rn′−1(λ)V(θ), Hβρ = −1
σ2Xn′Rn′−1(λ)R−n1(λ)WnYn, Hβλ = 1
σ2Xn′R′−n 1(λ)(Wn′R′−n 1(λ)Vn(θ) +R−n1(λ)WnVn(θ)−R−n1(λ)WnYn), Hσ2σ2 = n
2σ4 −Vn′(θ)Vn(θ) σ6 , Hσ2ρ = −1
σ4Yn′Wn′R′−n 1(λ)V(θ), Hσ2λ = 1
σ4(Vn′(θ)−Yn′)Wn′R′−n 1(λ)Vn(θ), Hρρ = −1
σ2Yn′Wn′R′−n 1(λ)R−n1(λ)WnYn−tr(S−n1(θ)WnSn−1(θ)Wn), Hρλ = 1
σ2Yn′Wn′R′−n 1(λ)(Wn′R′−n 1(λ)V(θ) +R−n1(λ)WnVn(θ)−R−n1(λ)WnYn)
−tr(Sn−1(θ)WnSn−1(θ)Wn), Hλλ = 1
σ2(Yn′−Vn′(θ))Wn′R′−n 1(λ)(2Wn′R′−n 1(λ)Vn(θ) +Rn−1(λ)WnVn(θ)−R−n1(λ)WnYn) +tr(R−n1(λ)WnR−n1(λ)Wn)−tr(Sn−1(θ)WnSn−1(θ)Wn).
The average Hessian matrix Σψ,n ≡ −E(1
n
∂2
∂ψψ′logLn(ψ0))
has the
ele-ments:
Σββ′ = 1
nσ02Xn′R′−n 1R−n1Xn, Σβσ2 = 0,
Σβρ = 1
nσ02Xn′R′−n 1R−n1WnSn−1Xnβ0, Σβλ = 1
nσ02Xn′R′−n 1R−n1WnSn−1Xnβ0, Σσ2σ2 = 1
2σ40, Σσ2ρ = 1
nσ02tr(WnSn−1), Σσ2λ = 1
nσ02tr(WnSn−1−WnR−n1), Σρρ = 1
nσ02β0′Xn′Sn′−1Wn′R′−n 1R−n1WnS−n1Xnβ0+ 1
ntr(R′nS′−n 1Wn′Rn′−1Rn−1WnSn−1Rn+Sn−1WnSn−1Wn), Σρλ = 1
nσ02β0′Xn′Sn′−1Wn′R′−n 1R−n1WnS−n1Xnβ0+ 1
ntr(R′nS′−n 1Wn′Rn′−1Rn−1WnSn−1Rn+Sn−1WnSn−1Wn)
−1
ntr(R′nSn′−1Wn′R′−n 1R−n1Wn+Sn−1WnRn−1Wn), Σλλ = 1
nσ02β0′Xn′Sn′−1Wn′R′−n 1R−n1WnS−n1Xnβ0+ 1
ntr(R′nS′−n 1Wn′Rn′−1Rn−1WnSn−1Rn+Sn−1WnSn−1Wn)
−2
ntr(R′nSn′−1Wn′R′−n 1R−n1Wn+Sn−1WnRn−1Wn) +1
ntr(R−n1WnR−n1Wn+Wn′R′−n 1R−n1Wn).
The symmetric matrix Ωψ,n has the elemetns:
Ωββ′ = 0, Ωβσ2 = µ3
2nσ60Xn′R′−n 11n, Ωβρ = µ3
nσ04
∑n i
{(R−n1Xn)i}′(R−n1WnSn−1Rn)ii,
Ωβλ = µ3
nσ04
∑n i
{(R−n1Xn)i}′(R−n1WnSn−1Rn−R−n1Wn)ii,
Ωσ2σ2 = µ4−3σ40 4σ80 , Ωσ2ρ = µ3
2nσ60β0′Xn′S′−n 1Wn′R′−n11n+µ4−3σ40
2nσ06 tr(Sn−1Wn), Ωσ2λ = µ3
2nσ60β0′Xn′S′−n 1Wn′R′−n11n+µ4−3σ40
2nσ06 tr(Sn−1Wn−R−n1Wn), Ωρρ = 2µ3
nσ04
∑n i=1
(R−n1WnSn−1Xnβ0)i(R−n1WnSn−1Rn)ii+µ4−3σ40 nσ04
∑n i=1
{(Rn−1WnSn−1Rn)ii}2,
Ωρλ = µ3
nσ04
∑n i=1
(R−n1WnSn−1Xnβ0)i(2R−n1WnSn−1Rn−R−n1Wn)ii
+µ4−3σ04 nσ04
∑n i=1
(R−n1WnSn−1Rn)ii(R−n1WnSn−1Rn−R−n1Wn)ii,
Ωλλ = 2µ3
nσ04
∑n i=1
(R−n1WnSn−1Xnβ0)i(R−n1WnSn−1Rn−R−n1Wn)ii
+µ4−3σ04 nσ04
∑n i=1
{(R−n1WnSn−1Rn−R−n1Wn)ii}2,
whereµ3andµ4are the third and fourth moments ofvis, respectively, (R−n1Xn)i is thei-th row of (R−n1Xn), (R−n1WnSn−1Xnβ0)iis thei-th element of (R−n1WnS−n1Xnβ0) and (R−n1WnS−n1Rn)ii, (R−n1WnSn−1Rn −R−n1Wn)ii and (2R−n1WnSn−1Rn − R−n1Wn)iiare the (i, j)th element of (R−n1WnSn−1Rn), (R−n1WnS−n1Rn−R−n1Wn) and (2R−n1WnSn−1Rn−R−n1Wn), respectively.
B. Some useful Lemmas
Lemma 3.5.1(Proposition 8.4.13, Bernstein (2009)). Let A and B be matrices.
We useγmaxandγminto denote the largest and smallest eigenvalues of a matrix.
If A is symmetric and B is positive semi definite, then γmin(A)tr(B)≤tr(AB)≤γmax(A)tr(B).
Lemma 3.5.2 (Lee, 2002, p.256; Lee, 2004, p1918). Let {An} and {Bn} be two two sequences of n×n matrices that are uniformly bounded in both row and column sums and the elements of ann×nmatrix{Cn}beO(1)uniformly.
Then
1. the sequence{AnBn}are uniformly bounded in both row and column sums, 2. the elements of CnBn have the uniform orderO(1), and
3. the elements of An are uniformly bounded andtr(An) =O(n).
Lemma 3.5.3(Lee, 2004, p1918). The elements, thev′isofVn are assumed to be i.i.d. with zero mean and a finite variance and the fourth moment of thev′s is assumed to exist. Suppose thatAn is a square matrix with tis column sums being uniformly bounded and elements of the n×K matrix Zn are uniformly bounded. Let {Bn} be uniformly bounded either in row or column sums and their elementsbn,ij haveO(1) uniformly in i and j. Then
1. √1nZn′AnVn=Op(1)and
2. n1E(Vn′BnVn) =O(1) and 1n[Vn′BnVn−E(Vn′BnVn)] =op(1).
C. Proofs of Theorems 3-5
Proof of Theorem3
The consistency of ˆθ will follow from the uniform convergence of n1(logLn(θ)− Qn(θ)) to zero on Θ and the uniqueness identification condition that, for any ϵ > 0,lim supn→∞maxθ∈Nϵc(θ0) 1
n(Qn(θ)−Qn(θ0)) < 0, where Nϵc(θ0) is the complement of an open neighborhood ofθ0in Θ of diameterϵ(Theorem 3.4 of white (1994)).
Proof of the uniform convergence of n1(logLn(θ)−Qn(θ))
First, we shall prove the uniform convergence of n1(logLn(θ)−Qn(θ)) to zero on Θ. The proof follows from:
(a) infθ∈Θσn∗2(θ) is bounded away from zero, (b) supθ∈Θ|σˆ2n(θ)−σ∗n2(θ)|=op(1),
(c) supθ∈Θ|n1(logLn(θ)−Qn(θ))|=op(1).
Proof of (a) By the definition ofVn∗(θ), Vn∗(θ) = Rn−1(λ)(Sn(θ)Yn−Xnβn∗(θ)),
= Rn−1(λ)Sn(θ)Yn−R−n1Xn(Xn′R′−n 1(λ)Rn−1(λ)Xn)−1Xn′R′−n 1(λ)Rn−1(λ)Sn(θ)E(Yn),
= Rn−1(λ)Sn(θ)Yn−PnRn−1(λ)Sn(θ)E(Yn),
= MnR−n1(λ)Sn(θ)Yn+PnR−n1(λ)Sn(θ)(Yn−E(Yn)),
where,Pn=R−n1Xn(Xn′R′−n 1(λ)R−n1(λ)Xn)−1Xn′Rn′−1 andMn=In−Pn. From the orthogonality between the two symmetric idempotent matricesMn
andPn, we have, σn∗2(θ) = 1
nE(Vn′∗(θ)Vn∗(θ)),
= 1
nE[Yn′Sn′(θ)R′−n 1(λ)MnR−n1(λ)Sn(θ)Yn+
(Yn−E(Yn))′S′n(θ)R′−n 1(λ)PnR−n1(λ)Sn(θ)(Yn−E(Yn))],
= 1
nE(Yn′)Sn′(θ)R′−n 1(λ)MnR−n1(λ)Sn(θ)E(Yn) + 1
ntr(R′−n 1(λ)R−n1(λ)V ar(Sn(θ)Yn)).
The matrix Mn is positive semi definite because Mn is a symmetric idem-potent matrix (Lemma 14.2.14 of Harville (1997)). Thus, the first term is non-negative uniformly inθ∈Θ.
Because the matrixV ar(Sn(θ)Yn) is symmetric andγminV ar(Sn(θ)Yn)>0 from the assumption, the matrix is positive semi definite (Theorem 3.25 of Schott (2005)). By Lemma 3.5.1, the second term is
1
ntr(R′−n 1(λ)R−n1(λ)V ar(Sn(θ)Yn)) ≥ 1
nγmin(R′−n 1(λ)R−n1(λ))tr(V ar(Sn(θ)Yn)),
≥ 1 ncrcy,
> 0,uniformly inθ∈Θ.
It follow that infθ∈Θσn∗2(θ) is bounded away from zero.
Proof of (b) Noting that
Vˆn(θ) = Rn−1(λ)(Sn(θ)Yn−Xnβˆn(θ)),
= Rn−1(λ)Sn(θ)Yn−R−n1Xn(Xn′R′−n 1(λ)R−n1(λ)Xn)−1Xn′R′−n 1(λ)R−n1(λ)Sn(θ)Yn,
= MnR−n1(λ)Sn(θ)Yn. Hence,
ˆ
σ2n(θ) = 1 n
Vˆ′n(θ) ˆVn(θ),
= 1
nYn′Sn′(θ)R′−n 1(λ)MnR−n1(λ)Sn(θ)Yn. It follows that
ˆ
σn2(θ)−σn∗2(θ) = 1
nYn′Sn′(θ)R′−n 1(λ)MnR−n1(λ)Sn(θ)Yn− 1 nE(
Yn′Sn′(θ)R′−n 1(λ)MnR−n1(λ)Sn(θ)Yn
)
−1 nE(
(Yn−E(Yn))′Sn′(θ)R′−n 1(λ)PnR−n1(λ)Sn(θ)(Yn−E(Yn))) ,
= (Q1−EQ1)−EQ2,
where,Q1= n1Yn′Sn′(θ)R′−n 1(λ)MnR−n1(λ)Sn(θ)YnandEQ2= n1E(
(Yn−E(Yn))′Sn′(θ)R′−n 1(λ) PnR−n1(λ)Sn(θ)(Yn−E(Yn)))
.
To show the result, it sufficient to show Q1−EQ1
−→p 0 and EQ2 −→ 0, uniformly inθ∈Θ.
First, we show thatQ1−EQ1
−→p 0 uniformly in θ∈Θ. By Theorem 1 of Andrews (1992), the uniform convergence ofQ1−EQ1 to zero in probability follows from the pointwise convergence for eachθ∈Θ and stochastic equicon-tinuity of Q1, i.e., for any ϵ > 0, there exists a positive number δ such that lim supn→∞P(supθ∈Θsupθ′∈B(θ,δ)> ϵ)< ϵ, whereB(θ, δ) denote a closed ball in Θ of radiusδ≥0 centered atθ.
First of all, the pointwise convergence ofQ1−EQ1will be shown. We have, by the identity: Yn=Sn−1Xnβ0+Sn−1RnVn,
Q1 = 1
n(Sn−1Xnβ0+Sn−1RnVn)′S′n(θ)R′−n 1(λ)MnR−n1(λ)Sn(θ)(Sn−1Xnβ0+Sn−1RnVn),
= 1
n(β′0Xn′Sn′−1Sn′(θ)R′−n 1(λ)MnR−n1(λ)Sn(θ)Sn−1Xnβ0+ 2β′0Xn′Sn′−1S′n(θ)R′−n 1(λ)MnR−n1(λ)Sn(θ)Sn−1RnVn
+Vn′R′nSn′−1Sn′(θ)R′−n 1(λ)MnR−n1(λ)Sn(θ)Sn−1RnVn),
= Q1,1(θ) + 2Q1,2(θ) +Q1,3(θ),
whereQ1,1(θ) = 1n(β0′Xn′Sn′−1Sn′(θ)R′−n1(λ)MnRn−1(λ)Sn(θ)Sn−1Xnβ0), Q1,2(θ) =n1(β0′Xn′Sn′−1Sn′(θ)R′−n 1(λ)MnR−n1(λ)Sn(θ)Sn−1RnVn) and
Q1,3(θ) =n1(Vn′R′nSn′−1Sn′(θ)R′−n 1(λ)MnR−n1(λ)Sn(θ)Sn−1RnVn). The two terms Q1,2(θ) andQ1,3(θ) are stochastic.
For the second term, the column sums ofS′−n 1Sn′(θ)R′−n1(λ)MnR−n1(λ)Sn(θ)S−n1Rn are uniformly bounded from assumption 8 and Lemma 3.5.2 andE(Q1,2(θ)) = 0.
Thus, the pointwise convergence of Q1,2(θ)−E(Q1,2)(θ) follow from Lemma 3.5.3. Similarly, the column sums ofR′nSn′−1Sn′(θ)R′−n 1(λ)MnR−n1(λ)Sn(θ)Sn−1Rn are uniformly bounded and the pointwise convergence of Q1,3(θ)−E(Q1,3)(θ) follows from Lemma 3.5.3. Therefore,Q1−EQ1
−→p 0, for eachθ∈Θ.
Next, we show thatQ1 is stochastic equicontinuous. We have by the mean value theorem:
Q1,ℓ(θ1)−Q1,ℓ(θ2) = ∂
∂θ′Q1,ℓ(¯θ)(θ2−θ1),
≤ sup
θ∈Θ
∂
∂θ′Q1,ℓ(θ)
(θ2−θ1),
where ℓ = 1,2,3 and ¯θ lies between θ1 and θ2. For stochastic equicontinu-ous, it suffices to show that supθ∈Θ ∂
∂θ′Q1,ℓ(θ) = Op(1) by Theorem 21.10 of Davidson (1994). Let Π1 be Sn′−1S′n(θ)R′−n 1(λ)MnR−n1(λ)Sn(θ)Sn−1, Π2 be β0′Xn′Sn′−1Sn′(θ)R′−n1(λ)MnRn−1(λ)Sn(θ)Sn−1Rnand Π3beR′nSn′−1Sn′(θ)R′−n 1(λ) MnR−n1(λ)Sn(θ)Sn−1Rn. The partial derivatives ∂θ∂′Π1,ℓ take simple form and consequently∂θ∂′Π1,ℓare also uniformly bounded in both row and column sums.
ForQ1,1, for anyθ, the elements ofβ0′Xn′ ∂θ∂′Sn′−1S′n(θ)R′−n 1(λ)MnR−n1(λ)Sn(θ)Sn−1
andXnβ0 are uniformly bounded. Thus, there exists constantsc1 andc2such that|{β0′Xn′(∂θ∂′Sn′−1Sn′(θ)R′−n 1(λ)MnR−n1(λ)Sn(θ)S−n1)}i|≤c1and|(Xnβ0)i| ≤ c2 where {β0′Xn′(∂θ∂′Sn′−1Sn′(θ)R′−n 1(λ)MnR−n1(λ)Sn(θ)Sn−1)}i and (Xnβ0)i are thei-th elements of each vector. It follows that ∂
∂θ′Q1,1≤c1c2 =O(1). For Q1,2, for any θ, ∂θ∂′Π1,2,i≤c3 where ∂θ∂′Π1,2,i is the i-th element of ∂θ∂′Π1,2. Therefore, from Lemma 3.5.3, P(∂θ∂′Q1,2 > M)
≤ P(n1∑n
i=1c3vi > M)
= O(
n−12)
. For Q1,3, for any θ, ∂
∂θ′Π1,3,ij ≤ c4 where where ∂θ∂′Π1,3,ij is the (i, j)th element of ∂θ∂′Π1,3. Thus, from Lemma 3.5.3, P(∂θ∂′Q1,3 > M) P(n1∑n ≤
i=1
∑
j=1c4vivj > M)
= O(1). Thus, supθ∈Θ ∂
∂θ′Q1,ℓ(θ) = Op(1) It follow thatQ1 is stochastic equicontinuous. Hence, by Theorem 1 of Andrews (1992),Q1−EQ1−→p 0 uniformly inθ∈Θ.
Secondly, we show thatEQ2−→0, uniformly inθ∈Θ. There existcxsuch that
0 < cx ≤ infλ∈Λγmin
(1
nXn′Rn′−1Rn−1X)
from assumption. By Assumption, Lemma 3.5.1 and 3.5.2 and theorem 3.4 of Schott (2005), We have,
EQ2 = 1 nE(
(Yn−E(Yn))′Sn′(θ)R′−n 1(λ)PnR−n1(λ)Sn(θ)(Yn−E(Yn))) ,
= 1
ntr(R′−n 1(λ)PnR−n1(λ)V ar(Sn(θ)Yn)),
= 1
ntr(R′−n 1(λ)R−n1Xn(Xn′Rn′−1(λ)R−n1(λ)Xn)−1Xn′R′−n1Rn−1(λ)V ar(Sn(θ)Yn)),
≤ 1
nγmin−1 (Xn′R′−n1Rn−1X)γmax2 (Rn′−1(λ)R−n1(λ))γmax(V ar(Sn(θ)Y))tr(Xn′Xn)),
= 1
nγmin−1
(Xn′Rn′−1Rn−1X n
)
γmax2 (Rn′−1(λ)R−n1(λ))γmax(V ar(Sn(θ)Y))1
ntr(Xn′Xn)),
≤ 1
nc−x1c2rcy
1
ntr(Xn′Xn)),
= O( n−1)
Hence,EQ2−→0, uniformly inθ∈Θ.
Therefore, supθ∈Θ|σˆ2n(θ)−σ∗n2(θ)|=op(1), completing the proof of (b).
Proof of (C) We show that supθ∈Θ1
n(logLn(θ)−Qn(θ))=op(1). Note that 1
n(logLn(θ)−Qn(θ)) =−1
2(log ˆσn2(θ)−logσ∗n2(θ)).
By the Taylor expansion,
log ˆσn2(θ)−logσn∗2(θ)= 1
˜
σ2n(θ)σˆn2(θ)−σn∗2(θ),
where ˜σn2(θ) lies between ˆσ2n(θ) andσ∗n2(θ). From the proof (a) and (b), it follow that ˆσ2n(θ) is uniformly bounded away from zero on Θ. Moreover, ˜σn2(θ) is also uniformly bounded away from zero on Θ because ˜σ2n(θ) exists between ˆσn2(θ)
andσ∗n2(θ) and thereby ˜σ21
n(θ) is uniformly bounded. As ˆσ2n(θ)−σ∗n2(θ) coverges in probability to zero uniformly on Θ,|log ˆσn2(θ)−logσn∗2(θ)|=op(1) uniformly on Θ.
Consequently, supθ∈Θ1
n(logLn(θ)−Qn(θ))=op(1).
Proof of the identification uniqueness condition
Secondly, we shall prove the identification uniqueness condition. The proof follow from:
(i) n1Qn(θ) is uniformly equicontinuous on Θ.
(ii) Show some properties of an auxiliary model.
(iii) Show that the identification uniqueness condition holds.
Proof of (i) We show thatn1Qn(θ) = 12(log 2π+1)−12logσ∗n2(θ)−n1log|Rn(λ)|+
1
nlog|Sn(θ)|is uniformly equicontinuous on Θ. It is sufficient to show that par-tial derivatives of each term are uniformly bounded. The uniform continuity of logσ∗n2(θ) on Θ follows because σ∗21
n(θ) is uniformly bounded since σ∗n2(θ) is uniformly bounded away form zero on Θ. For 1nlog|Rn(λ)|, dλd n1log|Rn(λ)|=
1
ntr(R−n1(λ)Wn). From assumption and Lemma 3.5.2, the elements ofR−n1(λ)Wn
are uniformly bounded. Thus, 1ntr(R−n1(λ)Wn) =O(1) from Lemma 3.5.2. Sim-ilarly, ∂θ∂ 1nlog|Sn(θ)| =O(1). Hence, 1nQn(θ) is uniformly equicontinuous on Θ.
Proof of (ii) It is useful to establish an auxiliary process:
Yn=λWnYn+ρWnYn+Rn(λ)Vn,
where Vn ∼ N(0, σ20In). The log-likelihood function of the above auxiliary process is given by
logLp,n(θ, σ2) = −n
2log(2π)−n
2log(σ2(θ))−log|Rn(λ)|+ log|Sn(θ)|
− 1
2σ2Yn′Sn′(θ)R′−n 1(λ)R−n1(λ)Sn(θ)Yn.
Let Ep be the expectation under this auxiliary process. Define Qp,n(θ) = maxσ2Ep(logLp,n(θ)). The optimal solutions of this maximization problem is
σn2(θ) = 1
nEp(Yn′Sn′(θ)R′−n1(λ)R−n1(λ)Sn(θ)Yn),
= σ2
ntr(RnSn−1Sn′(θ)R′−n 1(λ)R−n1(λ)Sn(θ)Sn−1Rn).
Hence,
Qp,n(θ) =−n
2 log(2π+ 1) +n
2logσn2(θ)−log|Rn(λ)|+ log|Sn(θ)|.
By Shannon-Kolmogorov Information Inequality (Ferguson (1996), p113), Qp,n(θ)≤Qp,n(θ0) for all θ∈Θ. This implies that n1(Qp,n(θ)−Qp,n(θ0)≤0 for allθ∈Θ.
Proof of (iii) We show that the identification uniqueness condition holds by contradiction.
1
n(Qn(θ)−Qn(θ0)) = −1
2logσn∗2(θ)−log|Rn(λ)|+ log|Sn(θ)| − (
−1
2logσ02−log|Rn|+ log|Sn| )
= (
−1
2(logσ2n(θ)−logσ20)−1
n(log|Rn(λ)| −log|Rn|) + 1
n(log|Sn(θ)| −log|Sn|) )
−1
n(logσn∗2(θ)−logσn2(θ)),
= 1
n
(Qp,n(θ)−Qp,n(θ0))
−1
2(logσ∗n2(θ)−logσ2n(θ)).
Moreover,
σ∗n2(θ)−σ2n(θ) = 1
nβ0′Xn′Sn′−1Sn′(θ)R′−n 1(λ)MnR−n1(λ)Sn(θ)Sn−1Xnβ0. Mn is positive semi definite and thereby σ∗n2(θ)−σ2n(θ) ≥ 0. This implies
−12(logσ∗n2(θ)−logσn2(θ))≤0.
Now, suppose that the identification uniqueness condition does not hold.
Then, there exists anϵ >0 and a sequence{θn}inNϵc(θ0) such that limn→∞ 1 n
(Qn(θ)− Qn(θ0))
= 0. By the compactness of Nϵc(θ0), there exists a convergent subse-quence {θnm} of {θn} with the limit θ+ of θnm being in Nϵc(θ0). This implies thatθ+̸=θ0. As n1Qn(θ) is uniformly equicontinuous, limnm→∞ 1
nm
(Qnm(θ+)− Qnm(θ0))
= 0. Because n1(
Qp,n(θ)−Qp,n(θ0))
≤ 0 and −12(
logσ∗n2(θ) − logσn2(θ))
≤0, this is possible only if limnm→∞n1
m
(Qnm(θ+)−Qnm(θ0))
= 0 and−12(
logσn∗2(θ)−logσ2n(θ))
≤0. However, limn→∞1nβ0′Xn′S′−n 1Sn′(θ)R′−n1(λ)MnRn−1(λ)Sn(θ)S−n1Xnβ0̸= 0 from the assumption in Theorem 3 . Thus, −12(
logσn∗2(θ)−logσn2(θ))
< 0 and consequently
limnm→∞n1
m
(Qnm(θ+)−Qnm(θ0))
̸
= 0. This is a contradiction. Therefore, the identification uniqueness condition must hold.
The consistency of ˆθfollow form uniform convergence and the identification uniqueness condition. This completes the proof of the theorem.
Proof of Theorem 4
We have by the Taylor expansion,
0 = 1
√n
∂logLn( ˆψn)
∂ψ ,
= 1
√n
∂logLn(ψ0)
∂ψ +
(1 n
∂2logLn( ¯ψn)
∂ψ∂ψ′ )√
n( ˆψn−ψ0),
where ¯ψnlies between ˆψnandψ0. Thus, the asymptotic normality of ˆψnfollows if
(a) √1n
∂logLn(ψ0)
∂ψ
−→D N(
0,limn→∞Γ(ψ0)) , (b) n1∂2log∂ψ∂ψLn(ψ′ 0)−E(1
n
∂2logLn(ψ0)
∂ψ∂ψ′
) p
−→0, and (c) n1∂2log∂ψ∂ψLn( ¯′ψn)−n1∂2log∂ψ∂ψLn(ψ′ 0)
−→p 0.
Proof of (a) The asymptotic normality of √1n
∂logLn(ψ0)
∂ψ follows from the central limit theorems for linear-quadratic forms in Kelejian and Prucha (2001).
We need to check that the score vector holds Assumptions in Kelejian and Prucha (2001). To check assumptions for asymptotic normality, it is sufficient to show some matrices hold desired boundaly conditions. From assumptions of this paper and Lemma 3.5.2, (R′nSn′−1Wn′R′−n 1−Wn′Rn′−1) andR′nSn′−1Wn′R′−n 1 are uniformly bounded in column sums, and the elements of Xn′Sn′−1Wn′R′−n 1 are uniformly bounded. Thus, each score function holds the assumptions and the asymptotic normality of each score function follows. Finally, the Cram´ er-Wold devise (Proposition 6.3.1 of Brockwell and Davis (1991)) leads to the joint asymptotic normality.
Proof of (b) LetDψψ be 1n∂2log∂ψ∂ψLn(ψ′ 0)−E(1
n
∂2logLn(ψ0)
∂ψ∂ψ′
). Then, Dψψ has the elements:
Dββ′ = 0, Dβσ2 = − 1
nσ04Xn′R′−n 1Vn, Dβρ = − 1
nσ02Xn′R′−n 1R−n1WnSn−1RnVn, Dβλ = 1
nσ20Xn′(R′−n1Wn′R′−n 1+R′−n 1R−n1W −R′−n 1R−n1WnSn−1Rn)Vn, Dσ2σ2 = 1
σ40 − 1 nσ06Vn′Vn, Dσ2ρ = − 1
nσ04β0′Xn′Sn′−1Wn′R′−n 1Vn− 1
nσ04(Vn′R′nSn′−1Wn′R′−n 1Vn−σ20tr(S′−n 1Wn′)), Dσ2λ = − 1
nσ04β0′Xn′Sn′−1Wn′R′−n 1Vn+ 1
nσ04(Vn′Wn′R′−n 1Vn−σ20tr(Wn′R′−n 1))
− 1
nσ04(Vn′R′nSn′−1Wn′R′−n 1Vn−σ02tr(Sn′−1Wn′)), Dρρ = − 2
nσ0
β0′Xn′Sn′−1Wn′R′−n 1R−n1WnSn−1RnVn
− 1
nσ02(Vn′R′nSn′−1Wn′R′−n 1R−n1WnSn−1RnVn−σ02tr(R′nS′−n 1Wn′R′−n1Rn−1WnSn−1Rn)), Dρλ = 1
nσ20β0′Xn′(S′−n 1Wn′Rn′−1Wn′R′−n 1+Sn′−1Wn′R′−n 1R−n1Wn−2Sn′−1Wn′R′−n 1R−n1WnSn−1Rn)Vn
+ 1
nσ02(Vn′R′nSn′−1Wn′R′−n 1Wn′R′−n 1Vn−σ02tr(Sn′−1Wn′R′−n 1Wn′)) + 1
nσ02(Vn′R′nSn′−1Wn′R′−n 1R−n1WnVn−σ20tr(R′nSn′−1Wn′R′−n 1R−n1Wn))
− 1
nσ02(Vn′R′nSn′−1Wn′R′−n 1R−n1WnSn−1RnVn−σ02tr(R′nS′−n 1Wn′R′−n1Rn−1WnSn−1Rn)), Dλλ = 1
nσ20β0′Xn′(2Sn′−1Wn′R′−n 1Wn′R′−n 1+Sn′−1Wn′R′−n 1R−n1Wn−2Sn′−1Wn′R′−n 1R−n1WnSn′−1Rn
−2R′−n 1Wn′R′−n 1Wn′Rn′−1−R′−n 1Wn′R′−n 1R−n1Wn+ 2R′−n 1Wn′Rn′−1Rn−1WnSn−1Rn
+2R′−n 1Wn′R′−n 1Wn′Rn−1+R′−n 1Wn′R′−n 1R−n1Wn−R′−n 1Wn′R′−n1Rn−1WnSn−1Rn)Vn
+ 2
nσ02(Vn′R′nSn′−1Wn′R′−n 1Wn′R′−n 1Vn−σ02tr(Sn′−1Wn′R′−n 1Wn′)) + 1
nσ02(Vn′R′nSn′−1Wn′R′−n 1R−n1WnVn−σ20tr(R′nSn′−1Wn′R′−n 1R−n1Wn))
− 1
nσ02(Vn′R′nSn′−1Wn′R′−n 1R−n1WnSn−1RnVn−σ02tr(R′nS′−n 1Wn′R′−n1Rn−1WnSn−1Rn))
− 2
nσ02(Vn′Wn′R′−n 1Wn′R′−n 1Vn−σ02tr(Wn′Rn′−1Wn′R′−n 1)) + 1
nσ02(Vn′Wn′R′−n 1R−n1WnVn−σ20tr(Wn′R′−n 1R−n1Wn)) 1 ′ ′ ′−1 −1 −1
− 2 ′ ′−1 −1 −1 54
Thus, the elements ofDψψare decomposed into sums of the forms: 1nXn′An(θ)Vn,n1β0′Xn′An(θ)Vn,
1
n(Vn′An(θ)Vn−E(Vn′An(θ)Vn)) andσ14 0−nσ16
0
Vn′Vn, where a matrixAn(θ) is
uni-formly bounded in both row and column sums. From Lemma 3.5.3, 1nXn′An(θ)Vn,n1β′0Xn′An(θ)Vn
and n1(Vn′An(θ)Vn −E(Vn′An(θ)Vn)) are convergence to zero in probability.
Moreover, σ14
0 − nσ16 0
Vn′Vn −→p 0 because n1VnVn −→p σ20 by the law of large numbers. Therefore, it follow that n1∂2log∂ψ∂ψLn(ψ′ 0)−E(1
n
∂2logLn(ψ0)
∂ψ∂ψ′
) p
−→0.
Proof of (c) From Lemma 3.5.2 and 3.5.3, it is easy to show thatn1∂2log∂ψ∂ψLn( ¯′ψn) = Op(1) and 1n∂2log∂ψ∂ψLn(ψ′ 0) = Op(1). Here, ¯σ−r = σ−0r+op(1), r = 2,4,6 be-cause ¯σ2 −→p σ02 and σr appears inHn(ψ) ≡ ∂ψ∂ψ∂2 ′logLn(ψ) multiplicatively, thus it results in an asymptotically negligible error to replace ¯σ2 by σ02. The elements of the Hessian matrix, Hn(ψ) ≡ ∂ψ∂ψ∂2 ′logLn(ψ), are decomposed into sums of terms of the forms: Xn′An(θ)Xn, Xn′An(θ)Yn, Xn′An(θ)V(θ), Yn′An(θ)Yn,2σn4−σ16Vn′(θ)Vn(θ), Yn′An(θ)Vn(θ), Vn′(θ)An(θ)Vn(θ) andtr(An(θ)), where a matrix An(θ) is uniformly bounded in both row and column sums.
Therefore, it is sufficient to show that the difference between each term at ¯ψ andψ0 converges to zero in probability and moreover this can be easily shown.
We show some examples corresponding each term of the Hessian matrix.
Noting that
R−n1(λ)−R−n1 = R−n1(λ)(Rn−Rn(λ))R−n1,
= (λ0−λ)R−n1(λ)WnR−n1. ForXn′An(θ)Xn,
1
nXn′R′−n 1(¯λ)R−n1(¯λ)Xn−1
nXn′R′−n 1R−n1Xn = 1
nXn′(Rn′−1(¯λ)−R′−n 1+R′−n 1)R−n1(¯λ)Xn− 1
nXn′R′−n 1R−n1Xn,
= 1
nXn′(Rn′−1(¯λ)−R′−n 1)R−n1(¯λ)Xn+ 1
nXn′R′−n 1R−n1(¯λ)Xn−1
nXn′R′−n 1R−n1Xn,
= (λ0−¯λ)1
nXnR′−n 1(λ)Wn′R−n1R−n1(¯λ)Xn +(λ0−λ)¯ 1
nXn′R′−n 1R−n1(λ)WnR−n1Xn,
= op(1)O(1) +op(1)O(1),
= op(1).
Moreover, the convergence ofXn′An(θ)Yn is shown similarly.
Noting that
Vn(θ) = R−n1(λ)Rn(λ)Vn(θ),
= R−n1(λ)(S(θ)Yn−Xnβ),
= R−n1(λ)((λ0−λ)WnYn+ (ρ0−ρ)WnYn+Xn(β0−β) +RnVn).
Thus, forXn′An(θ)V(θ), 1
nXn′R′−n 1(¯λ)Vn(¯θ)− 1
nXn′R′−n 1Vn = (
(λ0−λ) + (ρ¯ 0−ρ)¯)1
nXn′R′−n 1(¯λ)WnYn+ 1
nXn′R′−n 1(¯λ)Xn(β0−β) +1
nXn′R′−n 1(¯λ)RnVn− 1
nXn′R′−n1Vn,
= op(1)Op(1) +Op(1)op(1) +op(1) +op(1),
= op(1),
where the convergence of last two terms follow from Lemma 3.5.3.
Here, 1
nVn′(¯θ)Vn(¯θ) = (
(λ0−λ) + (ρ¯ 0−ρ)¯)21
nYn′Wn′R′−n 1(¯λ))R−n1(¯λ)WnYn
+(β0−β)′1
nXn′R′−n 1(¯λ)R−n1(¯λ)Xn(β0−β) +1
nVn′R′nR′−n 1(¯λ)Rn−1(¯λ)RnVn
+2 n
((λ0−λ) + (ρ¯ 0−ρ)¯)
Yn′Wn′R′−n 1(¯λ)Rn−1(¯λ)Xn(β0−β) +2
n
((λ0−λ) + (ρ¯ 0−ρ)¯)
Yn′Wn′R′−n 1(¯λ)Rn−1(¯λ)RnVn+ (β0−β)′2
nXn′Rn′−1(¯λ)R−n1(¯λ)RnVn,
= op(1)Op(1) +op(1)O(1)op(1) +σ20+op(1)Op(1)op(1) +op(1)Op(1) +op(1)op(1),
= σ02+op(1).
It follows that 2σ14 0 −nσ16
0
Vn′(θ)Vn(θ) =op(1).
Before next proof, we show an example. Yn′Sn(θ)Vn = β′Xn′Sn−1S(θ)Vn + Vn′R′nSn−1Sn(θ)Vn and
1
nVn′Rn′Sn−1Sn(θ)Vn− 1
nVn′R′nSn−1SnVn = (
(λ0−λ) + (ρ0−ρ))1
nVn′Rn′Sn−1Vn,
= op(1)Op(1),
= op(1).
It follows that n1Yn′Sn(θ)Vn−n1Yn′SnVn =op(1) and similarly n1Yn′An(θ)Vn−
1
nYn′AnVn =op(1) and n1Yn′An(θ)Yn−1nYn′AnYn=op(1) where An isAn(θ) at true valueθ0.
Now, forYn′An(θ)Vn(θ), 1
nYn′Wn′R′−n 1(λ)Vn(θ)−1
nYn′Wn′R′−n 1Vn = (
(λ0−λ) + (ρ¯ 0−ρ)¯)1
nYn′Wn′R′−n 1(λ)R−n1(λ)WnYn +1
nYn′Wn′R′−n 1(λ)R−n1(λ)Xn(β0−β)¯ +1
nYn′Wn′R′−n 1(¯λ)R−n1(¯λ)RnV −1
nYn′Wn′R′−n 1Vn
= op(1)Op(1) +Op(1)op(1) +op(1)
= op(1).
Moreover, the convergence ofVn(θ)′An(θ)Vn(θ) is also shown similary.
Finally, fortr(An(θ)), by the Taylor expansion, 1
ntr(R−n1(λ)WnR−n1(λ)Wn)−1
ntr(R−n1WnR−n1Wn) = d
dλtr(Rn−1(˜λ)WnRn−1(˜λ)Wn)(¯λ−λ0),
= O(1)op(1),
= op(1), where ˜λlies between ¯λandλ0.
The convergence of the other elements of the Hessian matrix are shown similarly, hence n1∂2log∂ψ∂ψLn( ¯′ψn)−n1∂2log∂ψ∂ψLn(ψ′ 0)
−→p 0.
This completes the proof of the theorem.
Proof of Theorem 5
The estimator forαis ˆ
αn= (1−λ) logˆ (1
n
∑n i=1
exp{(R−n1(ˆλ)[S(ˆθ)Yn−Znδ])ˆ i} )
, Here,
S(ˆθ)Yn−Znδˆ = Yn−λWˆ nYn−ρWˆ nYn−Znδ,ˆ
= (λ0−λ)Wˆ nYn+ (ρ0−ρ)Wˆ nYn+Zn(δ0−δ) +ˆ α01n+RnVn,
= D+α01n+RnVn,
whereD= (λ0−λ)Wˆ nYn+ (ρ0−ρ)Wˆ nYn+Zn(δ0−δ).ˆ BecauseR−n1(ˆλ)(S(ˆθ)Yn−Znδ) =ˆ α0
1−λˆ1n+R−n1(ˆλ)D+R−n1(ˆλ)RnVn, 1
n
∑n i=1
exp{(R−n1(ˆλ)[S(ˆθ)Yn−Znδ])ˆ i}= exp ( α
1−λ )1
n
∑n i=1
exp{(R−n1(ˆλ)D+R−n1(ˆλ)RnVn)i}. Thus,
ˆ
α−α0= (1−ˆλ) log (1
n
∑n i=1
exp{(R−n1(ˆλ)D+R−n1(ˆλ)RnVn)i} )
. (3.10)
To prove consistency, it is sufficient that the right side of (3.10) converges to zero in probability.
By the Taylor expansion, 1
n
∑n i=1
exp{(R−n1(ˆλ)D+R−n1(ˆλ)RnVn)i} = 1 + 1 n
∑n i=1
exp(bi){
(R−n1(ˆλ)D+R−n1(ˆλ)RnVn)i
}
= 1 + 1
nb′(R−n1(ˆλ)D+R−n1(ˆλ)RnVn), wherebi lies between 0 and (Rn−1(ˆλ)D+R−n1(ˆλ)RnVn)i, andb= (b1, . . . , bn)′.
From Assumptions, Theorem 3 and Lemma 3.5.2 and 3.5.3, 1
nb′(R−n1(ˆλ)D+R−n1(ˆλ)RnVn) = (λ0−λ)ˆ 1
nb′R−n1(ˆλ)WnYn+ (ρ0−ρ)ˆ 1
nb′R−n1(ˆλ)WnYn
+1
nb′R−n1(ˆλ)Zn(δ0−δ) +ˆ 1
nb′R−n1(ˆλ)RnVn,
= op(1)Op(1) +op(1)Op(1) +O(1)op(1) +op(1),
= op(1).
Thus, n1∑n
i=1exp{(R−n1(ˆλ)D+R−n1(ˆλ)RnVn)i}−→p 1 and (1−λ) logˆ (1
n
∑n
i=1exp{(R−n1(ˆλ)D+R−n1(ˆλ)RnVn)i}) p
−→0.
Bibliography
[1] Andrews, D. W. (1992). Generic uniform convergence. Econometric theory, 8(2), 241-257.
[2] Bernstein, D. S. (2009). Matrix mathematics: Theory, facts, and formulas.
2nd edition. Princeton: Princeton University Press.
[3] Bollerslev, T. (1986). Generalized autoregressive conditional heteroskedas-ticity. Journal of econometrics, 31(3), 307-327.
[4] Bollerslev, T. (1990). Modelling the coherence in short-run nominal ex-change rates: a multivariate generalized ARCH model. The review of eco-nomics and statistics, 498-505.
[5] Bollerslev, T., Engle, R. F., & Wooldridge, J. M. (1988). A capital asset pricing model with time-varying covariances. Journal of political Economy, 96(1), 116-131.
[6] Borovkova, S. & Lopuhaa, R., (2012). Spatial GARCH: A Spa-tial Approach to Multivariate Volatility Modeling. Available at SSRN:
https://ssrn.com/abstract=2176781.
[7] Brockwell, P. J., & Davis, R. A. (1991). Time series: theory and methods.
Springer-Verlag.
[8] Caporin, M., and Paruolo, P., 2009, Structured multivariate volatility mod-els, Available at SSRN: http://ssrn.com/abstract=1318639.
[9] Davidson, J. (1994). Stochastic limit theory. Oxford: Oxford University Press.
[10] Doan, O., & Tapnar, S. (2013). GMM estimation of spatial autoregressive models with moving average disturbances. Regional Science and Urban Economics, 43(6), 903-926.
[11] Engle, R. F. (1982). Autoregressive conditional heteroscedasticity with es-timates of the variance of United Kingdom inflation. Econometrica, 50, 987-1007.
[12] Engle, R. F., & Kroner, K. F. (1995). Multivariate simultaneous generalized ARCH. Econometric theory, 11(1), 122-150.
[13] Fan, J., & Yao, Q. (2003). Nonlinear time series: nonparametric and para-metric methods. Springer Science & Business Media.
[14] Ferguson, T. S. (1996). A course in large sample theory. Chapman & Hall.
[15] Harville, D. A. (1997). Matrix algebra from a statistician’s perspective.
New York: Springer.
[16] Kelejian, H. H., & Prucha, I. R. (1997). Estimation of spatial regression models with autoregressive errors by two-stage least squares procedures: a serious problem. International regional science review, 20(1-2), 103-111.
[17] Kelejian, H. H., & Prucha, I. R. (1998). A generalized spatial two-stage least squares procedure for estimating a spatial autoregressive model with autoregressive disturbances. The Journal of Real Estate Finance and Eco-nomics, 17(1), 99-121.
[18] Kelejian, H. H., & Prucha, I. R. (2001). On the asymptotic distribution of the Moran I test statistic with applications. Journal of Econometrics, 104(2), 219-257.
[19] Kelejian, H. H., & Robinson, D. P. (1993). A suggested method of esti-mation for spatial interdependent models with autocorrelated errors, and an application to a county expenditure model. Papers in regional science, 72(3), 297-312.
[20] Lee, L. F. (2002). Consistency and efficiency of least squares estimation for mixed regressive, spatial autoregressive models. Econometric theory, 18(2), 252-277.
[21] Lee, L. F. (2004). Asymptotic Distributions of Quasi‐ Maximum Likelihood Estimators for Spatial Autoregressive Models. Econometrica, 72(6), 1899-1925.
[22] Lee, L. F. (2007). GMM and 2SLS estimation of mixed regressive, spatial autoregressive models. Journal of Econometrics, 137(2), 489-514.
[23] Lee, P. M. (2012). Bayesian statistics: an introduction. John Wiley & Sons.
[24] Robinson, P. M. (2009). Large‐ sample inference on spatial dependence.
The Econometrics Journal, 12(s1).
[25] Sato, T. and Matsuda, Y. (2017). Spatial Autoregressive Conditional Het-eroskedasticity Models. J.Japan Statist. Soc., Vol. 47, 2
[26] Schott, J. R. (2005). Matrix analysis for statistics. 2nd edition. John Wiley
& Sons.
[27] Su, L., & Yang, Z. (2015). QML estimation of dynamic panel data models with spatial errors. Journal of Econometrics, 185(1), 230-258.
[28] White, H. (1994). Estimation, inference and specification analysis. New York: Cambridge university press.
[29] Yan, J. (2007). Spatial stochastic volatility for lattice data. Journal of agri-cultural, biological, and environmental statistics, 12(1), 25.
[30] Yang Z. (2018). Unified M-Estimation of Fixed-Effects Spatial Dynamic Models with Short Panels. Journal of Econometrics, 205(2), 423-427.
[31] Yu, J., De Jong, R., & Lee, L. F. (2008). Quasi-maximum likelihood esti-mators for spatial dynamic panel data with fixed effects when both n and T are large. Journal of Econometrics, 146(1), 118-134.
Chapter 4
SARAR-GARCH models
Abstract
This study proposes spatio-temporal extensions of time series generalized au-toregressive conditional heteroskedasticity (GARCH) models. We call spatio-temporally extended GARCH models as spatial autoregressive models with spa-tial autoregressive error and generalized autoregressive conditional heteroskedas-ticity processes, namely SARAR-GARCH models. One important problem which multivariate volatility models contain is the curse of dimensionality. To overcome the problem, we adopt a spatial weight matrix which expresses the dependence relation between observations. A spatial weight matrix is usually determined by geographical information of spatial data. However, financial data doesn’t include geographical information. Therefore, we propose a method to make spatial weight matrix from financial data by stepwise backward regres-sions. Parameters are estimated by a two step procedure. First step is the estimation of spatial parameters and second step is that of GARCH param-eters. In real data analysis, We apply the SARAR-GARCH model to daily returns of the Nikkei 225 stock price data and S&P 500 stock price data. We compare the in-sample and out-sample performances of SARAR-GARCH mod-els with those of CCC modmod-els which is a benchmark. The results show the in-sample performance of the CCC model is better because the CCC model contains many more parameters. However, the out-sample performance of the SARAR-GARCH model are better than that of the CCC model in both markets analysis.
4.1 Introduction
Volatility which is a conditional variance in a model is one of the most important concepts in financial econometrics because it is used in widely areas such as risk management, option pricing and portfolio selection. Financial market data often exhibits volatility clustering (i.e., volatility may be high for certain time periods
and low for other periods) This means time-varying volatility is more common than constant volatility. Therefore, accurate modeling of time-varying volatility is important in financial econometrics.
The seminal work of Engle (1982) proposes autoregressive conditional het-eroscedasticity (ARCH) models and the most important extension of the model is generalized ARCH (GARCH) models proposed by Bollerslev (1986). The models have been widely used to identify volatilities. After that, many ex-tended GARCH models have been proposed. For example, integrated GARCH models ( Engle and Bollerslev (1986)), exponential GARCH models (Nelson (1991)), threshold GARCH models (Glosten, etal (1993)), GARCH in the mean models, and GJR-GARCH models are proposed.
Univariate volatility models are generalized to multivariate cases in many ways. One important problem which multivariate volatility models contain is the curse of dimensionality. We estimate a conditional covariance matrix which has n(n+1)2 quantities for a n-dimensional time series, therefor it is difficult to estimate all quantities. Thus, we attempt to give a conditional covariance ma-trix some simple structures to reduce the number of parameters. For example, exponentially weighted moving average models, constant conditional correlation models (Bollerslev (1990)), BEKK models (Engle and Kroner (1995)), orthogo-nal GARCH models (Alexander (2001) ), dynamic conditioorthogo-nal correlation mod-els (Tse and Tsui (2002)), dynamic orthogonal component modmod-els, and factor GARCH models are proposed.
The ideas of spatial econometrics have been applied to volatility models to reduce number of parameters in a covariance matrix in recent years. Caporin and Paruolo (2008) and Borovkova and Lopuhaa (2012) have applied the ideas of spatial econometrics to time series multivariate GARCH models. Yan (2007) and Robinson (2009) have done spatial extensions of stochastic volatility models which are another kind of volatility models. Sato and Matsuda (2017, 2018) have extend time series GARCH models to spatial models.
This paper contributes to extend GARCH models to spatiotemporal mod-els which we call spatial autoregressive modmod-els with spatial autoregressive error and generalized autoregressive conditional heteroskedasticity processes, namely SARAR-GARCH models by using spatial econometrics ideas. The model is characterized by a spatial weight matrix which express cross-section correla-tions between assets and used to reduce the number of parameters. A spatial weight matrix is usually determined by geographical information of spatial data.
However, financial data doesn’t include geographical information. Therefore, we propose a method to make spatial weight matrix from financial data. we ap-ply the multiple linear regression model and stepwise backward regression to calculate spatial weights in spatial weight matrices. Parameters are estimated by a two step procedure. First step is the estimation of spatial parameters and second step is that of GARCH parameters. Spatial parameters are estimated in first step. We regard volatilities in the model as constant variance and we apply quasi-maximum likelihood method with the model. After that we ap-ply GARCH models with residuals derived from first step in second step. In
real data analysis, We apply the SARAR-GARCH model to daily returns of the Nikkei 225 stock price data and S&P 500 stock price data. We compare the in-sample and out-sample performances of SARAR-GARCH models with those of CCC models. First, we check the in-sample performances based on log-likelihood. The results show the log-likelihood of the CCC model is grater than that of SARAR-GARCH. This means model fitting of the CCC model is better. One reason is that the number of parameters in CCC models is more than five times of those of SARAR-GARCH models. Secondly, we compare out-sample performances by using quasi-likelihood loss function. The result shows the quasi-likelihood loss function of SARAR-GARCH models are smaller than that of CCC models. Then, the out-sample performance of SARAR-GARCH models is better. One reason is the CCC model may be over-fitting and it cause lower forecasting performance. Moreover, SARAR-GARCH models have bet-ter prediction performance in U.S. market analysis because stock price in U.S.
market are more volatile and proposed models can capture sharp fluctuations.
The rest of paper proceeds as follows. Section 4.2 introduces SARAR-GARCH models. The estimation procedures are described in section 4.3. Sec-tion 4.4 examines empirical properties of SARAR-GARCH models by applying the models to real data such as stock price in the Japanese and the U.S. market.
Section 4.5 discusses some concluding remarks.