Bayesian estimation considering spatial correlation structure

3. Spatial-temporal process

3.2 Bayesian estimation considering spatial correlation structure

To improve the estimation and to enable local estimation, this research proposes Bayesian estimation that includes spatial correlation. The approach combines prior information on the unknown parameters that characterize soil behavior (e.g. model parameters or soil properties) with the observation data (e.g. observed settlement or movement) from all observation points to search for the best estimates of the unknown parameters. The formulation consists of two statistical components, namely the observation model and the prior information model. These two models are then combined by Bayes’ theorem to obtain the solution.

3.2.1 Observation model

This model relates the observation data to the unknown parameters, which are defined as a multivariate stochastic Gaussian field $Z(x) = [z_1(x), z_2(x), \ldots, z_P(x)]^T$, where $x$ is a spatial coordinate vector; $P$ is the total number of unknown parameters; and $z_i(x)$ (for $i = 1, 2, \ldots, P$) is a random function of an unknown parameter (e.g. soil or model parameter) at any location $x$ in a specific domain. This paper proposes a method to identify the best estimator of $Z$ for a discrete spatial point field, $x_1, x_2, \ldots, x_n, x_{n+1}, \ldots, x_m$, which is defined as

$$\hat{Z} = \left[ \hat{z}_1, \hat{z}_2, \ldots, \hat{z}_P \right]^T \tag{3-1}$$

where

$$\hat{z}_i = \left[ \hat{z}_i(x_1), \hat{z}_i(x_2), \ldots, \hat{z}_i(x_m) \right]; \quad i = 1, 2, \ldots, P \tag{3-2}$$

Suppose that a set of observations $Y^k$ (e.g. ground settlement) at the discrete time step $k$, i.e. $k = 1, 2, \ldots, K$, has been obtained at $n$ observation points $x_1, x_2, \ldots, x_n$. $Y^k$ is defined as

$$Y^k = \left[ y^k(x_1), y^k(x_2), \ldots, y^k(x_n) \right] \tag{3-3}$$

It should be noted that xn+1, xn+2, ... , xm are defined as any arbitrary points at which the unknown parameters are to be estimated, i.e. m-n interpolation points.

General formulations of the observation model have been presented in the literature, e.g. Hoshiya and Yoshida (1996) and Honjo and Kashiwagi (1999). Here it is assumed that the observation Yk is expressed as a linear function of Z with an observation error ε as follows:

$$Y^k = M^k Z + \varepsilon \tag{3-4}$$

where $\varepsilon$ is the Gaussian observation error vector, which is assumed to follow $N(0, V_\varepsilon)$. $V_\varepsilon$ is the covariance matrix of $\varepsilon$, with $V_\varepsilon = \sigma_\varepsilon^2 I_{n,n}$, where $\sigma_\varepsilon^2$ is the variance of the observation error and $I_{n,n}$ is the $n \times n$ identity matrix. This implies that the observation errors are assumed to be spatially independent.

Mk is the n × (P·m) coefficient matrix, which is defined as

$$M^k = \left[ M^1_{n,n}, \; 0_{n,m-n}, \; M^2_{n,n}, \; 0_{n,m-n}, \; \ldots, \; M^P_{n,n}, \; 0_{n,m-n} \right] \tag{3-5}$$

where $M^i_{n,n}$ denotes the $n \times n$ coefficient matrix relating $z_i$ to $Y^k$, and $0_{n,m-n}$ denotes the $n \times (m-n)$ zero matrix, which is attached to each $M^i_{n,n}$ to eliminate the unknown parameters at the $m-n$ arbitrary points (i.e. $x_{n+1}, x_{n+2}, \ldots, x_m$) from the observation model.
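The block layout of the coefficient matrix in Eq. (3-5) can be illustrated with a minimal NumPy sketch. The function name `build_observation_matrix` and the example blocks are hypothetical and chosen only for illustration; in an application the n × n blocks would come from the geotechnical model, which is not specified in this section.

```python
import numpy as np

def build_observation_matrix(blocks, m):
    """Assemble M^k = [M^1, 0, M^2, 0, ..., M^P, 0] from P blocks of shape (n, n).

    Each zero block has shape (n, m - n) and masks the m - n interpolation
    points, which carry no observations.
    """
    n = blocks[0].shape[0]
    zeros = np.zeros((n, m - n))
    padded = []
    for block in blocks:
        padded.extend([block, zeros])
    return np.hstack(padded)  # shape (n, P * m)

# Hypothetical example: P = 2 parameters, n = 3 observation points, m = 5 points.
n, m = 3, 5
blocks = [np.eye(n), 2.0 * np.eye(n)]  # placeholder blocks, not from the source
M_k = build_observation_matrix(blocks, m)
print(M_k.shape)  # (3, 10)
```

The columns follow the ordering of Z in Eq. (3-1): all m values of the first parameter, then all m values of the second, and so on.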

Given Z and σε², the predicted settlement distribution at any time step k can be represented by the following multivariate normal distribution

$$p\left(Y^k \mid Z, \sigma_\varepsilon^2\right) = (2\pi)^{-n/2} \left|V_\varepsilon\right|^{-1/2} \exp\left[ -\frac{1}{2} \left(Y^k - M^k Z\right)^T V_\varepsilon^{-1} \left(Y^k - M^k Z\right) \right] \tag{3-6}$$
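As a minimal sketch, the logarithm of the density in Eq. (3-6) can be evaluated directly when the error covariance is the scaled identity matrix assumed above. The function name `log_likelihood` and the numbers in the example are illustrative assumptions, not part of the source.

```python
import numpy as np

def log_likelihood(Y_k, M_k, Z, sigma_eps2):
    """Log of Eq. (3-6), assuming V_eps = sigma_eps2 * I (spatially independent errors)."""
    n = Y_k.size
    residual = Y_k - M_k @ Z
    # For V_eps = sigma_eps2 * I: log|V_eps| = n * log(sigma_eps2)
    # and V_eps^{-1} = I / sigma_eps2.
    return -0.5 * (n * np.log(2.0 * np.pi)
                   + n * np.log(sigma_eps2)
                   + residual @ residual / sigma_eps2)

# Illustrative values only: n = 2 observations, one parameter at m = 3 points.
M_k = np.hstack([np.eye(2), np.zeros((2, 1))])
Z = np.array([1.0, 2.0, 3.0])
Y_k = np.array([1.0, 2.0])  # exactly equal to M_k @ Z, so the residual is zero
ll = log_likelihood(Y_k, M_k, Z, sigma_eps2=1.0)
```

Working with the logarithm avoids numerical underflow when the densities for many time steps are multiplied together.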

3.2.2 Prior information model

It is assumed that the prior information of the unknown parameters has the following structure

$$Z = Z_0 + \delta \tag{3-7}$$

where Z0 is the prior mean vector (P·m dimension) at points x1, x2, ... , xm. It can be defined as

$$Z_0 = \left[ z_{0,1}, z_{0,2}, \ldots, z_{0,P} \right]^T \tag{3-8}$$

where

$$z_{0,i} = \left[ z_{0,i}(x_1), z_{0,i}(x_2), \ldots, z_{0,i}(x_m) \right]; \quad i = 1, 2, \ldots, P \tag{3-9}$$

$z_{0,i}(x_j)$ can be generally defined as $z_{0,i}(x_j) = C^i_1 + C^i_2 x'_j + C^i_3 y'_j + C^i_4 x'_j y'_j + \ldots$, depending on the shape of the trend component considered suitable for the specific model parameter. Note that $x'_j$ and $y'_j$ denote the spatial coordinates of point $x_j$, while $C^i_1, C^i_2, C^i_3, \ldots$ represent the constant coefficients of the trend for the corresponding unknown parameter $z_i$ ($i = 1, 2, \ldots, P$). These coefficients can be either deterministic or unknown, depending on the assumption made. In the latter case, the coefficients can be estimated as hyperparameters based on ABIC, which will be presented in Section 3.3.
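The trend form of the prior mean can be sketched as follows, truncated after the bilinear term. The function name `prior_mean_trend` and all coefficient values are hypothetical; the source leaves the choice of trend terms open.

```python
import numpy as np

def prior_mean_trend(coords, coeffs):
    """Evaluate z_0,i(x_j) = C1 + C2*x' + C3*y' + C4*x'*y' at every point.

    coords: array of shape (m, 2) holding (x', y') for each point x_j.
    coeffs: the four trend coefficients (C1, C2, C3, C4) of one parameter.
    """
    x, y = coords[:, 0], coords[:, 1]
    c1, c2, c3, c4 = coeffs
    return c1 + c2 * x + c3 * y + c4 * x * y

# Illustrative values: m = 2 points, P = 2 parameters.
coords = np.array([[0.0, 0.0], [1.0, 2.0]])
coeff_list = [(1.0, 0.5, 0.25, 0.1),  # trend of parameter z_1 (made up)
              (3.0, 0.0, 0.0, 0.0)]   # constant prior mean for z_2 (made up)

# Stack the per-parameter means in the order of Eq. (3-8).
Z0 = np.concatenate([prior_mean_trend(coords, c) for c in coeff_list])
```

Setting all coefficients except the first to zero recovers a spatially constant prior mean.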

δ represents the uncertainty of the prior mean of the unknown parameters which is assumed to follow N(0, VZ) where VZ is a covariance matrix. By introducing the spatial correlation structure in the formulation of VZ, we have

$$V_Z = \begin{bmatrix} \sigma_{z_1}^2 V_C & & & \\ & \sigma_{z_2}^2 V_C & & \\ & & \ddots & \\ & & & \sigma_{z_P}^2 V_C \end{bmatrix} \tag{3-10}$$

where $\sigma_{z_1}^2, \sigma_{z_2}^2, \ldots, \sigma_{z_P}^2$ represent the prior variances of the unknown parameters $z_1, z_2, \ldots, z_P$, respectively. These variances can also be assumed to be either deterministic or unknown and, in the same way as the prior means, they can be estimated as hyperparameters based on ABIC (see Section 3.3 for details). $V_C$ is the auto-covariance matrix, which is defined as

$$V_C = \begin{bmatrix} 1 & \cdots & \rho\left(|x_1 - x_m|\right) \\ \vdots & \ddots & \vdots \\ \rho\left(|x_m - x_1|\right) & \cdots & 1 \end{bmatrix} \tag{3-11}$$

$\rho(|x_i - x_j|)$ denotes the auto-correlation function, where $x_i$ and $x_j$ are spatial coordinate vectors. Several analytical expressions have been proposed for the auto-correlation function but, in fact, none of them can claim any fundamental basis (Vanmarcke 1977a). The exponential type auto-correlation function is chosen for the current study because it is commonly used in geotechnical applications (e.g. Vanmarcke 1977a, Fenton and Griffiths 2002, Griffiths and Fenton 2004). The function is given as

$$\rho\left(|x_i - x_j|\right) = \exp\left[ -|x_i - x_j| / \eta \right] \tag{3-12}$$

where η is the auto-correlation distance. This parameter is assumed to be the same in all directions in the horizontal plane, which implies that the anisotropy of the soil is not considered in this case. In addition, it should be kept in mind that this type of auto-correlation function is chosen only as an example for an application of the proposed method. In practice, several types of auto-correlation functions may be tested, and the one that best fits the observations should be used.
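The prior covariance structure of Eqs. (3-10) to (3-12) can be assembled in a few lines of NumPy, as sketched below. The function names, coordinates, and numerical values are illustrative assumptions. The Kronecker product reproduces the block-diagonal form because every parameter shares the same correlation matrix.

```python
import numpy as np

def exponential_correlation_matrix(coords, eta):
    """V_C of Eq. (3-11) with rho(d) = exp(-d / eta) from Eq. (3-12).

    coords: array of shape (m, 2); eta: isotropic auto-correlation distance.
    """
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.linalg.norm(diff, axis=-1)  # pairwise |x_i - x_j|
    return np.exp(-dist / eta)

def prior_covariance(coords, eta, prior_variances):
    """V_Z of Eq. (3-10): block-diagonal with blocks sigma_zi^2 * V_C."""
    V_C = exponential_correlation_matrix(coords, eta)
    return np.kron(np.diag(prior_variances), V_C)

# Illustrative values: m = 3 points, P = 2 parameters.
coords = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 2.0]])
V_C = exponential_correlation_matrix(coords, eta=2.0)
V_Z = prior_covariance(coords, eta=2.0, prior_variances=[1.0, 4.0])
print(V_Z.shape)  # (6, 6)
```

Testing a different auto-correlation function, as the text suggests, would only require replacing the last line of `exponential_correlation_matrix`.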

From the above definitions, it is clear that the spatial correlation structure is included in the form of the spatial correlation of unknown parameters, which relate to soil properties, instead of that of ground behavior. The authors believe that this is the most suitable way to introduce the spatial correlation structure to the geotechnical model due to the fact that the physical correlation of the observed ground behavior actually results from the spatial correlation of soil properties.

It should also be noted that, for the sake of simplicity, two important assumptions about the correlation structure are made in formulating the above covariance matrix (VZ). Firstly, the unknown parameters, z1, z2, ... , zP, are assumed to be independent of each other. Secondly, the correlation structures of these parameters are identical, meaning that they share the same auto-correlation distance. These assumptions can be relaxed without major changes to the formulation if enough observation data are available to specify the spatial correlation in detail.

Given η, prior means, and prior variances of the unknown parameters, the prior distribution of the unknown parameters is also assumed as a multivariate normal distribution of the following form

$$p\left(Z \mid \eta\right) = (2\pi)^{-(P \cdot m)/2} \left|V_Z\right|^{-1/2} \exp\left[ -\frac{1}{2} \left(Z - Z_0\right)^T V_Z^{-1} \left(Z - Z_0\right) \right] \tag{3-13}$$

3.2.3 Bayesian estimation

Suppose that the set of observations Yk at the discrete time step k = 1, 2, ... , K has already been obtained. By employing Bayes’ theorem, the posterior distribution of the state vector Z can be formulated as

$$p\left(Z \mid Y, \sigma_\varepsilon^2, \eta\right) = c \cdot p\left(Z \mid \eta\right) \prod_{k=1}^{K} p\left(Y^k \mid Z, \sigma_\varepsilon^2\right) \tag{3-14}$$

where Y denotes the set of all observation data, i.e. Y = (Y1, Y2, ... , YK), and c denotes the normalizing constant. By substituting Eq. (3-6) and (3-13) into the above equation, we have

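$$p\left(Z \mid Y, \sigma_\varepsilon^2, \eta\right) = c \cdot (2\pi)^{-[P \cdot m + K \cdot n]/2} \left|V_Z\right|^{-1/2} \left|V_\varepsilon\right|^{-K/2} \cdot \exp\left\{ -\frac{1}{2} \left[ \left(Z - Z_0\right)^T V_Z^{-1} \left(Z - Z_0\right) + \sum_{k=1}^{K} \left(Y^k - M^k Z\right)^T V_\varepsilon^{-1} \left(Y^k - M^k Z\right) \right] \right\} \tag{3-15}$$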

The Bayesian estimator of $Z$, i.e. $\hat{Z}$, is the one that maximizes the above posterior distribution. Because $Z$ appears only in the exponent, this is equivalent to minimizing the following objective function

$$J\left(Z \mid \sigma_\varepsilon^2, \eta\right) = \left(Z - Z_0\right)^T V_Z^{-1} \left(Z - Z_0\right) + \sum_{k=1}^{K} \left(Y^k - M^k Z\right)^T V_\varepsilon^{-1} \left(Y^k - M^k Z\right) \tag{3-16}$$

It should be noted that σε² and η are assumed to be given in this case. The Bayesian method, however, does not provide a rational way to determine these values. In order to choose the most appropriate values of σε² and η based on the information in hand, Akaike’s Bayesian Information Criterion (ABIC) is introduced and presented in the next section.
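Because the objective function of Eq. (3-16) is quadratic in Z under the linear observation model, setting its gradient to zero gives a linear system, (V_Z⁻¹ + Σ_k (M^k)ᵀ V_ε⁻¹ M^k) Ẑ = V_Z⁻¹ Z_0 + Σ_k (M^k)ᵀ V_ε⁻¹ Y^k. The sketch below solves this system with NumPy for given σε² and η. The source does not state how the minimization is actually carried out, so this is one possible approach under the stated assumptions; the function names and all numbers are synthetic and purely illustrative.

```python
import numpy as np

def objective(Z, Z0, V_Z, Ys, Ms, sigma_eps2):
    """Evaluate J of Eq. (3-16), assuming V_eps = sigma_eps2 * I."""
    d = Z - Z0
    J = d @ np.linalg.solve(V_Z, d)
    for Y_k, M_k in zip(Ys, Ms):
        r = Y_k - M_k @ Z
        J += r @ r / sigma_eps2
    return J

def bayesian_estimate(Z0, V_Z, Ys, Ms, sigma_eps2):
    """Minimize J by solving the normal equations of the quadratic form."""
    V_Z_inv = np.linalg.inv(V_Z)
    A = V_Z_inv.copy()
    b = V_Z_inv @ Z0
    for Y_k, M_k in zip(Ys, Ms):
        A += M_k.T @ M_k / sigma_eps2
        b += M_k.T @ Y_k / sigma_eps2
    return np.linalg.solve(A, b)

# Synthetic example: P = 1 parameter, n = 2 observation points, m = 3 points
# on a line, K = 2 time steps. None of these values come from the source.
x = np.array([0.0, 1.0, 2.0])
eta, sigma_z2, sigma_eps2 = 1.5, 0.25, 0.01
V_Z = sigma_z2 * np.exp(-np.abs(x[:, None] - x[None, :]) / eta)
Z0 = np.ones(3)
Ms = [k * np.hstack([np.eye(2), np.zeros((2, 1))]) for k in (1, 2)]
Ys = [np.array([1.2, 0.9]), np.array([2.5, 1.7])]

Z_hat = bayesian_estimate(Z0, V_Z, Ys, Ms, sigma_eps2)
print(Z_hat)
```

The third entry of `Z_hat` belongs to an interpolation point with no observations. It is updated from its prior mean only through the spatial correlation in V_Z, which is how the formulation enables local estimation at unobserved locations.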

Source document: A spatial-temporal random process for geotechnical design based on observation method (pages 50-55)