• 検索結果がありません。

3.3 Category quantification method

3.3.1 NPCA-based method

3.3.1.2 Algorithm

Here, we explain the algorithm of the NPCA-based method using the K-means type constraint connector matrix. When we set rbx=rby, rcx =rcy, cb =rbx, cc =rcx, Dx = I, and Dy = I, objective function gccca equals objective function gnpc. Therefore, the NPCA-based method is a special case of using theK-means type constrained connector matrix method.

Proposition 3.13.When we setZ(x)= [X1Q](1)J

x,Kx andZ(y)= [Y1W](1)J

y,Ky. The update

formula of Bx, By, Cx, and Cy is obtained as follows:

Bx =UbxVbx (3.58)

By =UbyVby (3.59)

Cx =UcxVcx (3.60)

Cy =UcyVcy (3.61)

where Ubx and Vbx are left and right singular matrixes of Z2(x)(CxI)F2(x). Uby and Vby are left and right singular matrixes of

Z2(y)(CyI)F2(y). Ucx and Vcx are left and right singular matrixes of

Z3(x)(BxI)F3(x). Ucy andVcy are left and right singular matrixes of

Z2(y)(ByI)F3(y).

Proof. First, we explain the update formula of Bx. The term related to Bx is the first term. The first term of objective functiongccca is rewritten as follows:

X1QF1(x)(CxBx)2 =Z1(x)F1(x)(CxBx)2

=∥Z2(x)BxF2(x)(CxI)2

=tr(Z2(x)Z2(x))2tr(Z2(x)BxF2(x)(CxI)) + tr((CxI)F2(x)BxBxF2(x)(CxI))

=tr(Z2(x)Z2(x))2tr(BxF2(x)(CxI)Z2(x)) + tr((CxI)F2(x)F2(x)(CxI)).

Given parameters exceptBx, we obtain the update formula of Bx by maximizing

tr(BxF2(x)(CxI)Z2(x)). From the TenBerge theorem (ten Berge, 1993), tr(BxF2(x)(Cx I)Z2(x)) tr(D)holds. D is the diagonal matrix whose elements are singular values of V DU =F2(x)(CxI)Z2. When we set Bx =U V, the equation holds. Therefore, we obtain the update formula. The update formula ofBy is obtained in the same way asBx. Next, we explain the update formula of Cx, which is obtained in a very similar way to Bx. The term related to Cx is the first term. The first term of objective functiongccca is rewritten as follows:

X1QF1(x)(CxBx)2 =Z1(x)F1(x)(CxBx)2

=∥Z3(x)CxF3(x)(BxI)2

=tr(Z3(x)Z3(x))2tr(Z3(x)CxF3(x)(BxI)) + tr((BxI)F3(x)CxCxF3(x)(BxI))

=tr(Z3(x)Z3(x))2tr(BxF3(x)(CxI)Z3(x)) + tr((BxI)F3(x)F3(x)(BxI)).

Given another parameter exceptCx, we obtain the update formula of Cx by maximizing tr(CxF3(x)(BxI)Z3(x)). From the TenBerge theorem, tr(CxF3(x)(BxI)Z3(x))tr(D) holds. D is a diagonal matrix whose elements are singular values ofV DU =F3(x)(Bx I)Z3. When we set Cx = U V, the equation holds. Therefore, we obtain the update formula. The update formula ofCy is obtained in the same way asCx.

Proposition 3.14. The update formulas of F(x) and F(y) are obtained as follows:

F1(x) =(X1Q(CxBx) +F1(y)DyDx)(I+DxDx)1, (3.62) F1(y) =(Y1W(CyBy) +F1(x)DxDy)(I +DyDy)1. (3.63) Proof. First, we explain about the update formula ofF1(x).

∥X1QF1(x)(CxBx)2+∥F1(x)DxF1(y)Dy2

=2tr(F1(x)(CxBx)QX1)2tr(F1(x)F1(y)DyDx)

+ tr(F1(x)DxDxF1(x)) + tr(F1(x)(CxCxBxBx)F1(x)) + const.,

where const. is constant independence fromF1(x). Thus, the partial derivative function of gccca with respect toF1(x) is obtained as follows:

∂F1(x){∥X1QF1(x)(CxBx)2+F1(x)DxF1(y)Dy2}

=2X1Q(CxBx)2F1(y)DyDx+ 2F1(x)DxDx + 2F1(x).

When we set the partial derivative function of gccca with respect to F1(x) as 0, we obtain the following equation:

2X1Q(CxBx)2F1(y)DyDx + 2F1(x)DxDx + 2F1(x) = 0

⇐⇒F1(x)DxDx +F1(x)=X1Q(CxBx) +F1(y)DyDx

⇐⇒F1(x)= (X1Q(CxBx) +F1(y)DyDx)(I+DxDx)1. The update formula ofF1(y) is obtained in the same way asF1(x).

∥Y1W F1(y)(CyBy)2+∥F1(x)DxF1(y)Dy2

=2tr(F1(y)(CyBy)WY1)2tr(F1(y)F1(x)DxDy)

+ tr(F1(y)DyDyF1(y)) + tr(F1(y)(CyCyByBy)F1(y)) + const.,

where const. is constant independence fromF1(y). Thus, the partial derivative function of gccca with respect toF1(y) is obtained as follows:

∂F1(y){∥Y1W F1(y)(CyBy)2+F1(x)DxF1(y)Dy2}

=2Y1W(CyBy)2F1(x)DxDy + 2F1(y)DyDy + 2F1(y).

When we set the partial derivative function ofF1(y)as 0, we obtain the Following equation:

2Y1W(CyBy)2F1(x)DxDy+ 2F1(y)DyDy+ 2F1(y)= 0

⇐⇒F1(y)DyDy +F1(y)=Y1W(CyBy) +F1(y)DxDy

⇐⇒F1(y)= (Y1W(CyBy) +F1(x)DxDy)(I+DyDy)1.

Proposition 3.15. The update formula ofqkxjx is obtained as follows:

qkxjx = I(Xk

xjx

Xk

xjx)12u(qx)1 , (3.64) where u(qx)1 is the first dimension left singular vector of

(Xk

xjx

Xk

xjx)12Xk

xjx

Jn(∑rcx

c(x)k

xF(ℓ)(x)b(x)j

x ). c(x)k

x is (kx, ℓ) element of Cx. b(x)j

x is the jx-th row vector of Bx. F(ℓ)(x) is the matrix corresponding to dimensionℓ of Cx.

The update formula of wkyjy is obtained as follows:

wkyjy = I(Yk

yjy

Yk

yjy)12u(wy)1 , (3.65) where u(wx)1 is the first dimension left singular vector of

(Yk

yjy

Yk

yjy)12Yk

yjy

Jn(∑rcy

c(y)k

yF(ℓ)(y)b(y)jy ). c(y)k

y is(ky, ℓ) element ofCy. b(y)jy is thejx-th row vector ofBy. F(ℓ)(y) is the matrix corresponding to dimensionℓ of Cy

Proof. First, we explain about the update formula of qkxjx. From definition Q, qkxjx are independent from each other. Thus, the update formula of qkxjx can be calculated individually. The term that is related toqkxjx is the first term of gccca. The first term of gccca is rewritten as follows:

∥X1QF1(x)(CxBx)2=−2tr(QX†′1F1(x)(CxBx)) + const. (3.66) From equation (3.66), we consider the minimization problem as theQ that maximizes tr(QX†′1F1(x)(CxBx)). From the definition ofQ, in order to maximize tr(QX†′1F1(x)(Cx Bx)), we consider each value of qkxjx. Objective function g for qkxjx is obtained as fol-lows:

g(qkxjx |Cx, Bx, Fx, X) = tr(qkxjxXk

xjx

rcx

c(x)k

xF(ℓ)(x)b(x)j

x ),

From the constraint on qkxjx, this objective function g is very similar to the objective function of canonical correlation analysis. From the constraintXj

xkxqjxkx =J Xj

xkxqjxkx, Xj

xkxqjxkx is the element of complementary space of 1. Therefore, first, the Xj

xkx is projectedJ space. Then, we search the parameters maximizingg. Thus, we change the objective functiong tog1 as follows:

g1(qkxjx |Cx, Bx, Fx, X) = tr(qkxjxXk

xjx

J

rcx

c(x)k

xF(ℓ)(x)b(x)jx ).

When we set qkxjx = 1 I(Xk

xjx

Xk

xjx)1/2qkxjx, equation qkxjxqkxjx = 1 holds from the constraint case ofgccca. Therefore, we can rewriteg1 as follows:

g1(qkxjx |Cx, Bx, Fx, X) =

Itr(qkxjx(Xk

xjx

Xk

xjx)12Xk

xjx

J

rcx

c(x)k

xF(ℓ)(x)b(x)j

x ).

g1 is the same as the objective function of the weight matrix in the categorical canonical covariance case. Therefore, we obtain the update formula ofqkxjx as

qkxjx =u(qx)1 , whereu(qx)1 is the first dimension left singular vector of (Xk

xjx

Xk

xjx)12Xk

xjx

Jrcx

c(x)k

xF(ℓ)(x)b(x)j

x . From the definition of qk

xjx, the update for-mula ofqkxjx is obtained as follows:

qkxjx = I(Xk

xjx

Xk

xjx)12u(qx)1 .

The update formula of wkyjy is obtained in the same way asqkxjx.

The update formulas of Dx and Dy are the same as the K-means algorithm.

Proposition 3.16. The update formulas of Dbx, Dby, Dcx, and Dcy are obtained as follows:

d(bx)ℓq =



 1

(

= arg min

[F2(x)(DcxI)]d(by)q F2(y)(DcyI) ) 0 (otherwise)

(q= 1, 2, . . . , cb),

(3.67)

d(by)ℓq =



 1

(

= arg min

[F2(y)(DcyI)]d(bx)q F2(x)(DcxI) ) 0 (otherwise)

(q= 1, 2, . . . , cb),

(3.68)

d(cx)ℓq =



 1

(

= arg min

[F3(x)(DbxI)]d(cy)q F3(y)(DbyIn) ) 0 (otherwise)

(q= 1, 2, . . . , cc),

(3.69)

d(cy)ℓq =



 1

(

= arg min

[F3(y)(DbyI)]d(cx)q F3(x)(DbxIn) ) 0 (otherwise)

(q= 1, 2, . . . , cc),

(3.70)

where d(bx)ℓq , d(by)ℓq , d(cx)ℓq and d(cy)ℓq are the (ℓ, q) element of Dbx, Dby, Dcx, and Dcy, respectively. [A] is the ℓ-th column vector ofA.

Proof. From the definition ofgccca, the third term depends on Dbx, Dby, Dcx, and Dcy. We rewrite the third term ofgccca as follows:

F1(x)DxF1(y)Dy2 =F1(x)(DcxDbx)F1(y)(DcyDby)2

=DbxF2(x)(DcxI)Dby F2(y)(DcxI)2 (3.71)

=DcxF3(x)(DbxI)Dcy F3(y)(DbyI)2 (3.72) From the constraint of Dbx, Dby, equation (3.71) is equivalent to K-means given other parameters. Thus, the update formulas of Dbx, Dby are obtained in the same way as K-means.

From the constraint ofDcx, Dcy, equation (3.72) is equivalent toK-means given other parameters. Thus, the update formulas of Dcx, Dcy are obtained in the same way as K-means.

Summarizing the update formulas, we obtain the algorithm of categorical canonical covariance analysis for three-mode three-way data as algorithm 6.

Algorithm 6 Algorithm of the NPCA-based method

Set the number of dimensionsrbx, rby, rcx, rcy, cb, cc, and stop conditionε Set initial valuesBx(0), Cx(0), By(0), Cy(0), F(0)x , F(0)y , Dbx(0), D(0)by,D(0)cx,and Dcy(0), t←0

S(0)←gccca(F(0)x , F(0)y , Bx(0), By(0), Cx(0), Cy(0), Q(0), W(0), D(0)x , Dy(0)|X, Y) repeat

t←t+ 1

Update Bx and By using Cx(t1), F(tx1),Cy(t1), F(ty1) Update Cx and Cy using Bx(t), F(tx1),By(t), F(ty1)

Update Fx using Bx(t), By(t),Cx(t), Cy(t), F(ty1),Dx(t1), Dy(t1) Update Fy using Bx(t), By(t),Cx(t), Cy(t), F(t)x ,Dx(t1), D(ty1) Update Dbx usingB(t)x , B(t)y ,Cx(t), Cy(t), F(t)x ,F(t)y , D(tcx1), D(ty1) Update Dcx using Bx(t), By(t),Cx(t), Cy(t), F(t)x ,F(t)y , D(t)bx, D(ty1) Update Dby using Bx(t), By(t),Cx(t), Cy(t), F(t)x ,F(t)y , Dx(t), Dcy(t1) Update Dby using Bx(t), By(t),Cx(t), Cy(t), F(t)x ,F(t)y , Dx(t), Dby(t)

S(t)←gccca(F(t)x , F(t)y , Bx(t), By(t), Cx(t), Cy(t), Q(t), W(t), Dx(t), Dy(t)|X, Y) until |S(t1)−S(t)| ≤ε

Chapter 4

Simulation studies

In this chapter, we compare canonical covariance analysis for three-mode three-way data with that for two-mode two-way data using several evaluations. In the K-means based method case, we compare our proposed method with two-mode two-way canonical covariance analysis and no connector matrixes with the mean squared loss between the true and estimated weight matrixes. In the regression-based method case, we compare our proposed method with two-mode two-way methods and three-mode three-way regression using mean squared loss of prediction. In the quantification method case, we compare our proposed method with two-mode two-way canonical covariance analysis, no connector matrixes, and no quantification method using the mean squared loss between the true and estimated weight matrixes.

4.1 Constrained connector method

In this section, we describe a numerical example for the contained connector method.

The purpose of introducing the constraint for parameters is different between theK-means type and the regression type. Therefore, we separate these two types. In theK-means type situation, we evaluate the squared error of parameters. In the regression type situation, we focus on the prediction error.

関連したドキュメント