7. Neural Networks
Supplement
Forward Propagation
・ Input: φ_0   Vector (Vocab)
・ Hidden layer: w_0 Matrix (2, Vocab), b_0 Vector (2)
・ φ_1 = tanh( w_0 φ_0 + b_0 )   Vector (2)
Forward Propagation
・ Output layer: w_1 Matrix (1, 2), b_1 Vector (1)
・ φ_2 = tanh( w_1 φ_1 + b_1 )   Vector (1)
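The two layers above can be sketched in NumPy; the function name forward_nn matches the pseudocode later in this supplement, and the shapes follow the slide (net[i] = (w, b)). Vocab = 3 is an illustrative assumption.

```python
import numpy as np

def forward_nn(net, phi0):
    """Return [phi_0, phi_1, phi_2], where phi_{i+1} = tanh(w_i phi_i + b_i)."""
    phis = [np.asarray(phi0, dtype=float)]
    for w, b in net:
        phis.append(np.tanh(w @ phis[-1] + b))
    return phis

# shapes as in the slide, with Vocab = 3 for illustration
net = [(np.zeros((2, 3)), np.zeros(2)),   # w_0 Matrix (2, Vocab), b_0 Vector (2)
       (np.zeros((1, 2)), np.zeros(1))]   # w_1 Matrix (1, 2),     b_1 Vector (1)
phis = forward_nn(net, [1, 0, 1])
```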
Hyperbolic Tangent
・ tanh(x) = (e^x − e^(−x)) / (e^x + e^(−x))
・ tanh′(x) = [ (e^x + e^(−x))² − (e^x − e^(−x))² ] / (e^x + e^(−x))²
           = 1 − (e^(2x) − 2 + e^(−2x)) / (e^x + e^(−x))²
           = 1 − (e^x − e^(−x))² / (e^x + e^(−x))²
           = 1 − tanh²(x)
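The identity tanh′(x) = 1 − tanh²(x) can be checked numerically with a central difference; the point x and step h are arbitrary choices.

```python
import math

x, h = 0.5, 1e-6
numeric = (math.tanh(x + h) - math.tanh(x - h)) / (2 * h)  # central difference
analytic = 1 - math.tanh(x) ** 2                           # the identity above
```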
Why is the gradient used?
・ err = (y′ − y)² / 2
・ Minimizing the err function:
・ derr/dw = 0 → w is ideal
・ derr/dw > 0 → w is too big (w −= λ derr/dw → w will be decreased)
・ derr/dw < 0 → w is too small (w −= λ derr/dw → w will be increased)
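The update rule w −= λ derr/dw can be seen on a one-dimensional example; err(w) = (w − 3)² / 2 is an illustrative stand-in, so derr/dw = w − 3 and w moves toward the minimizer 3 from either side.

```python
# illustrative 1-D error: err(w) = (w - 3)**2 / 2, so derr/dw = w - 3
w, lam = 0.0, 0.1          # start below the minimizer: derr/dw < 0, so w grows
for _ in range(200):
    w -= lam * (w - 3)     # w -= lam * derr/dw
```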
Back Propagation
・ φ_2   Vector (1)
・ err = (y′ − φ_2)² / 2
・ δ_2 = derr/dφ_2 = φ_2 − y′
・ δ′_2 = derr/d(w_1 φ_1 + b_1)
       = (derr/dφ_2) (dφ_2/d(w_1 φ_1 + b_1))
       = δ_2 (1 − φ_2²)   Vector (1)
Back Propagation
・ φ_1   Vector (2)
・ δ_1 = derr/dφ_1
       = (derr/d(w_1 φ_1 + b_1)) (d(w_1 φ_1 + b_1)/dφ_1)
       = δ′_2 w_1
・ δ′_2 Vector (1), w_1 Matrix (1, 2), δ_1 Vector (2)
Back Propagation
・ δ′_1 = derr/d(w_0 φ_0 + b_0)
       = (derr/dφ_1) (dφ_1/d(w_0 φ_0 + b_0))
       = δ_1 (1 − φ_1²)
・ δ′_1   Vector (2)
Back Propagation
・ w_0 Matrix (2, Vocab), b_0 Vector (2)
・ δ_0 = derr/dφ_0
       = (derr/d(w_0 φ_0 + b_0)) (d(w_0 φ_0 + b_0)/dφ_0)
       = δ′_1 w_0
・ δ′_1   Vector (2)
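The backward steps above can be collected into one backward_nn sketch (the name comes from the pseudocode later in this supplement); it returns the δ′ vectors that the weight updates need. The zero-weight example is illustrative.

```python
import numpy as np

def backward_nn(net, phis, label):
    # output layer: derr/dphi_last = phi_last - y'
    # then delta'_{i+1} = delta_{i+1} * (1 - phi_{i+1}**2), delta_i = delta'_{i+1} w_i
    delta_p = [None] * (len(net) + 1)
    delta = phis[-1] - label
    for i in reversed(range(len(net))):
        delta_p[i + 1] = delta * (1 - phis[i + 1] ** 2)
        delta = delta_p[i + 1] @ net[i][0]
    return delta_p

# zero-weight example with Vocab = 3: all hidden/output activations are 0
net = [(np.zeros((2, 3)), np.zeros(2)), (np.zeros((1, 2)), np.zeros(1))]
phis = [np.array([1.0, 0.0, 1.0]), np.zeros(2), np.zeros(1)]
delta_p = backward_nn(net, phis, 1.0)
```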
Update Weights
・ w_0 Matrix (2, Vocab), b_0 Vector (2), φ_0 Vector (Vocab), δ′_1 Vector (2)
・ w_0 −= λ derr/dw_0 = λ δ′_1 (d(w_0 φ_0 + b_0)/dw_0) = λ δ′_1 φ_0ᵀ
・ b_0 −= λ derr/db_0 = λ δ′_1 (d(w_0 φ_0 + b_0)/db_0) = λ δ′_1
Update Weights
・ w_1 Matrix (1, 2), b_1 Vector (1), φ_1 Vector (2), δ′_2 Vector (1)
・ w_1 −= λ derr/dw_1 = λ δ′_2 (d(w_1 φ_1 + b_1)/dw_1) = λ δ′_2 φ_1ᵀ
・ b_1 −= λ derr/db_1 = λ δ′_2 (d(w_1 φ_1 + b_1)/db_1) = λ δ′_2
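Both update slides follow the same pattern, w_i −= λ δ′_{i+1} φ_iᵀ and b_i −= λ δ′_{i+1}, so one function covers them; the name update_weights matches the pseudocode later in this supplement, and the one-layer example is illustrative.

```python
import numpy as np

def update_weights(net, phis, delta_p, lam):
    # w_i -= lam * delta'_{i+1} phi_i^T ;  b_i -= lam * delta'_{i+1}
    for i, (w, b) in enumerate(net):
        w -= lam * np.outer(delta_p[i + 1], phis[i])
        b -= lam * delta_p[i + 1]

# tiny one-layer example: w Matrix (1, 2), b Vector (1)
net = [(np.zeros((1, 2)), np.zeros(1))]
phis = [np.array([1.0, 2.0]), np.zeros(1)]
delta_p = [None, np.array([1.0])]
update_weights(net, phis, delta_p, lam=0.5)
```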
Saving Models
・ Saving the dict to a file
・ Saving the network to a file
    network = list [ net[0], net[1], …, net[i] ]
    net[i] = tuple ( w, b )
    w, b = array ([[…],[…], …, […]])
How to save to a file?
・ Write each key, value pair in the dict
Serializer
・ A serializer converts an object hierarchy into a byte stream
・ Saving
・ Loading
・ The network can be saved easily!
import pickle
pickle.dump(network, file_object)  # object first, then the file
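A complete round trip looks like this; "network.pkl" is a hypothetical filename, and the one-layer network is a stand-in for the real net.

```python
import pickle

network = [([[0.1, 0.2]], [0.0])]         # stand-in for [ (w, b), ... ]
with open("network.pkl", "wb") as f:      # "network.pkl" is a hypothetical name
    pickle.dump(network, f)               # object first, then the file
with open("network.pkl", "rb") as f:
    restored = pickle.load(f)
```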
Create Feature
CREATE_FEATURES(x):
    create list phi (len = len(ids))
    split x into words
    for word in words:
        # Training
        phi[ids["UNI:" + word]] += 1
        # Testing
        if "UNI:" + word in ids:
            phi[ids["UNI:" + word]] += 1
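The pseudocode above can be written in Python; the train flag that switches between the two branches is an assumption about how the two modes would be combined into one function.

```python
from collections import defaultdict

ids = defaultdict(lambda: len(ids))   # feature name -> id, grows during training

def create_features(x, train=True):
    # the `train` flag combining the two branches is an assumption
    words = x.split()
    if train:
        for word in words:
            ids["UNI:" + word]        # assign an id to any new feature
    phi = [0] * len(ids)
    for word in words:
        if "UNI:" + word in ids:      # at test time, skip unseen features
            phi[ids["UNI:" + word]] += 1
    return phi
```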
Training
create defaultdict ids, array feat_lab
for each labeled pair x, y in the data:
    add (create_features(x), y) to feat_lab
get len(ids)
initialize net randomly
for I iterations:
    for each labeled pair φ0, y in feat_lab:
        φ = forward_nn(net, φ0)
        δ′ = backward_nn(net, φ, y)
        update_weights(net, φ, δ′, λ)
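A runnable end-to-end sketch of this training loop, restating minimal forward_nn / backward_nn / update_weights implementations so it stands alone; the toy data, vocab size 2, iteration count, and λ are illustrative assumptions.

```python
import numpy as np

def forward_nn(net, phi0):
    phis = [np.asarray(phi0, dtype=float)]
    for w, b in net:
        phis.append(np.tanh(w @ phis[-1] + b))   # phi_{i+1} = tanh(w_i phi_i + b_i)
    return phis

def backward_nn(net, phis, y):
    delta_p = [None] * (len(net) + 1)
    delta = phis[-1] - y                         # derr/dphi_last = phi - y'
    for i in reversed(range(len(net))):
        delta_p[i + 1] = delta * (1 - phis[i + 1] ** 2)
        delta = delta_p[i + 1] @ net[i][0]       # propagate back through w_i
    return delta_p

def update_weights(net, phis, delta_p, lam):
    for i, (w, b) in enumerate(net):
        w -= lam * np.outer(delta_p[i + 1], phis[i])
        b -= lam * delta_p[i + 1]

rng = np.random.default_rng(0)
net = [(rng.uniform(-0.1, 0.1, (2, 2)), np.zeros(2)),   # w_0, b_0 (Vocab = 2)
       (rng.uniform(-0.1, 0.1, (1, 2)), np.zeros(1))]   # w_1, b_1
feat_lab = [([1, 0], 1.0), ([0, 1], -1.0)]              # toy labeled pairs

def total_err(net):
    return sum((forward_nn(net, phi0)[-1][0] - y) ** 2 / 2 for phi0, y in feat_lab)

err_before = total_err(net)
for _ in range(100):                                    # I iterations
    for phi0, y in feat_lab:
        phis = forward_nn(net, phi0)
        update_weights(net, phis, backward_nn(net, phis, y), lam=0.1)
err_after = total_err(net)
```

The gradient steps drive the squared error down over the iterations.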
Testing
read ids from id_file
read net from weights_file
for each x in the data:
    φ0 = create_features(x)
    φ = forward_nn(net, φ0)
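The loop above stops at the forward pass; one way to turn φ into a label is to take the sign of the final output, assuming binary labels in {+1, −1} (the slides do not spell this step out). The one-layer net here is illustrative.

```python
import numpy as np

def predict_one(net, phi0):
    # run the forward pass, then take the sign of the final output
    # (assumption: binary labels in {+1, -1})
    phi = np.asarray(phi0, dtype=float)
    for w, b in net:
        phi = np.tanh(w @ phi + b)
    return 1 if phi[0] >= 0 else -1

# one-layer illustration: weight +1 on the first feature, -1 on the second
net = [(np.array([[1.0, -1.0]]), np.zeros(1))]
```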