• 検索結果がありません。

An Empirical Study of Contextual Data Augmentation for Japanese Zero Anaphora Resolution

N/A
N/A
Protected

Academic year: 2021

シェア "An Empirical Study of Contextual Data Augmentation for Japanese Zero Anaphora Resolution"

Copied!
1
0
0

読み込み中.... (全文を見る)

全文

(1)

キズ は 直り ませ ん が 、

!-NOM

ほとんど 目立た なく なって

The scratches can’t be repaired, but ! are becoming barely noticeable …

scratches-TOP be-fixed not but , !-NOM almost noticeable not become,…

refer to

There are no scratches, but ! doesn’t stand out anymore …

キズ は あり ませ ん が 、 null

!-NOM

ほとんど 目立た なく なって

scratches-TOP exist not but , !-NOM almost noticeable not become,…

Particle

Noun Verb

訴えた

man ACC sued

[MASK] 訴えた

[MASK] ACC sued

Original

Pseudo

[MASK]

man ACC [MASK] Pretrained

Masked LM MASK

boy-TOP少年は man-ACC男を 殴ったが、struck-but 無罪となった。was acquitted.

The boy struck the man, but ! was acquitted.

!!-NOM

!!-NOM

NOM None None

NOMNone None

訴えた

man ACC sued MASK man ACC [MASK][MASK] man ACC struck殴った

Original

Pretrained Masked LM

Pseudo

Pretrained

Masked LM

Cannot control masked position

boy-TOP 少年は man-ACC 男を 訴えたが、 sued-but 無罪となった。 was acquitted.

The boy sued the man, but ! was acquitted.

!!-NOM

!!-NOM

refer to

An Empirical Study of Contextual Data Augmentation for Japanese Zero Anaphora Resolution

Ryuto Konno

1

, Yuichiroh Matsubayashi

1,2

, Shun Kiyono

2,1

, Hiroki Ouchi

2

, Ryo Takahashi

1,2

, Kentaro Inui

1,2

1Tohoku University 2RIKEN

Summary

• We proposed data augmentation (DA) for zero anaphora resolution (ZAR)

• We augmented labeled data by replacing tokens using language model (LM)

• We improved the performance of ZAR and analyzed the phenomenon in DA

Background Automatic recognition of omitted arguments of a predicate

Task: Zero Anaphora Resolution (ZAR)

Results and analysis

Model ALL NOM ACC DAT

Matsubayashi&Inui’18 55.55 57.99 48.9 23 BASELINE 63.89 66.45 57.2 27 Contextual DA 63.87 66.16 58.5 29 MASKING (all-but-verb) 64.15 66.60 57.9 29

Controlling masked position using POS tags

Wrong label !

Halving the computational cost (one-pass running)

A masked LM has to be run twice

※ Underline: a target predicate

Model (Masking target) ZAR

BASELINE 64.08

All POS 64.89

Only verb 64.15

All POS except for verb 65.02

MASKING improved the performance Masking all POS categories

except for verb is the best score

Masking verb may produce the bad instance Table2: F1 score on dev set

Table1: F1 score of ZAR on test set Original

replacing tokens

clone

Pseudo

NOM None None

Label-1: None

boy-TOP少年は criminal-ACC犯人を 訴えたが、sued-but was acquitted.無罪となった。

Input-1:

NOM None None Label: None

boy-TOP少年は man-ACC男を 訴えたが、sued-but was acquitted.無罪となった。

Input:

Previous Method: Contextual DA Proposed Method: MASKING

Bad pseudo instance

参照

関連したドキュメント

Habiro con- siders an abelian group A k (H) dened by unitrivalent graphs with k trivalent vertices and with univalent vertices labelled by elements of H , subject to anti- symmetry,

The purpose of this study was to examine the invariance of a quality man- agement model (Yavas & Marcoulides, 1996) across managers from two countries: the United States

The purpose of this study was to examine the invariance of a quality man- agement model (Yavas & Marcoulides, 1996) across managers from two countries: the United States

It is suggested by our method that most of the quadratic algebras for all St¨ ackel equivalence classes of 3D second order quantum superintegrable systems on conformally flat

この数字は 2021 年末と比較すると約 40%の減少となっています。しかしひと月当たりの攻撃 件数を見てみると、 2022 年 1 月は 149 件であったのが 2022 年 3

In this paper we investigate some structure properties of the tail o-field and the invariant o-field of both homogeneous and nonhomogeneous Markov chains as representations

Applications of msets in Logic Programming languages is found to over- come “computational inefficiency” inherent in otherwise situation, especially in solving a sweep of

Shi, “The essential norm of a composition operator on the Bloch space in polydiscs,” Chinese Journal of Contemporary Mathematics, vol. Chen, “Weighted composition operators from Fp,