The AIP-Tohoku System at the BEA-2019 Shared Task

Hiroki Asano (1, 2, *), Masato Mita (2, 1), Tomoya Mizumoto (2, 1, †), Jun Suzuki (1, 2)

1 Tohoku University, 2 RIKEN Center for Advanced Intelligence Project (AIP), * Yahoo Japan Corporation, † Future Corporation

Key Technique: Sentence-level Error Detection (SED)

System Architecture

Results

Main Leaderboard

Track     Prec.   Rec.    F0.5    Rank
Track 1   68.62   42.16   60.97   9th
Track 2   70.60   51.03   65.57   2nd

Ablation Test

Model      Prec.   Rec.    F0.5
GEC        61.97   42.11   56.63
+GenData   64.57   46.40   59.88
+SED       68.62   42.16   60.97
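The F0.5 column of the ablation test can be recomputed from the reported precision and recall. The sketch below assumes the standard F-beta definition with beta = 0.5, which weights precision more heavily than recall; the function and variable names are illustrative and do not come from the poster.

```python
def f_beta(precision: float, recall: float, beta: float = 0.5) -> float:
    """Standard F-beta score; beta < 1 favours precision over recall."""
    if precision == 0 and recall == 0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# (precision, recall, reported F0.5) copied from the ablation rows.
ABLATION = {
    "GEC": (61.97, 42.11, 56.63),
    "+GenData": (64.57, 46.40, 59.88),
    "+SED": (68.62, 42.16, 60.97),
}


def recompute(rows):
    """Recompute F0.5 for every row, rounded to two decimals like the table."""
    return {name: round(f_beta(p, r), 2) for name, (p, r, _) in rows.items()}


if __name__ == "__main__":
    for name, score in recompute(ABLATION).items():
        print(f"{name:10s} recomputed F0.5 = {score:.2f}")
```

Because precision carries more weight at beta = 0.5, the +SED row ends with the highest F0.5 even though its recall is lower than that of the +GenData row.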

Summary

• This is the first study to combine GEC with sentence-level error detection (SED)
• Our results demonstrate that SED improves the precision of GEC
• Our system ranked 9th in Track 1 and 2nd in Track 2

Motivation

• Reduce false positives (FP) by using SED to pass only sentences that contain errors to the GEC model

Base SED

• Performs sentence-level binary classification of sentences that need editing
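The gating idea can be sketched as follows: a binary SED classifier decides whether a sentence is sent to the GEC model at all. This is a minimal illustration, not the authors' implementation; `needs_editing`, `correct`, `toy_sed` and `toy_gec` are hypothetical stand-ins for the BERT-based SED model and the Transformer-based GEC model.

```python
from typing import Callable, List


def correct_with_sed_gate(
    sentences: List[str],
    needs_editing: Callable[[str], bool],  # hypothetical SED classifier
    correct: Callable[[str], str],         # hypothetical GEC model
) -> List[str]:
    """Run GEC only on sentences flagged by SED; copy the others unchanged."""
    return [correct(s) if needs_editing(s) else s for s in sentences]


def toy_sed(sentence: str) -> bool:
    """Toy detector (assumption): flags only one known error pattern."""
    return "a apple" in sentence


def toy_gec(sentence: str) -> str:
    """Toy corrector (assumption): fixes one real error, but also
    over-corrects an acceptable phrase, which would be a false positive."""
    return sentence.replace("a apple", "an apple").replace("data is", "data are")


if __name__ == "__main__":
    inputs = ["I ate a apple .", "The data is clean ."]
    print(correct_with_sed_gate(inputs, toy_sed, toy_gec))
```

In this toy setting the second sentence never reaches the corrector, so the unnecessary edit that `toy_gec` would otherwise make is avoided. The trade-off is that any erroneous sentence the detector misses is left uncorrected.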

Proficiency Prediction Module (PPM)

• The base PP model predicts the learner's proficiency
• Employs a multi-task learning approach in which the PP model and the SED model are trained simultaneously

Fine-tuned SED

• The SED model is fine-tuned for each proficiency level (Lv. A, Lv. B, Lv. C)
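One plausible reading of how the PPM and the fine-tuned SED models fit together at inference time is a simple router: predict the proficiency level, then apply the SED model fine-tuned for that level. The sketch below makes that reading explicit. The poster does not say whether proficiency is predicted per sentence or per essay, and the fallback to a base SED model is an assumption; all function names and the toy classifiers are hypothetical.

```python
from typing import Callable, Dict

SedModel = Callable[[str], bool]


def detect_with_proficiency(
    text: str,
    predict_level: Callable[[str], str],  # hypothetical PPM, returns "A", "B" or "C"
    sed_by_level: Dict[str, SedModel],    # hypothetical per-level fine-tuned SED models
    base_sed: SedModel,                   # assumed fallback when a level has no model
) -> bool:
    """Pick the SED model for the predicted level and classify the text."""
    level = predict_level(text)
    classifier = sed_by_level.get(level, base_sed)
    return classifier(text)


def toy_level(text: str) -> str:
    """Toy proficiency predictor (assumption): longer text means higher level."""
    n_tokens = len(text.split())
    return "A" if n_tokens < 6 else "B" if n_tokens < 12 else "C"


# Toy per-level detectors (assumptions), each keyed on one error pattern.
TOY_SED_BY_LEVEL: Dict[str, SedModel] = {
    "A": lambda text: " has " in f" {text} " and text.startswith("I "),
    "C": lambda text: "informations" in text,
}


def toy_base_sed(text: str) -> bool:
    """Toy base detector (assumption) used when no level-specific model exists."""
    return "teh" in text.split()
```

A call such as `detect_with_proficiency("I has cat .", toy_level, TOY_SED_BY_LEVEL, toy_base_sed)` routes the text to the level A detector, while a text predicted as level B falls back to `toy_base_sed` because no level B entry is registered in this toy setup.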

Model          Prec.   Rec.    F
Base SED       88.5    79.8    83.9
Proposed SED   91.3    95.6    93.4

Experimental Configurations

GEC Model

• Transformer-based Model

SED Model

• BERT-based Model

Error Generation Model (GenData)

• Follows the system by Edunov et al. (2018)

Dataset

• GEC
  • Track 1: Official data (564K)
  • Track 2: Official data (564K); EFCAMDAT [Geertzen et al.+2013] + non-public Lang-8 (7.7M)
• GenData: Simple Wikipedia + essay scoring data sets (i.e., ICLE [Granger+2009], ICNALE [Ishikawa], ASAP, TOEFL11 [Blanchard+2013]) (1.4M)
• SED: Official data (564K)

Fine-tuned model (+9.5 F points)

We input the sentences that the SED model predicts to be grammatically incorrect into our GEC model (+4.05 points)
