• 検索結果がありません。

The AIP-Tohoku System at the BEA-2019 Task Shared

N/A
N/A
Protected

Academic year: 2021

シェア "The AIP-Tohoku System at the BEA-2019 Task Shared"

Copied!
1
0
0

読み込み中.... (全文を見る)

全文

(1)

The AIP-Tohoku System at the BEA-2019 Shared Task

Hiroki Asano

12*

, Masato Mita

21

, Tomoya Mizumoto

21†

, Jun Suzuki

12

1

Tohoku University,

2

RIKEN Center for Advanced Intelligence Project (AIP), *Yahoo Japan Corporation, † Future Corporation

Key Technique: Sentence-level Error Detection (SED)

System Architecture

Model Prec. Rec. F0.5 Rank Track1 68.62 42.16 60.97 9 th Track2 70.60 51.03 65.57 2 nd

Results

Model Prec. Rec. F0.5 GEC 61.97 42.11 56.63 +GenData 64.57 46.40 59.88 +SED 68.62 42.16 60.97

Ablation Test

• This is the first study that has

combined GEC with sentence-level error detection (SED)

• Our result demonstrates SED improve the precision of GEC

• Our system is ranked 9 th in Track1 and 2 nd in Track2

Reduce FP by passing only sentences that contain errors to the GEC model using SED

Motivation Base SED

• Performs sentence-level binary classification of sentences that need editing

Proficiency Prediction Module (PPM)

• Base PP predicts the leaners proficiency

• Employed a multi-task learning approach in which PP model and SED model

simultaneously Fine-tuned SED

• SED model is fine-tuned for each level of proficiency (Lv. A, Lv. B, Lv. C)

Architecture

Main Leaderboard

Experimental Configurations

Summary

Prec. Rec. F Base SED 88.5 79.8 83.9 Proposed SED 91.3 95.6 93.4

GEC Model

• Transformer-based Model SED Model

• BERT-based Model

Error Generation Model (GenData)

• Following the system by Edunov et al. (2018)

Dataset

Model Track1 Track2

GEC

Official data (564K)

Official data (564K)

EFCAMDAT [Geertzen et al+2013] + Non- public Lang-8 (7.7M) GenData

Simple Wikipedia + Essay scoring data sets (i.e, ICLE [Granger+2009],

ICNALE[Ishikawa], ASAP, TOEFL 11[Blanchard+2013]) (1.4M)

SED

Official data (564K)

Model

fine-tuned (+9.5 F point)

We input grammatically incorrect sentences predicted by the SED model into our GEC model

+ 4.05 point

参照

関連したドキュメント

Then the center-valued Atiyah conjecture is true for all elementary amenable extensions of pure braid groups, of right-angled Artin groups, of prim- itive link groups, of

[5℄ Pathak R.S., The wavelet transform of distributions ,

You may contact BASF Corporation for emergency medical treatment information at 1-800-832-HELP (4357).. Batch code: (Printed on Bottle)

She has curated a number of major special exhibitions for the Gotoh Museum, including Meibutsu gire (From Loom to Heirloom: The World of Meibutsu-gire Textiles) in 2001,

In order to provide for compensation payments for nuclear damages concerning the accident of Fukushima Daiichi Nuclear Power station damaged by the Tohoku-Chihou-Taiheiyou-Oki

Command 3ME Microencapsulated Herbicide may be utilized as a soil applied treatment prior to weed emergence, for suppression or control of labeled annual grass and broadleaf weeds

As a result of the Time Transient Response Analysis utilizing the Design Basis Ground Motion (Ss), the shear strain generated in the seismic wall that remained on and below the

The total rate of HARMONY EXTRA SG herbicide for wheat (including durum), barley and triticale cannot exceed 1.5 oz/A (0.0312 lb/A thifensulfuron methyl and 0.0156 lb/A