ディストーションリミットによる翻訳への影響

第 6 章 Factored Translation Models を用いた事後並べ替え 19

7.4 実験結果と考察

7.4.3 ディストーションリミットによる翻訳への影響

ディストーションリミットによる翻訳への影響を調査した結果を表7.5に示す．

表7.5より，事後並べ替えにおけるfactored translation modelsは日本語からHFE への翻訳におけるディストーションリミットによって翻訳への影響が異なっていることが分かる．BLEUに関しては，日本語からHFEへの翻訳におけるディストーションリミットを大きく設定することでスコアが高くなった．これは，ディストーションリミットを大きくすることで，日本語からHFEの翻訳の際に単語のアラインメントがうまく学習できていることが影響していると考えられる．一方

のRIBESは，日本語からHFEへの翻訳におけるディストーションリミットを小

さくしたほうがスコアが高くなった．これは，ディストーションリミットを小さくすることで，日本語からHFEへの翻訳における出力文が日本語の語順に近くなり，HFEから英語への翻訳システムの学習に用いたHFEのデータの傾向に近づくことで，並べ替えモデルが有効に働いたためだと考えられる．

また，事後並べ替えにおけるそれぞれのfactorの影響については，ディストー 24

+ 50クラスタ 16.69 65.39

+ 1,000クラスタ 6 12 16.16 65.89

+品詞，50クラスタ 16.55 65.45

+品詞，1,000クラスタ 16.79 65.99

+ 50クラスタ& +品詞，1,000クラスタ 17.16 65.69

表7.4: 最適なfactorを考慮した際の翻訳精度

ションリミットの設定にかかわらず，品詞と1,000クラスタを考慮した場合が最もBLEUが高くなった．RIBESに関しては，ディストーションリミットを小さく設定した際は，品詞と1,000クラスタを考慮した場合が最も高くなったのに対して，ディストーションリミットを大きく設定した際は，1,000クラスタのみを考慮した場合が最も高くなった．

ディストーションリミット

BLEU RIBES 日本語 →HFE HFE→ 英語

事後並べ替え 16.22 65.73

事後並べ替え+品詞 16.22 65.77

事後並べ替え+ 50クラスタ 6 12 16.69 65.39 事後並べ替え+ 1,000クラスタ 16.16 65.89 事後並べ替え+品詞，50クラスタ 16.55 65.45 事後並べ替え+品詞，1,000クラスタ 16.79 65.99

事後並べ替え 16.32 64.64

事後並べ替え+品詞 17.16 64.64

事後並べ替え+ 50クラスタ 20 12 16.62 63.84 事後並べ替え+ 1,000クラスタ 17.01 65.36 事後並べ替え+品詞，50クラスタ 16.84 64.39 事後並べ替え+品詞，1,000クラスタ 17.43 65.25

表7.5: ディストーションリミットによる翻訳への影響

第 8 ^章 ^結言

本研究では日英翻訳に対する事後並べ替えにfactored translation modelsを用いて単語の品詞とクラスタを翻訳における追加の情報として考慮する手法を提案した．実験から，追加の情報を考慮することによって並べ替えの精度が向上することが確認できた．また，factored translation modelsはBLEUによるn-garmの適合

率とRIBESによる語順の適合率に対して異なる影響を持つことが分かった．

今後の課題として，考慮するfactorの調査を行う必要があると考えている．本研究で行った実験から，50クラスタよりも1,000クラスタを考慮することによって翻訳精度が向上したことが確認できた．これによって，クラスタの粒度の違いによって翻訳精度に影響を及ぼすことが分かった．また，今回はクラスタリングの手法としてbrown clusteringを用いたが，deep learningなどのより多くの情報を考慮できるクラスタリングの手法を用いることで，より訓練データの傾向を捉え

たfactorを利用でき，並べ替えの改善につながると考えられる．

また，原言語のfactorを考慮することによって，日本語からHFEへの翻訳における精度の改善が期待できる．原言語側のfactorの情報は，日本語の助詞などの，

単語の表層は一致しているが意味の異なる単語の翻訳において有効であると考えられる．これによって，日本語からHFEへの単語の翻訳の精度が向上すると考えられる．

参考文献

[1] Peter F Brown, Peter V Desouza, Robert L Mercer, Vincent J Della Pietra, and Jenifer C Lai. Class-based n-gram models of natural language. Computational linguistics, Vol. 18, No. 4, pp. 467–479, 1992.

[2] Peter F Brown, Vincent J Della Pietra, Stephen A Della Pietra, and Robert L Mercer. The mathematics of statistical machine translation: Parameter estima-tion. Computational linguistics, Vol. 19, No. 2, pp. 263–311, 1993.

[3] Michael Collins, Philipp Koehn, and Ivona Kuˇcerov´a. Clause restructuring for statistical machine translation. pp. 531–540, 2005.

[4] Michel Galley and Christopher D Manning. A simple and effective hierarchical phrase reordering model. InProceedings of the Conference on Empirical Meth-ods in Natural Language Processing, pp. 848–856, 2008.

[5] Hideki Isozaki, Tsutomu Hirao, Kevin Duh, Katsuhito Sudoh, and Hajime Tsukada. Automatic evaluation of translation quality for distant language pairs.

InProceedings of the 2010 Conference on Empirical Methods in Natural Lan-guage Processing, pp. 944–952, 2010.

[6] Hideki Isozaki, Katsuhito Sudoh, Hajime Tsukada, and Kevin Duh. Head fi-nalization: A simple reordering rule for sov languages. In Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, pp.

244–251, 2010.

[7] Jason Katz-Brown and Michael Collins. Syntactic Reordering in Preprocess-ing for Japanese to English Translation: MIT System Description for NTCIR-7 Patent Translation Task. Proceedings of NTCIR-7 Workshop Meeting, 2008.

[8] Reinhard Kneser and Hermann Ney. Improved backing-off for m-gram language modeling. InAcoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on, Vol. 1, pp. 181–184. IEEE, 1995.

[9] Philipp Koehn and Hieu Hoang. Factored translation models. EMNLP-CoNLL, pp. 868–876, 2007.

[10] Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, et al. Moses: Open source toolkit for statistical machine translation. pp.

177–180, 2007.

[11] Philipp Koehn, Franz Josef Och, and Daniel Marcu. Statistical phrase-based translation. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology-Volume 1, pp. 48–54, 2003.

[12] Yusuke Miyao and Jun’ichi Tsujii. Feature forest models for probabilistic hpsg parsing. Computational Linguistics, Vol. 34, No. 1, pp. 35–80, 2008.

[13] Franz Josef Och. Minimum error rate training in statistical machine translation.

In Proceedings of the 41st Annual Meeting on Association for Computational Linguistics-Volume 1, pp. 160–167, 2003.

[14] Franz Josef Och and Hermann Ney. Discriminative training and maximum en-tropy models for statistical machine translation. InProceedings of the 40th An-nual Meeting on Association for Computational Linguistics, pp. 295–302, 2002.

[15] Franz Josef Och and Hermann Ney. A systematic comparison of various statis-tical alignment models. Computational linguistics, Vol. 29, No. 1, pp. 19–51, 2003.

[16] Franz Josef Och, Christoph Tillmann, Hermann Ney, et al. Improved alignment models for statistical machine translation. In Proc. of the Joint SIGDAT Conf.

on Empirical Methods in Natural Language Processing and Very Large Corpora, pp. 20–28, 1999.

[17] Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. Bleu: a method for automatic evaluation of machine translation. InProceedings of the 40th an-nual meeting on association for computational linguistics, pp. 311–318, 2002.

[18] Katsuhito Sudoh, Xianchao Wu, Kevin Duh, Hajime Tsukada, and Masaaki Na-gata. Post-ordering in statistical machine translation. InProc. MT Summit, 2011.

ドキュメント内 ( ) Kevin Duh (ページ 38-45)

第 6 章 Factored Translation Models を用いた事後並べ替え 19

7.4 実験結果と考察

7.4.3 ディストーションリミットによる翻訳への影響

第 8 章 結言

参考文献

第 8 ^章 ^結言