• 検索結果がありません。

Keynote Speech 3

N/A
N/A
Protected

Academic year: 2021

シェア "Keynote Speech 3"

Copied!
2
0
0

読み込み中.... (全文を見る)

全文

(1)

Keynote Speech 3

Thursday, 29 October 8:30-9:30

Toward Simultaneous, Natural and

Multimodal Speech-to-Speech Translation

Satoshi Nakamura

Nara Institute of Science and Technology, Japan

Abstract

Spoken language technologies enable to support natural oral communication.

Language barrier between different language speaking people is one of the biggest communication problems for human being. Speech-to-speech translation had been studied in order to break the language barrier. In this talk, I would like to introduce brief history of the speech-to-speech translation research and state-of-the-art speech translation system. Then I would introduce new challenges toward a simultaneous speech-to-speech translation including simultaneous incremental machine translation, joint-optimization of ASR and MT modules, Speech-to-speech translation preserving para/non-linguistic information and multimodal speech translation with a talking face.

Biography

Dr. Satoshi Nakamura is Professor of Graduate School of Information Science, Nara Institute of Science and Technology, Japan, Honorar professor of Karlsruhe Institute of Technology, Germany, and ATR Fellow. He received his B.S. from Kyoto Institute of Technology in 1981 and Ph.D. from Kyoto University in 1992. He was Associate Professor of Graduate School of Information Science at Nara Institute of Science and Technology in 1994-2000. He was Director of ATR Spoken Language Communication Research Laboratories in 2000-2008 and Vice president of ATR in 2007-2008. He was Director General of Keihanna Research Laboratories and the Executive Director of Knowledge Creating Communication Research Center, National Institute of Information and Communications Technology, Japan in 2009- 2010.

He is currently Director of Augmented Human Communication laboratory and a full professor of Graduate School of Information Science at Nara Institute of Science and Technology.

He is interested in modeling and systems of speech-to-speech translation and speech

recognition. He is one of the leaders of speech-to-speech translation research and has

been serving for various speech-to-speech translation research projects in the world

including C-STAR, IWSLT and A-STAR.

(2)

He received Yamashita Research Award, Kiyasu Award from the Information

Processing Society of Japan, Telecom System Award, AAMT Nagao Award, Docomo

Mobile Science Award in 2007, ASJ Award for Distinguished Achievements in

Acoustics. He received the Commendation for Science and Technology by the

Minister of Education, Culture, Sports, Science and Technology, and the

Commendation for Science and Technology in Information Technology by the

Minister of Internal Affair and Communications. He was also awarded Antonio

Zampolli Prize by European Language Resource Association. He is currently Elected

Board Member of International Speech Communication Association, ISCA and

Elected Committee Member of IEEE SPS Spoken Language Technology Technical

committee.

参照

関連したドキュメント

In this thesis, I intend to examine how freedom of speech has been legally protected in consideration of fundamental human rights, and how the double standards in the

In order to estimate the noise spectrum quickly and accurately, a detection method for a speech-absent frame and a speech-present frame by using a voice activity detector (VAD)

A study of speech performance degradation caused by social anxiety trend Ayaka IWATA*, Tomonori KAWAI**, Junichi SAITO*, Taiki SHIMA*, Hiroaki KUMANO*** *Graduate School of

The aim of this study is to improve the quality of machine-translated Japanese from an English source by optimizing the source content using a machine translation (MT) engine.. We

Advanced speech technology, such as voice conversion techniques and speech synthesis, can synthesize or clone speech entirely as a human voice.. Distributing users’

Katagiri, “A Derivation of Minimum Classification Error from the Theoretical Classification Risk Using Parzen Estimation”, Computer Speech and Language, vol.

Our proposed method is to improve the trans- lation performance of NMT models by converting only Sino-Korean words into corresponding Chinese characters in Korean sentences using

Today Iʼm going to make a speech about my dream... )in