Multilingual End-to-End Speech Translation

Hirofumi Inaguma,Kevin Duh,Shinji Watanabe,Tatsuya Kawahara

doi:10.1109/asru46091.2019.9003832

Abstract

In this paper, we propose a simple yet effective framework for multilingual end-to-end speech translation (ST), in which speech utterances in source languages are directly translated to the desired target languages with a universal sequence-to-sequence architecture. While multilingual models have shown to be useful for automatic speech recognition (ASR) and machine translation (MT), this is the first time they are applied to the end-to-end ST problem. We show the effectiveness of multilingual end-to-end ST in two scenarios: one-to-many and many-to-many translations with publicly available data. We experimentally confirm that multilingual end-to-end ST models significantly outperform bilingual ones in both scenarios. The generalization of multilingual training is also evaluated in a transfer learning scenario to a very low-resource language pair. All of our codes and the database are publicly available to encourage further research in this emergent multilingual ST topic11Available at https://github.com/espnet/espnet..

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multilingual End-to-End Speech Translation

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation Encoders
...
-
, et. al. ...
01 Aug 2021
01 Aug 2021

End-to-End Speech Translation With Transcoding by Multi-Task Learning for Distant Language Pairs
Takatomo Kano ... Sakriani Sakti
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 28
Takatomo Kano, et. al.Takatomo Kano ... Sakriani Sakti
01 Jan 2020
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 28

Consolidation-Based Speech Translation and Evaluation Approach
Chiori Hori ... Hideki Kashioka
IEICE Transactions on Information and Systems | VOL. E92-D
Chiori Hori, et. al.Chiori Hori ... Hideki Kashioka
01 Jan 2009
IEICE Transactions on Information and Systems | VOL. E92-D

Cascaded Models with Cyclic Feedback for Direct Speech Translation
Tsz Kin Lam ... Shigehiko Schamoni
-
Tsz Kin Lam, et. al.Tsz Kin Lam ... Shigehiko Schamoni
06 Jun 2021
06 Jun 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multilingual End-to-End Speech Translation

Abstract

Talk to us

Similar Papers