Abstract

Simultaneous translation aims to translate a sentence before the speaker has finished it, so that the speaker's intention can be understood in real time. At present, simultaneous machine translation still relies mainly on text-to-text data resources. In a pure-text translation system, the decoder's input comes only from the encoder's output, which in turn is derived solely from the text content; this single source of input can leave the decoder short of information and cause words to be missed in translation. A human interpreter, by contrast, also visually captures information from the surrounding scene to assist in translation. Based on this observation, we propose a multi-modal simultaneous machine translation method that fuses image information. We extract information from the image and add it to the decoder side of the translation system, increasing the decoder's input data resources and helping the system improve translation quality. We verify the method experimentally on the Multi30K dataset. Compared with a plain-text translation system, the proposed method translates more complete sentences with richer content and achieves better translation results.
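The abstract describes feeding image-derived features into the decoder as an additional input source. The exact fusion mechanism is not specified here; the following is only a minimal PyTorch-style sketch of one plausible realization, assuming a Transformer decoder layer with a hypothetical gated combination of text and image cross-attention contexts.

```python
import torch
import torch.nn as nn

class ImageFusedDecoderLayer(nn.Module):
    """Hypothetical decoder layer: masked self-attention plus two cross-attention
    branches, one over text encoder states and one over image region features.
    A gated sum of the two contexts is assumed here purely for illustration."""

    def __init__(self, d_model=512, n_heads=8, d_image=2048):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.text_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.image_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.image_proj = nn.Linear(d_image, d_model)  # map CNN region features to model dim
        self.gate = nn.Linear(2 * d_model, d_model)    # learn how much image context to mix in
        self.ffn = nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.ReLU(),
                                 nn.Linear(4 * d_model, d_model))
        self.norm1 = nn.LayerNorm(d_model)
        self.norm2 = nn.LayerNorm(d_model)
        self.norm3 = nn.LayerNorm(d_model)

    def forward(self, tgt, text_mem, image_feats, tgt_mask=None):
        # Masked self-attention over the partial target (simultaneous setting).
        x = self.norm1(tgt + self.self_attn(tgt, tgt, tgt, attn_mask=tgt_mask)[0])
        # Cross-attention over source-text states and over projected image regions.
        text_ctx = self.text_attn(x, text_mem, text_mem)[0]
        img_mem = self.image_proj(image_feats)
        img_ctx = self.image_attn(x, img_mem, img_mem)[0]
        # Gated fusion: image context supplements the text context at the decoder.
        g = torch.sigmoid(self.gate(torch.cat([text_ctx, img_ctx], dim=-1)))
        x = self.norm2(x + g * text_ctx + (1 - g) * img_ctx)
        return self.norm3(x + self.ffn(x))
```

Under this assumption, the image branch gives the decoder a second information source when the partial source text alone is insufficient, which is the effect the abstract attributes to the proposed fusion.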
