Abstract

Multimodal machine translation, which incorporates the visual information of an accompanying image, has become a research hotspot in recent years. Most existing works project the image features into the text semantic space and merge them into the model in various ways. In practice, however, different source words may attend to different visual information. We therefore propose a multimodal neural machine translation (MNMT) model that integrates each word and the visual information of the image independently: the word itself and the image regions most similar to it are fused into the word's textual semantics, enhancing both the textual representation and the word-specific visual information. The fused representations are then used to compute the context vector of the decoder attention. We conduct experiments on the original English-German sentence pairs of the multimodal machine translation dataset Multi30k and on manually annotated Indonesian-Chinese sentence pairs. Compared with existing RNN-based MNMT models, our model achieves better performance, demonstrating its effectiveness.
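
The following is a minimal sketch, in PyTorch style, of the per-word visual fusion described above: each source word attends over image region features, and the resulting word-specific visual context is fused with the word's own representation before the decoder computes its attention context vector. The layer names, dimensions, and the tanh-gated fusion are illustrative assumptions, not the authors' exact architecture.

```python
import torch
import torch.nn as nn

class WordVisualFusion(nn.Module):
    """Sketch: fuse each source word with the image regions most similar to it."""
    def __init__(self, d_text: int, d_img: int, d_model: int):
        super().__init__()
        self.q = nn.Linear(d_text, d_model)   # project word states to queries
        self.k = nn.Linear(d_img, d_model)    # project image regions to keys
        self.v = nn.Linear(d_img, d_model)    # project image regions to values
        self.fuse = nn.Linear(d_text + d_model, d_model)  # hypothetical fusion layer

    def forward(self, words, regions):
        # words:   (batch, src_len, d_text)   encoder hidden states per source word
        # regions: (batch, n_regions, d_img)  image region features (e.g. a CNN grid)
        q = self.q(words)                                   # (B, L, d_model)
        k = self.k(regions)                                 # (B, R, d_model)
        v = self.v(regions)                                 # (B, R, d_model)
        scores = torch.bmm(q, k.transpose(1, 2))            # word-to-region similarity
        attn = torch.softmax(scores / q.size(-1) ** 0.5, dim=-1)
        visual = torch.bmm(attn, v)                         # (B, L, d_model) per-word visual info
        # fuse each word's own representation with its word-specific visual context
        return torch.tanh(self.fuse(torch.cat([words, visual], dim=-1)))

# Usage sketch: the fused states would stand in for plain encoder states
# when the decoder attention computes its context vector at each target step.
fusion = WordVisualFusion(d_text=512, d_img=2048, d_model=512)
words = torch.randn(2, 10, 512)      # dummy encoder states
regions = torch.randn(2, 49, 2048)   # dummy 7x7 CNN grid features
fused = fusion(words, regions)       # (2, 10, 512)
```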
