Abstract

Traditional multi-modal machine translation mainly introduces static images as additional modal information to improve translation quality. In practice, a variety of methods are combined to improve the data and features so that translation quality approaches its upper limit, and some methods even depend on the sensitivity of sample-distance algorithms to the data. At the same time, multi-modal MT suffers from a lack of semantic interaction in the attention mechanism within the same corpus, or from over-encoding image information that is irrelevant to the text and corpus, which introduces excessive noise. To address these problems, this article proposes a new input port that adds visual image processing to the decoder. The core idea is to combine visual image information with the traditional attention mechanism at each decoding time step: a dynamic router extracts the relevant visual features, the multi-modal visual features are integrated into the decoder, and the target word is predicted with the help of the visual information. Experiments were carried out on the Multi30K English, French, and Czech translation data, demonstrating the advantage of extracting visual features in the decoder.
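As a rough illustration of the idea described above (not the authors' implementation), the PyTorch sketch below shows one plausible decoder step: the decoder state queries a set of visual region features through attention (the "dynamic router"), and a learned gate controls how much of the attended visual context is fused into the state before predicting the target word, which limits noise from image regions unrelated to the text. All class names, dimensions, and the gating scheme are assumptions for illustration.

```python
import torch
import torch.nn as nn

class VisualGatedDecoderStep(nn.Module):
    """Hypothetical sketch: per-step visual routing into a decoder state."""

    def __init__(self, hidden_dim: int, visual_dim: int):
        super().__init__()
        # Project visual region features into the decoder's hidden space.
        self.visual_proj = nn.Linear(visual_dim, hidden_dim)
        # "Dynamic router": attention that selects relevant visual regions
        # at each decoding time step.
        self.visual_attn = nn.MultiheadAttention(
            hidden_dim, num_heads=4, batch_first=True
        )
        # Gate deciding how much visual context enters the decoder state,
        # suppressing image information irrelevant to the source text.
        self.gate = nn.Linear(2 * hidden_dim, hidden_dim)

    def forward(self, decoder_state: torch.Tensor,
                visual_feats: torch.Tensor) -> torch.Tensor:
        # decoder_state: (batch, 1, hidden_dim) -- state at one time step.
        # visual_feats:  (batch, n_regions, visual_dim) -- e.g. CNN regions.
        v = self.visual_proj(visual_feats)
        visual_ctx, _ = self.visual_attn(decoder_state, v, v)
        g = torch.sigmoid(
            self.gate(torch.cat([decoder_state, visual_ctx], dim=-1))
        )
        # Gated fusion of textual decoder state and visual context; the
        # result would feed the output softmax over target words.
        return g * visual_ctx + (1 - g) * decoder_state

# Usage with dummy tensors:
step = VisualGatedDecoderStep(hidden_dim=512, visual_dim=2048)
h = torch.randn(2, 1, 512)          # decoder states for a batch of 2
regions = torch.randn(2, 49, 2048)  # 49 visual regions per image
fused = step(h, regions)            # -> (2, 1, 512)
```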
