Learning Domain Specific Sub-layer Latent Variable for Multi-Domain Adaptation Neural Machine Translation

Shuanghong Huang, Ge Shi, Zhengjun Li, Xuan Zhao, Xiaomei Wang, Xinyan Li, Chong Feng

doi:10.1145/3661305

Abstract

Domain adaptation is an effective way to address inadequate translation performance within specific domains. However, the straightforward approach of mixing data from multiple domains to train a multi-domain neural machine translation (NMT) model can cause parameter interference between domains, which degrades overall performance. To address this, we introduce a multi-domain adaptive NMT method that learns domain-specific sub-layer latent variables and employs the Gumbel-Softmax reparameterization technique to jointly train the model parameters and these latent variables. This approach facilitates the learning of private domain-specific knowledge while sharing common domain-invariant knowledge, effectively mitigating the parameter interference problem. Experimental results show that the proposed method yields significant improvements over the baseline model of up to 7.68 and 3.71 BLEU on public English-German and Chinese-English multi-domain datasets, respectively.
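
The Gumbel-Softmax reparameterization named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say what the sub-layer latent variable selects, so the code assumes a two-way choice (skip or use a sub-layer) per domain, and the domain names, sub-layer count, temperature, and the function name `gumbel_softmax` are all hypothetical.

```python
import math
import random


def gumbel_softmax(logits, temperature, rng):
    """Draw one relaxed (soft) one-hot sample from a categorical distribution."""
    noisy = []
    for logit in logits:
        # Gumbel(0, 1) noise by inverse transform: g = -log(-log(u)), u ~ Uniform(0, 1).
        u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)  # keep u strictly inside (0, 1)
        g = -math.log(-math.log(u))
        noisy.append((logit + g) / temperature)
    # Numerically stable softmax over the perturbed, temperature-scaled logits.
    peak = max(noisy)
    exps = [math.exp(v - peak) for v in noisy]
    total = sum(exps)
    return [e / total for e in exps]


# Assumed setup (hypothetical): one pair of logits [skip, use] per (domain, sub-layer).
domains = ["news", "medical"]
num_sublayers = 4
rng = random.Random(0)
logits = {d: [[0.0, 0.0] for _ in range(num_sublayers)] for d in domains}

# Soft gate per sub-layer: the weight the sample places on the "use" option.
gates = {
    d: [gumbel_softmax(pair, temperature=0.5, rng=rng)[1] for pair in logits[d]]
    for d in domains
}
```

Because the sample is a smooth function of the logits, an autodiff framework could update the selection logits by gradient descent together with the translation model's weights, which is the kind of joint training the abstract describes. Lower temperatures push each sample closer to a hard one-hot choice.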

Journal: ACM Transactions on Asian and Low-Resource Language Information Processing
Publication Date: Apr 29, 2024
License type: mit

Similar Papers

Neural Machine Translation model for University Email Application
Sandhya Aneja ... Siti Nur Afikah Bte Abdul Mazid
11 Jul 2020

What do Neural Machine Translation Models Learn about Morphology?
Yonatan Belinkov ... Hassan Sajjad
01 Jan 2017

Combining SMT and NMT Back-Translated Data for Efficient NMT
Alberto Poncelas ... Andy Way
22 Oct 2019

Adversarial Subword Regularization for Robust Neural Machine Translation
Jungsoo Park ... Jinhyuk Lee
01 Jan 2020
