Domain Adaptation and Multi-Domain Adaptation for Neural Machine Translation: A Survey

Danielle Saunders

doi:10.1613/jair.1.13566

Abstract

The development of deep learning techniques has allowed Neural Machine Translation (NMT) models to become extremely powerful, given sufficient training data and training time. However, systems struggle when translating text from a new domain with a distinct style or vocabulary. Fine-tuning on in-domain data allows good domain adaptation, but requires sufficient relevant bilingual data. Even if this is available, simple fine-tuning can cause overfitting to new data and catastrophic forgetting of previously learned behaviour. We survey approaches to domain adaptation for NMT, particularly where a system may need to translate across multiple domains. We divide techniques into those revolving around data selection or generation, model architecture, parameter adaptation procedure, and inference procedure. Finally, we highlight the benefits of domain adaptation and multi-domain adaptation techniques to other lines of NMT research.

Journal: Journal of Artificial Intelligence Research
Publication Date: Sep 29, 2022
Citations: 22
License type: CC BY
