Abstract

Exposing diverse subword segmentations to neural machine translation (NMT) models often improves the robustness of machine translation, as NMT models can experience various subword candidates. However, the diversification of subword segmentations mostly relies on pre-trained subword language models, from which erroneous segmentations of unseen words are less likely to be sampled. In this paper, we present adversarial subword regularization (ADVSR) to study whether gradient signals during training can be a substitute criterion for exposing diverse subword segmentations. We experimentally show that our model-based adversarial samples effectively encourage NMT models to be less sensitive to segmentation errors and improve the performance of NMT models on low-resource and out-of-domain datasets.

Highlights

  • Subword segmentation is a method of segmenting an input sentence into a sequence of subword units (Sennrich et al., 2016; Wu et al., 2016; Kudo, 2018); a sampling sketch is given after this list

  • Our experiments show that neural machine translation (NMT) models trained with adversarial subword regularization (ADVSR) improve over baseline NMT models by up to 3.2 BLEU on the IWSLT datasets while outperforming the standard subword regularization method

  • Exposing multiple subword candidates to NMT models shows superior performance in domain adaptation, which matches the finding of Müller et al. (2019)
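
For reference, the stochastic segmentation used in standard subword regularization (Kudo, 2018) can be reproduced with the SentencePiece library's unigram sampling. The snippet below is a minimal sketch; the model path "spm.model" and the example sentence are placeholders, not artifacts from the paper.

    # Minimal sketch: sampling diverse subword segmentations with
    # SentencePiece's unigram LM (Kudo, 2018). "spm.model" is a placeholder
    # path to an already-trained SentencePiece model.
    import sentencepiece as spm

    sp = spm.SentencePieceProcessor(model_file="spm.model")
    sentence = "unobtainable results"

    # Deterministic (single-best) segmentation.
    print(sp.encode(sentence, out_type=str))

    # Stochastic segmentations: nbest_size=-1 samples over the full lattice;
    # alpha smooths the unigram LM distribution (smaller = more diverse).
    for _ in range(3):
        print(sp.encode(sentence, out_type=str,
                        enable_sampling=True, alpha=0.1, nbest_size=-1))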

Summary

Introduction

Subword segmentation is a method of segmenting an input sentence into a sequence of subword units (Sennrich et al., 2016; Wu et al., 2016; Kudo, 2018). Subword regularization (Kudo, 2018) relies on unigram language models to sample segmentation candidates, where the language models are optimized based on corpus-level statistics from the training data with no regard to the translation task objective. This causes NMT models to experience only a limited set of subword candidates, namely those frequently observed in the training data. We adopt the adversarial training framework (Goodfellow et al., 2014; Miyato et al., 2016; Ebrahimi et al., 2017; Cheng et al., 2019) to search for subword segmentations that effectively regularize NMT models. As it is computationally expensive to exactly estimate r in Eq. 3, Miyato et al. (2016) resort to the linear approximation method (Goodfellow et al., 2014), where r_i is approximated as

    r_i = ε · g_i / ||g||_2,  with g_i = ∇_{e(x_i)} L(x, y; θ),

where e(x_i) is the embedding of the i-th subword, L is the translation loss, and ε controls the norm of the perturbation.
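
To make the linear approximation concrete, the following is a minimal PyTorch sketch (our own illustration, not the authors' released code) of first-order scoring of subword substitutions: the gradient g_i of the loss with respect to each input embedding estimates how much replacing subword x_i with a candidate v would raise the loss, following Goodfellow et al. (2014), Miyato et al. (2016), and the HotFlip-style scoring of Ebrahimi et al. (2017). The function name and tensor arguments are hypothetical.

    # First-order (Taylor) estimate of the loss change from replacing each
    # input subword x_i with each vocabulary entry v:
    #   L(e(v)) - L(e(x_i)) ~= (e(v) - e(x_i)) . g_i,  g_i = dL/de(x_i)
    # A gradient-guided regularizer can then prefer, among segmentations of
    # the same surface string, the one with the highest estimated loss.
    import torch

    def score_candidates(loss, emb_matrix, input_embs):
        """loss: scalar; emb_matrix: [vocab, dim]; input_embs: [seq, dim]
        (must require grad). Returns [seq, vocab] first-order scores."""
        (grads,) = torch.autograd.grad(loss, input_embs)
        # g_i . e(v) for all v, minus g_i . e(x_i):
        return grads @ emb_matrix.T - (grads * input_embs).sum(-1, keepdim=True)

In practice the search would be restricted to candidate subwords that still yield a valid segmentation of the original sentence, so that only the segmentation, not the content, is perturbed.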

Background
Approach
Problem Definition
Adversarial Subword Regularization
Experimental Setup
Evaluation
Results on Low-Resource Dataset
Datasets and Implementation Details
Results on Out-Domain Dataset
Results on Synthetic Dataset
Related Work
Conclusions
Details of Training
Details of Experimental Settings
Sampled Translation Outputs