Reinforced Curriculum Learning on Pre-Trained Neural Machine Translation Models

Mingjun Zhao,Xiaoli Wang,Di Niu,Haijiang Wu

doi:10.1609/aaai.v34i05.6513

Abstract

The competitive performance of neural machine translation (NMT) critically relies on large amounts of training data. However, acquiring high-quality translation pairs requires expert knowledge and is costly. Therefore, how to best utilize a given dataset of samples with diverse quality and characteristics becomes an important yet understudied question in NMT. Curriculum learning methods have been introduced to NMT to optimize a model's performance by prescribing the data input order, based on heuristics such as the assessment of noise and difficulty levels. However, existing methods require training from scratch, while in practice most NMT models are pre-trained on big data already. Moreover, as heuristics, they do not generalize well. In this paper, we aim to learn a curriculum for improving a pre-trained NMT model by re-selecting influential data samples from the original training set and formulate this task as a reinforcement learning problem. Specifically, we propose a data selection framework based on Deterministic Actor-Critic, in which a critic network predicts the expected change of model performance due to a certain sample, while an actor network learns to select the best sample out of a random batch of samples presented to it. Experiments on several translation datasets show that our method can further improve the performance of NMT when original batch training reaches its ceiling, without using additional new training data, and significantly outperforms several strong baseline methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Reinforced Curriculum Learning on Pre-Trained Neural Machine Translation Models

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Apr 3, 2020
Citations: 20

Similar Papers

Translation Transformers Rediscover Inherent Data Domains
...
-
, et. al. ...
21 Oct 2021
21 Oct 2021

A Study of Reinforcement Learning for Neural Machine Translation
Lijun Wu ... Tie-Yan Liu
-
Lijun Wu, et. al.Lijun Wu ... Tie-Yan Liu
01 Jan 2018
01 Jan 2018

Neural Machine Translation with Monolingual Translation Memory
...
-
, et. al. ...
01 Aug 2021
01 Aug 2021

Character-Aware Low-Resource Neural Machine Translation with Weight Sharing and Pre-training
Yichao Cao ... Miao Li
-
Yichao Cao, et. al.Yichao Cao ... Miao Li
01 Jan 2019
01 Jan 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Reinforced Curriculum Learning on Pre-Trained Neural Machine Translation Models

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence