Non-autoregressive Translation with Layer-Wise Prediction and Deep Supervision

Chenyang Huang,Osmar R Zaïane,Lili Mou,Lei Li,Hao Zhou

doi:10.1609/aaai.v36i10.21323

Abstract

How do we perform efficient inference while retaining high translation quality? Existing neural machine translation models, such as Transformer, achieve high performance, but they decode words one by one, which is inefficient. Recent non-autoregressive translation models speed up the inference, but their quality is still inferior. In this work, we propose DSLP, a highly efficient and high-performance model for machine translation. The key insight is to train a non-autoregressive Transformer with Deep Supervision and feed additional Layer-wise Predictions. We conducted extensive experiments on four translation tasks (both directions of WMT'14 EN-DE and WMT'16 EN-RO). Results show that our approach consistently improves the BLEU scores compared with respective base models. Specifically, our best variant outperforms the autoregressive model on three translation tasks, while being 14.8 times more efficient in inference.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Non-autoregressive Translation with Layer-Wise Prediction and Deep Supervision

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Jun 28, 2022
Citations: 6

Similar Papers

Non-autoregressive neural machine translation with auxiliary representation fusion
Quan Du ... Tong Xiao
Journal of Intelligent & Fuzzy Systems | VOL. 41
Quan Du, et. al.Quan Du ... Tong Xiao
16 Dec 2021
Journal of Intelligent & Fuzzy Systems | VOL. 41

A Multitask-Based Neural Machine Translation Model with Part-of-Speech Tags Integration for Arabic Dialects
Laith H Baniata ... Seong-Bae Park
Applied Sciences | VOL. 8
Laith H Baniata, et. al.Laith H Baniata ... Seong-Bae Park
05 Dec 2018
Applied Sciences | VOL. 8

Multi-Source Neural Model for Machine Translation of Agglutinative Language
Yirong Pan ... Yating Yang
Future Internet | VOL. 12
Yirong Pan, et. al.Yirong Pan ... Yating Yang
03 Jun 2020
Future Internet | VOL. 12

Multilingual Neural Translation

-

14 Feb 2020
14 Feb 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Non-autoregressive Translation with Layer-Wise Prediction and Deep Supervision

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence