Detecting Source Contextual Barriers for Understanding Neural Machine Translation

Guanlin Li,Rui Wang,Tiejun Zhao,Conghui Zhu,Lemao Liu,Shuming Shi

doi:10.1109/taslp.2021.3085119

Abstract

In machine translation evaluation, the traditional wisdom measures model's generalization ability in an average sense, for example by using corpus BLEU. However, the statistics of corpus BLEU cannot provide comprehensive understanding and fine-grained analysis on model's generalization ability. As a remedy, this paper attempts to understand NMT at fine-grained level, by detecting contextual barriers within an unseen input sentence that cause the degradation in model's translation quality. It proposes a principled definition of source contextual barriers as well as its modified version which is tractable in computation and operates at word-level. Based on the modified one, three simple methods are proposed for barrier detection by search-aware risk estimation through counterfactual generation. Extensive analyses are conducted on those detected contextual barrier words on both Zh $\Leftrightarrow$ En NIST benchmarks. Potential usages motivated from barrier words are also discussed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing	Publication Date: Jan 1, 2021
Citations: 1	License type: publisher-specific, author manuscript

R Discovery Prime

R Discovery Prime

Detecting Source Contextual Barriers for Understanding Neural Machine Translation

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing

Lead the way for us

Similar Papers

A Naïve Automatic MT Evaluation Method without Reference Translations
Junjie Jiang ... Youfang Lin
-
Junjie Jiang, et. al.Junjie Jiang ... Youfang Lin
01 Jan 2010
01 Jan 2010

Multichannel Adaptive Data Mixture Augmentation for Graph Neural Networks
Zhonglin Ye ... Haixing Zhao
International Journal of Data Warehousing and Mining | VOL. 20
Zhonglin Ye, et. al.Zhonglin Ye ... Haixing Zhao
05 Aug 2024
International Journal of Data Warehousing and Mining | VOL. 20

기계번역 프로그램 품질에 대한 사용자 평가와 사용자의 L2 수준 간 상관관계 고찰 - 한중 언어 쌍을 중심으로 -
Kong Su
Interpretation and Translation | VOL. 21
Kong SuKong Su
10 Dec 2019
Interpretation and Translation | VOL. 21

Constructing the Evaluation Index System of Chinese-Portuguese Machine Translation using the Delphi and Analytic Hierarchy Process Methods
Yuqi Sun ... Sio Kei Im
-
Yuqi Sun, et. al.Yuqi Sun ... Sio Kei Im
29 Jan 2023
29 Jan 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Detecting Source Contextual Barriers for Understanding Neural Machine Translation

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing