Word-Level Quality Estimation for Korean-English Neural Machine Translation

Sugyeong Eo,Hyeonseok Moon,Chanjun Park,Heuiseok Lim,Jaehyung Seo

doi:10.1109/access.2022.3169155

Abstract

Quality estimation (QE) task aims to predict the machine translation (MT) quality well by referring to the source sentence and its MT output. The various applicability of QE proves the importance of QE research, but the enormous human labor to construct the QE dataset remains a challenge. This study proposes three automatic word-level pseudo-QE data construction strategies using a monolingual or parallel corpus and an external machine translator without human labor. We utilize these individual pseudo-QE datasets to finetune multilingual pretrained language models such as cross-lingual language models (XLM), XLM-RoBERTa, and multilingual BART and comparatively analyze the results. Considering the synthetic dataset creation setup, we attempt to validate the objectivity of the QE model by leveraging four test sets translated by external translators from Google, Amazon, Microsoft, and Systran. As a result, XLM-R-large shows the best performance among mPLMs. We also verify the reliability of the QE model through the close performance gaps between different test sets. To the best of our knowledge, this is the first study to experiment with word-level Korean-English QE.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2022
Citations: 3	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Word-Level Quality Estimation for Korean-English Neural Machine Translation

Abstract

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Toward a Cognitive Evaluation Approach for Machine Translation PostEditing
Wajdi Zaghouani ... Irina Temnikova
-
Wajdi Zaghouani, et. al.Wajdi Zaghouani ... Irina Temnikova
01 Jan 2018
01 Jan 2018

Uniformly Interpolated Balancing for Robust Prediction in Translation Quality Estimation
Hyun Kim ... Seung-Hoon Na
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 19
Hyun Kim, et. al.Hyun Kim ... Seung-Hoon Na
19 Jan 2020
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 19

Investigating the Helpfulness of Word-Level Quality Estimation for Post-Editing Machine Translation Output
Raksha Shenoy ... Nico Herbig
-
Raksha Shenoy, et. al.Raksha Shenoy ... Nico Herbig
01 Jan 2020
01 Jan 2020

Investigating the Helpfulness of Word-Level Quality Estimation for Post-Editing Machine Translation Output
...
-
, et. al. ...
21 Oct 2021
21 Oct 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Word-Level Quality Estimation for Korean-English Neural Machine Translation

Abstract

Talk to us

Similar Papers

More From: IEEE Access