Function words in statistical machine-translated Chinese and original Chinese: A study into the translationese of machine translation systems

Chen-li Kuo

doi:10.1093/llc/fqy050

Abstract

Statistical approaches have become the mainstream in machine translation (MT) because of their potential to produce less rigid and more natural translations than rule-based approaches. On closer examination, however, statistical machine-translated Chinese and original Chinese use function words differently, and these differences may be associated with translationese as discussed in translation studies. This article examines the distribution of Chinese function words in a comparable corpus of MTs and original Chinese texts extracted from Wikipedia. An attribute selection technique is used to identify which types of function words are significant in discriminating between statistical machine-translated Chinese and the original texts. The results show that statistical MT overuses the most frequent function words, even when alternatives exist. To improve the quality of the end product, developers of MT systems should pay close attention to modelling Chinese conjunctions and adverbial function words. The results also suggest that machine-translated Chinese shares some characteristics with human-translated texts, including normalization and source-language influence; however, machine-translated texts do not exhibit other characteristics of translationese, such as explicitation.
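The abstract does not say which attribute selection technique was used. As a rough illustration only, the sketch below assumes a chi-squared score over a 2x2 contingency table (word vs. all other tokens, MT corpus vs. original corpus) and ranks function words by how strongly their frequency separates the two corpora. The function-word list, the function names, and the token sequences are all hypothetical toy values, not data from the study.

```python
from collections import Counter

# Hypothetical, tiny function-word list; a real study would use a full inventory.
FUNCTION_WORDS = ["的", "和", "但是", "然而", "因此"]


def count_function_words(tokens):
    """Count occurrences of each listed function word in a token sequence."""
    counts = Counter(t for t in tokens if t in FUNCTION_WORDS)
    return {w: counts.get(w, 0) for w in FUNCTION_WORDS}


def chi_squared(a, b, c, d):
    """Chi-squared statistic for the 2x2 table [[a, b], [c, d]].

    Returns 0.0 when any marginal total is zero (the statistic is undefined).
    """
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        return 0.0
    return n * (a * d - b * c) ** 2 / denom


def rank_function_words(mt_tokens, orig_tokens):
    """Rank function words by chi-squared score, highest first.

    Assumption: chi-squared stands in for the unspecified attribute
    selection technique mentioned in the abstract.
    """
    mt_counts = count_function_words(mt_tokens)
    orig_counts = count_function_words(orig_tokens)
    scores = {}
    for w in FUNCTION_WORDS:
        a = mt_counts[w]               # word occurrences in MT text
        b = len(mt_tokens) - a         # all other MT tokens
        c = orig_counts[w]             # word occurrences in original text
        d = len(orig_tokens) - c       # all other original tokens
        scores[w] = chi_squared(a, b, c, d)
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    # Made-up, pre-segmented toy corpora ("X" stands for any content word).
    mt = ["的"] * 6 + ["和"] * 2 + ["X"] * 12
    orig = ["的"] * 2 + ["和"] * 2 + ["然而"] * 2 + ["X"] * 14
    for word, score in rank_function_words(mt, orig):
        print(word, round(score, 3))
```

In this toy setup a word that is equally frequent in both corpora scores zero, while a word that one corpus overuses or avoids scores higher. Real Chinese text would first need word segmentation, which the sketch skips by assuming pre-tokenized input.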

Journal: Digital Scholarship in the Humanities
Publication Date: Nov 28, 2018
Citations: 4

