Abstract

Terminology translation plays a key role in technical machine translation. This paper conducts quantitative and qualitative evaluation and presents a comprehensive error analysis on terminology translation from Korean to English. For this study, 637 terms are selected from a patent domain corpus. Among them, 425 terms are translated correctly while other 212 terms have one or more errors. Those identified errors are classified into seven error categories which include addition, omission, lexical errors, order errors, morphological errors, partial errors, and miscellanies. Among them, partial errors and morphological errors are the two top error categories, and lexical errors and omission cases are also high. The study also presents n-gram wise error rates. The calculation of n-gram wise terminology translation error rates shows that the curve for the Google Translation results goes upwards with the increasing size of n. This shows that neural machine translation can be vulnerable in translating a higher-order of n-gram terms. This study has implications for Korean-English machine translation research, in that it classifies error cases spotted in machine translation of terminology which is critical in securing accuracy of translations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.