A Novel Part-of-Speech Set Developing Method for Statistical Machine Translation

Herry Sujaini,Ayu Purwarianti,Kuspriyanto Kuspriyanto,Arry Akhmad Arman

doi:10.12928/telkomnika.v12i3.79

Abstract

Part of speech (PoS) is one of the features that can be used to improve the quality of statistical-based machine translation. Typically, the language PoS determined based grammar of the language or adopt from other languages PoS. This work aims to formulate a model to developing PoS as linguistic factors to improve the quality of machine translation automatically. The research method using word similarity approach, where we perform clustering of the words contained in a corpus. Further classes will be defined as PoS set obtained for a given language.We evaluated the results of the PoS that defined computational results using machine translation system MOSES as the system by comparing the results of the SMT are using PoS sets generated manually, while the assessment of the system using BLEU method. Language that will be used for evaluation is English as the source language and Indonesian as the target language.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: TELKOMNIKA (Telecommunication Computing Electronics and Control)	Publication Date: Sep 1, 2014
Citations: 19	License type: cc-by-sa

R Discovery Prime

R Discovery Prime

A Novel Part-of-Speech Set Developing Method for Statistical Machine Translation

Abstract

Talk to us

Similar Papers

More From: TELKOMNIKA (Telecommunication Computing Electronics and Control)

Lead the way for us

Similar Papers

Machine translation of standardised medical terminology using natural language processing: A scoping review
Richard Noll ... Jannik Schaaf
New Biotechnology | VOL. 77
Richard Noll, et. al.Richard Noll ... Jannik Schaaf
29 Aug 2023
New Biotechnology | VOL. 77

Summarizing machine translation text: An English-Arabic case study
Houda Bouamor ... Kemal Oflazer
-
Houda Bouamor, et. al.Houda Bouamor ... Kemal Oflazer
01 Jan 2013
01 Jan 2013

Maintaining Sentiment Polarity in Translation of User-Generated Content
Pintu Lohar ... Haithem Afli
The Prague Bulletin of Mathematical Linguistics | VOL. 108
Pintu Lohar, et. al.Pintu Lohar ... Haithem Afli
01 Jun 2017
The Prague Bulletin of Mathematical Linguistics | VOL. 108

Adaptation in Statistical Machine Translation for Low-resource Domains in English-Vietnamese Language
Nghia-Luan Pham ... Van-Vinh Nguyen
VNU Journal of Science: Computer Science and Communication Engineering | VOL. 36
Nghia-Luan Pham, et. al.Nghia-Luan Pham ... Van-Vinh Nguyen
30 May 2020
VNU Journal of Science: Computer Science and Communication Engineering | VOL. 36

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Novel Part-of-Speech Set Developing Method for Statistical Machine Translation

Abstract

Talk to us

Similar Papers

More From: TELKOMNIKA (Telecommunication Computing Electronics and Control)