Abstract

In this paper, we present our work on a case study between statistical machine translation (SMT) and rule-based machine translation (RBMT) systems on English-Indian language and Indian to Indian language perspective. Main objective of our study is to make a five-way performance comparison: such as, (a) SMT and RBMT; (b) SMT on English–Indian language; (c) RBMT on English–Indian language; (d) SMT on Indian to Indian language perspective; (e) RBMT on Indian to Indian language perspective. Through a detailed analysis, we describe the rule-based and the statistical machine translation system developments and its evaluations. Further, with a detailed error analysis, we point out the relative strengths and weaknesses of both the systems. The observations based on our study are: (a) SMT systems outperform RBMT; (b) In the case of SMT: English to Indian language MT systems perform better than Indian to English language MT systems; (c) In the case of RBMT: English to Indian language MT systems perform better than Indian to English language MT systems; (d) SMT systems perform better for Indian to Indian language MT systems compared to RBMT. Effectively, we shall see that even with a small amount of training corpus SMT system has many advantages for high-quality domain-specific machine translation over that of a rule-based counterpart.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.