Abstract
In this age of information technology, it has become possible for people all over the world to communicate in different languages through social media platforms with the help of machine translation (MT) systems. As far as the Arabic-English language pair is concerned, most studies have been conducted on evaluating the MT output for the standard varieties of Arabic, with fewer studies focusing on the vernacular or colloquial varieties. This study attempts to address this gap through presenting an evaluation of the performance of MT output for vernacular or colloquial Arabic in the social media domain. As it is currently the most widely used MT system, Google Translate (GT) has been chosen for evaluating the reliability of its output in the context of translating the Arabic colloquial language (i.e., Egyptian/Cairene Arabic variety) used in social media into English. With this goal in mind, a corpus consisting of Egyptian dialectal Arabic sentences were collected from social media networks, i.e., Facebook and Twitter, and then fed into GT system. The GT output was then evaluated by three human translators to assess their accuracy of translation in terms of adequacy and fluency. The results of the study show that several translation problems have been spotted for GT output. These problems are mainly concerned with wrong equivalents, inappropriate additions and deletions, and transliteration for out-of-vocabulary (OOV) words, which are mostly due to the literal translation of the Arabic vernacular sentences into English. This can be due to the fact that Arabic vernacular varieties are different from the standard language for which MT systems have been basically developed. This, consequently, necessitates the need to upgrade such MT systems to deal with the vernacular varieties.
Highlights
This study explores Arabic social media texts to evaluate translation adequacy and fluency in order to reach a verdict on the system‟s reliability
This study will look at this harmony by seeing if the dialectal nature of social media works well with English on Google Translate
Several studies have been conducted on evaluating the performance of a number of machine translation (MT) systems, including Google Translate (GT), for the Arabic-English language pair
Summary
For many years writing used to be confined to Modern Standard Arabic (MSA) which was and still considered by some to be more prestigious [1]. With the development of social media networks and platforms, millions of users in the Arab world have been using their vernacular and colloquial dialects in expressing themselves. Social media networks have given legacy to the colloquial forms of Arabic [2]. Millions of Arab users are using these forms in reflecting on different issues and there is an increasing demand from institutions and individuals for translating a lot of this stuff. It is impossible for human translators to meet these translation needs of institutions and individuals
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: International Journal of Advanced Computer Science and Applications
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.