Abstract
With the increase of web use in Morocco today, Internet has become an important source of information. Specifically, across social media, Moroccan people use several languages in their communication leaving behind unstructured user-generated text that present several opportunities for Natural Language Processing. Among languages found in this data, Moroccan Dialectal Arabic stands with an important content and several features. In this paper, we investigate online written text generated by Moroccan users in social media with an emphasis on Moroccan Dialectal Arabic. For this purpose, we follow several steps, using some tools such as a language identification system, in order to conduct a deep study of this data. The most interesting findings that have emerged is the use of code switching, multi-script and low amount of words in Moroccan UGT text.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.