Abstract

In recent years, social media platforms, especially Facebook, have seen massive growth in regular posts and their related comments. Users are free to post and comment in any language, but there is no explicit mechanism for reconciling information expressed in different languages into a useful dataset. As a result, in most cases, Facebook content expressed in different languages goes unused. This paper explains the motivation behind multilingual dataset creation and proposes a framework for it. It also illustrates the challenges associated with dataset generation, such as separating multilingual data, and presents the consequences these challenges have for multilingual dataset creation. The contribution of this research is therefore the creation of a multilingual dataset using the proposed framework, together with a practical account of the loopholes and consequences encountered.
