Abstract
Social networking platforms gained widespread popularity and are used for various activities like: promoting products, sharing news, achievements and many more. On the other hand, it is also used for spreading rumors, bullying people, and abusing certain groups of people with hateful words. The hate and offensive posts must be detected and removed as early as possible from the social platforms because such posts are spread very quickly and tend to have a lot of negative impacts on human beings. In the last few years, offensive content and hate speech detection has become popular topic of research. Detecting hate speech on social platforms has many challenges, one of them being the use of code-mixed language. Majority of the social media users usually post their messages in code-mixed languages such as Hindi–English, Tamil–English, Malayalam–English, Telugu–English and others. In this exhaustive study, we explore and compare the use of various machine learning and deep learning approaches. An ensemble model by combining the outcomes of transformer and deep learning-based models is suggested to detect hate speech and offensive language on social networking platforms. The experimental outcomes of the proposed weighted ensemble framework outperformed state-of-the-art models by achieving 0.802 and 0.933 weighted F1-score for Malayalam and Tamil code-mixed datasets.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.