Mind Your Language: Abuse and Offense Detection for Code-Switched Languages

Raghav Kapoor,Roger Zimmermann,Yaman Kumar,Rajiv Ratn Shah,Kshitij Rajput,Ponnurangam Kumaraguru

doi:10.1609/aaai.v33i01.33019951

Mind Your Language: Abuse and Offense Detection for Code-Switched Languages

Raghav Kapoor, Roger Zimmermann + Show 4 more

Open Access

PDF Available

https://doi.org/10.1609/aaai.v33i01.33019951

Copy DOI

Export

Save

Cite

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Jul 17, 2019
Citations: 16

Affiliation: Netaji Subhas University of Technology, Indian Institute of Technology Delhi

#Abuse Detection #Apply Transfer Learning #Offense Detection #Code-Switched Languages #Mind Your #Multilingual Societies #Domain Of Text Classification #Indian Subcontinent #Domain Of Classification #Vocabulary Of Language

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

In multilingual societies like the Indian subcontinent, use of code-switched languages is much popular and convenient for the users. In this paper, we study offense and abuse detection in the code-switched pair of Hindi and English (i.e, Hinglish), the pair that is the most spoken. The task is made difficult due to non-fixed grammar, vocabulary, semantics and spellings of Hinglish language. We apply transfer learning and make a LSTM based model for hate speech classification. This model surpasses the performance shown by the current best models to establish itself as the state-of-the-art in the unexplored domain of Hinglish offensive text classification. We also release our model and the embeddings trained for research purposes.

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.

R Discovery Prime

Mind Your Language: Abuse and Offense Detection for Code-Switched Languages