Abstract
Microaggressions are subtle manifestations of bias (Breitfeller et al., 2019). These demonstrations of bias can often be classified as a subset of abusive language. However, relatively little attention has been devoted to recognizing such instances. As a result, only limited data is available on the topic, and only in English. Being able to detect microaggressions without the need for labeled data would be advantageous, since it would enable content moderation also for languages that lack annotated data. In this study, we introduce an unsupervised method to detect microaggressions in natural language expressions. The algorithm relies on pre-trained word embeddings, leveraging the bias encoded in the model to detect microaggressions in unseen textual instances. We test the method on a dataset of racial and gender-based microaggressions, reporting promising results. We further run the algorithm on out-of-domain unseen data with the purpose of bootstrapping corpora of microaggressions “in the wild”, perform a pilot experiment with prompt-based learning, and discuss the benefits and drawbacks of our proposed method.
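To illustrate the general idea of exploiting bias encoded in pre-trained word embeddings, the following is a minimal sketch of one common approach: estimating a bias direction from seed word pairs and scoring a sentence by how strongly its tokens project onto that direction. The seed pairs, the embedding model, and the scoring function are illustrative assumptions, not the paper's actual algorithm or parameters.

```python
# Hypothetical sketch: score a sentence for gender-bias "pull" using a
# bias direction computed in a pre-trained embedding space.
import numpy as np
import gensim.downloader as api

# Small pre-trained model chosen for illustration only.
kv = api.load("glove-wiki-gigaword-50")

# Bias direction estimated from a few gendered seed pairs (assumption).
seed_pairs = [("he", "she"), ("man", "woman"), ("his", "her")]
diffs = [kv[a] - kv[b] for a, b in seed_pairs]
bias_dir = np.mean(diffs, axis=0)
bias_dir /= np.linalg.norm(bias_dir)

def bias_score(sentence: str) -> float:
    """Mean absolute projection of the sentence's tokens onto the bias direction."""
    tokens = [t for t in sentence.lower().split() if t in kv]
    if not tokens:
        return 0.0
    projections = [abs(np.dot(kv[t] / np.linalg.norm(kv[t]), bias_dir)) for t in tokens]
    return float(np.mean(projections))

# Sentences scoring above a (hypothetical) threshold would be flagged for review.
print(bias_score("women are too emotional to lead"))
print(bias_score("the meeting starts at noon"))
```

In such a setup, a higher score would only signal that a sentence leans on a biased region of the embedding space; it is a weak, unsupervised signal rather than a definitive label.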