Abstract

Perspective is a publicly available machine learning API that scores text for toxicity. It is available for use by online platforms and communities to limit toxicity and promote civil dialogue. In this work, we adopt a human-centered approach to evaluating Perspective by investigating whether human ratings of toxicity align with Perspective’s toxicity scores. We also test its transferability by making this comparison for comments from three platforms that have different commenting styles and moderation strategies: news websites, YouTube, and Twitter. Apart from toxicity, the main attribute, we collect participant ratings for three additional attributes: respectfulness, formality, and presence of stereotypes. While disrespect is part of how Perspective defines toxicity, formality and presence of stereotypes were included in the study to explore whether they could be hidden, latent attributes that affect toxicity scores from Perspective. We analyze how participant ratings for these additional attributes vary with respect to Perspective’s toxicity score for comments from each platform. We find that for high toxicity scores, Perspective strongly aligns with participant ratings of toxicity and disrespectfulness across all three platforms, providing weak evidence of its transferability. However, our evaluation also surfaces formality and presence of stereotypes as latent attributes that are unrecognized parts of Perspective’s scores. We discuss how and why this evaluation is “human-centered,” the importance of conducting such evaluations, and the implications of these results for content moderation on social platforms.
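For readers unfamiliar with Perspective, the sketch below illustrates how a single comment might be scored through the API’s publicly documented comments:analyze endpoint, the kind of request that produces the toxicity scores evaluated in this work. The API key value and the sample comment are hypothetical placeholders; the endpoint, request shape, and response fields follow Perspective’s published interface, but this is an illustrative sketch rather than code from the study.

```python
import requests

# Hypothetical placeholder: a real key is issued through Google Cloud.
API_KEY = "YOUR_API_KEY"

# Perspective's comment-analysis endpoint (Comment Analyzer API).
URL = "https://commentanalyzer.googleapis.com/v1alpha1/comments:analyze"

def toxicity_score(text: str) -> float:
    """Return Perspective's summary TOXICITY score (0.0 to 1.0) for `text`."""
    payload = {
        "comment": {"text": text},
        "requestedAttributes": {"TOXICITY": {}},
    }
    response = requests.post(URL, params={"key": API_KEY}, json=payload)
    response.raise_for_status()
    return response.json()["attributeScores"]["TOXICITY"]["summaryScore"]["value"]

# Hypothetical example comment; an overtly hostile comment should score near 1.0.
print(toxicity_score("You are an idiot and everyone hates you."))
```

The returned summary score is framed by Perspective as the probability that a reader would perceive the comment as toxic, which is the quantity compared against participant ratings here.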
