Towards Measuring Fairness in AI: The Casual Conversations Dataset

Caner Hazirbas, Brian Dolhansky, Cristian Canton Ferrer, Joanna Bitton, Albert Gordo, Jacqueline Pan

doi:10.1109/tbiom.2021.3132237

Abstract

This paper introduces a novel dataset to help researchers evaluate their computer vision and audio models for accuracy across a diverse set of ages, genders, apparent skin tones and ambient lighting conditions. Our dataset is composed of 3,011 subjects and contains over 45,000 videos, with an average of 15 videos per person. The videos were recorded in multiple U.S. states with a diverse set of adults in various age, gender and apparent skin tone groups. A key feature is that each subject agreed to participate and to have their likeness used. Additionally, our age and gender annotations are provided by the subjects themselves. A group of trained annotators labeled the subjects' apparent skin tone using the Fitzpatrick skin type scale. Moreover, annotations for videos recorded in low ambient lighting are also provided. As an application to measure the robustness of predictions across certain attributes, we provide a comprehensive study on the top five winners of the DeepFake Detection Challenge (DFDC). Experimental evaluation shows that the winning models are less performant on some specific groups of people, such as subjects with darker skin tones, and thus may not generalize to all people. In addition, we also evaluate state-of-the-art apparent age and gender classification methods. Our experiments provide a thorough analysis of these models in terms of fair treatment of people from various backgrounds.
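The kind of per-group evaluation the abstract describes can be illustrated with a minimal sketch. This is not the authors' evaluation code: the record layout, the field names (`skin_type`, `label`, `prediction`), the grouping of Fitzpatrick types into `"I-II"` and `"V-VI"`, and the toy values are all assumptions made up for illustration. It simply computes accuracy separately for each value of an annotated attribute and reports the largest gap between groups.

```python
from collections import defaultdict


def accuracy_by_group(records, attribute):
    """Return {group_value: accuracy} for one annotated attribute.

    `records` is a list of dicts; each dict is assumed (hypothetically)
    to hold the attribute value, a ground-truth `label`, and a model
    `prediction`.
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for rec in records:
        group = rec[attribute]
        total[group] += 1
        correct[group] += int(rec["prediction"] == rec["label"])
    return {group: correct[group] / total[group] for group in total}


def max_accuracy_gap(per_group):
    """Difference between the best and worst per-group accuracy."""
    return max(per_group.values()) - min(per_group.values())


# Made-up toy records, not taken from the dataset.
records = [
    {"skin_type": "I-II", "label": 1, "prediction": 1},
    {"skin_type": "I-II", "label": 0, "prediction": 0},
    {"skin_type": "V-VI", "label": 1, "prediction": 0},
    {"skin_type": "V-VI", "label": 0, "prediction": 0},
]

per_group = accuracy_by_group(records, "skin_type")
gap = max_accuracy_gap(per_group)
print(per_group, gap)
```

A large gap between groups is the kind of signal the abstract refers to when it says a model may not generalize to all people. A real study would typically also report other metrics and group sizes alongside accuracy.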

Journal: IEEE Transactions on Biometrics, Behavior, and Identity Science	Publication Date: Jul 1, 2022
Citations: 27

Similar Papers

Casual Conversations: A dataset for measuring fairness in AI
Caner Hazirbas ... Albert Gordo
01 Jun 2021

Diagnostic Accuracy of Caries and Periapical Lesions on a Monitor with and without DICOM-GSDF Calibration Under Different Ambient Light Conditions.
Rangel Teles Freire ... John Nadson Andrade Pinho
Journal of Digital Imaging | VOL. 35
15 Feb 2022

P‐17.3: Visibility Research Based on Ambient Light Contrast
Ling Xintong ... Tian Fan
SID Symposium Digest of Technical Papers | VOL. 54
01 Apr 2023

Guest editors' introduction to the special section on graphical models in computer vision
J.M Rehg ... V Pavlovic
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 25
01 Jul 2003
