Abstract

The increasingly popular application of AI runs the risk of amplifying social bias, such as classifying non-white faces as animals. Recent research has largely attributed this bias to the training data used. However, the underlying mechanism is poorly understood; therefore, strategies to rectify the bias remain unresolved. Here, we examined a typical deep convolutional neural network (DCNN), VGG-Face, which was trained on a face dataset containing more white faces than black and Asian faces. In a transfer-learning test, the network identified white faces significantly more accurately than other faces, mirroring the well-known social bias in humans, the other-race effect (ORE). To test whether this effect resulted from the imbalance of face images, we retrained VGG-Face on a dataset containing more Asian faces and found a reversed ORE: the newly trained network identified Asian faces more accurately than white faces. Additionally, when the numbers of Asian and white faces in the dataset were matched, the DCNN showed no bias. To further examine how imbalanced image input led to the ORE, we performed a representational similarity analysis on VGG-Face's activations. We found that when the dataset contained more white faces, the representation of white faces was more distinct, indexed by smaller in-group similarity and larger representational Euclidean distance. That is, white faces were scattered more sparsely in VGG-Face's representational face space than the other faces. Importantly, the distinctiveness of faces was positively correlated with identification accuracy, which explains the ORE observed in VGG-Face. In summary, our study revealed the mechanism underlying the ORE in DCNNs, which provides a novel approach to studying AI ethics. In addition, the multidimensional face-representation theory developed for humans also applied to DCNNs, encouraging future studies to apply more cognitive theories to understanding DCNNs' behavior.
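
As a minimal sketch of the two distinctiveness indices described above (in-group similarity and mean representational Euclidean distance), the Python snippet below computes both over toy activation vectors. The function name, array shapes, and synthetic data are illustrative assumptions, not the study's actual code.

```python
import numpy as np
from scipy.spatial.distance import pdist

def distinctiveness_metrics(activations):
    """Distinctiveness of one face group in a DCNN's representational space.

    activations: (n_faces, n_units) array, one row of layer activations
                 per face image (shapes here are illustrative).
    Returns (in_group_similarity, mean_distance): a more distinct group
    has lower pairwise similarity and larger pairwise Euclidean distance.
    """
    corr = np.corrcoef(activations)           # (n_faces, n_faces) row correlations
    iu = np.triu_indices_from(corr, k=1)      # keep each pair once, skip diagonal
    in_group_similarity = corr[iu].mean()
    mean_distance = pdist(activations, metric="euclidean").mean()
    return in_group_similarity, mean_distance

# Toy stand-ins for VGG-Face activations: a tightly clustered group
# (shared component plus small noise) versus a sparsely scattered one.
rng = np.random.default_rng(0)
shared = rng.normal(size=4096)
clustered = shared + rng.normal(scale=0.3, size=(100, 4096))
scattered = rng.normal(size=(100, 4096))
print(distinctiveness_metrics(clustered))   # high similarity, small distances
print(distinctiveness_metrics(scattered))   # low similarity, large distances
```

On the abstract's account, the group analogous to `scattered` corresponds to the over-represented race: its faces sit farther apart in the face space and are identified more accurately.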

Highlights

  • With enormous progress in artificial intelligence (AI), deep convolutional neural networks (DCNNs) have shown extraordinary performance in computer vision, natural language processing, and complex strategy video games

  • One influential theory of human face recognition, the face multidimensional representation space (MDS) theory, proposes that the other-race effect (ORE) comes from differences in how faces are represented in a multidimensional space, or “face space” (Valentine, 1991; Valentine et al., 2016; O’Toole et al., 2018)

  • As faces in the dataset were overwhelmingly white, the better identification accuracy for white faces suggested that the ORE existed in the VGG-Face

Introduction

With enormous progress in artificial intelligence (AI), deep convolutional neural networks (DCNNs) have shown extraordinary performance in computer vision, natural language processing, and complex strategy video games. Yet, like humans, DCNNs may exhibit the other-race effect (ORE): better identification of faces of the race most represented in their training experience. One influential theory of human face recognition, the face multidimensional representation space (MDS) theory, proposes that the ORE arises from differences in how faces are represented in a multidimensional space, or “face space” (Valentine, 1991; Valentine et al., 2016; O’Toole et al., 2018). According to this theory, the face space is a Euclidean multidimensional space whose dimensions represent facial features. We examine whether the ORE in DCNNs, if observed, may be accounted for by a similar mechanism.
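
To make the face-space account concrete, below is a minimal sketch of norm-based distinctiveness under the MDS theory: each face's Euclidean distance from the centroid ("norm") of the face space. The function name, dimensionality, and synthetic data are hypothetical illustrations, not the study's analysis pipeline.

```python
import numpy as np

def face_space_distinctiveness(face_vectors):
    """Distance of each face from the norm (centroid) of a Euclidean
    face space whose dimensions stand for facial features.

    face_vectors: (n_faces, n_dims) coordinates in the face space;
                  shapes and names here are purely illustrative.
    """
    norm_face = face_vectors.mean(axis=0)     # central tendency of the space
    return np.linalg.norm(face_vectors - norm_face, axis=1)

# Under this account, faces of an under-represented race cluster densely
# around their local center (small distances), while well-represented
# faces scatter widely (large distances) and are easier to tell apart.
rng = np.random.default_rng(1)
dense = rng.normal(scale=0.5, size=(50, 10))
sparse = rng.normal(scale=2.0, size=(50, 10))
print(face_space_distinctiveness(dense).mean())    # smaller on average
print(face_space_distinctiveness(sparse).mean())   # larger on average
```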
