Efficient inverse graphics in biological face processing.

Ilker Yildirim, Mario Belledonne, Winrich Freiwald, Josh Tenenbaum

doi:10.1126/sciadv.aax5979

Open Access

https://doi.org/10.1126/sciadv.aax5979

Journal: Science Advances
Publication Date: Mar 4, 2020
Citations: 73
License type: CC BY-NC

Affiliation: Yale University, Rockefeller University

Abstract

Vision not only detects and recognizes objects, but performs rich inferences about the underlying scene structure that causes the patterns of light we see. Inverting generative models, or "analysis-by-synthesis", presents a possible solution, but its mechanistic implementations have typically been too slow for online perception, and their mapping to neural circuits remains unclear. Here we present a neurally plausible efficient inverse graphics model and test it in the domain of face recognition. The model is based on a deep neural network that learns to invert a three-dimensional face graphics program in a single fast feedforward pass. It explains human behavior qualitatively and quantitatively, including the classic "hollow face" illusion, and it maps directly onto a specialized face-processing circuit in the primate brain. The model fits both behavioral and neural data better than state-of-the-art computer vision models, and suggests an interpretable reverse-engineering account of how the brain transforms images into percepts.

Full Text