Deep Neural Networks and Visuo-Semantic Models Explain Complementary Components of Human Ventral-Stream Representational Dynamics.

Kamila M Jozwik,Tim C Kietzmann,Radoslaw M Cichy,Nikolaus Kriegeskorte,Marieke Mur

doi:10.1523/jneurosci.1424-22.2022

Abstract

Deep neural networks (DNNs) are promising models of the cortical computations supporting human object recognition. However, despite their ability to explain a significant portion of variance in neural data, the agreement between models and brain representational dynamics is far from perfect. We address this issue by asking which representational features are currently unaccounted for in neural time series data, estimated for multiple areas of the ventral stream via source-reconstructed magnetoencephalography data acquired in human participants (nine females, six males) during object viewing. We focus on the ability of visuo-semantic models, consisting of human-generated labels of object features and categories, to explain variance beyond the explanatory power of DNNs alone. We report a gradual reversal in the relative importance of DNN versus visuo-semantic features as ventral-stream object representations unfold over space and time. Although lower-level visual areas are better explained by DNN features starting early in time (at 66 ms after stimulus onset), higher-level cortical dynamics are best accounted for by visuo-semantic features starting later in time (at 146 ms after stimulus onset). Among the visuo-semantic features, object parts and basic categories drive the advantage over DNNs. These results show that a significant component of the variance unexplained by DNNs in higher-level cortical dynamics is structured and can be explained by readily nameable aspects of the objects. We conclude that current DNNs fail to fully capture dynamic representations in higher-level human visual cortex and suggest a path toward more accurate models of ventral-stream computations.SIGNIFICANCE STATEMENT When we view objects such as faces and cars in our visual environment, their neural representations dynamically unfold over time at a millisecond scale. These dynamics reflect the cortical computations that support fast and robust object recognition. DNNs have emerged as a promising framework for modeling these computations but cannot yet fully account for the neural dynamics. Using magnetoencephalography data acquired in human observers during object viewing, we show that readily nameable aspects of objects, such as 'eye', 'wheel', and 'face', can account for variance in the neural dynamics over and above DNNs. These findings suggest that DNNs and humans may in part rely on different object features for visual recognition and provide guidelines for model improvement.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: The Journal of neuroscience : the official journal of the Society for Neuroscience	Publication Date: Feb 9, 2023
Citations: 10	License type: CC BY-NC-SA 4.0

R Discovery Prime

R Discovery Prime

Deep Neural Networks and Visuo-Semantic Models Explain Complementary Components of Human Ventral-Stream Representational Dynamics.

Abstract

Talk to us

Similar Papers

More From: The Journal of neuroscience : the official journal of the Society for Neuroscience

Lead the way for us

Similar Papers

Growing random forest on deep convolutional neural networks for scene categorization
Shuang Bai
Expert Systems with Applications | VOL. 71
Shuang BaiShuang Bai
17 Oct 2016
Expert Systems with Applications | VOL. 71

Integrative Benchmarking to Advance Neurally Mechanistic Models of Human Intelligence
Martin Schrimpf ... James J Dicarlo
Neuron | VOL. 108
Martin Schrimpf, et. al.Martin Schrimpf ... James J Dicarlo
11 Sep 2020
Neuron | VOL. 108

High-Level Visual Encoding Model Framework with Hierarchical Ventral Stream-Optimized Neural Networks.
Wulue Xiao ... Chi Zhang
Brain sciences | VOL. 12
Wulue Xiao, et. al.Wulue Xiao ... Chi Zhang
19 Aug 2022
Brain sciences | VOL. 12

Cancer detection in breast cells using a hybrid method based on deep complex neural network and data mining.
Ling Yang ... Rebaz Othman Yahya
Journal of cancer research and clinical oncology | VOL. 149
Ling Yang, et. al.Ling Yang ... Rebaz Othman Yahya
24 Jul 2023
Journal of cancer research and clinical oncology | VOL. 149

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Neural Networks and Visuo-Semantic Models Explain Complementary Components of Human Ventral-Stream Representational Dynamics.

Abstract

Talk to us

Similar Papers

More From: The Journal of neuroscience : the official journal of the Society for Neuroscience