The attentive reconstruction of objects facilitates robust object recognition.

Seoyoung Ahn, Hossein Adeli, Gregory J Zelinsky

doi:10.1371/journal.pcbi.1012159

Abstract

Humans are extremely robust in our ability to perceive and recognize objects: we see faces in tea stains and can recognize friends on dark streets. Yet neurocomputational models of primate object recognition have focused on the initial feedforward pass of processing through the ventral stream and less on the top-down feedback that likely underlies robust object perception and recognition. Aligned with the generative approach, we propose that the visual system actively facilitates recognition by reconstructing the object hypothesized to be in the image. Top-down attention then uses this reconstruction as a template to bias feedforward processing to align with the most plausible object hypothesis. Building on auto-encoder neural networks, our model makes detailed hypotheses about the appearance and location of the candidate objects in the image by reconstructing a complete object representation from visual input that may be incomplete due to noise and occlusion. The model then leverages the best object reconstruction, measured by reconstruction error, to direct the bottom-up process of selectively routing low-level features, a top-down biasing that captures a core function of attention. We evaluated our model using the MNIST-C (handwritten digits under corruptions) and ImageNet-C (real-world objects under corruptions) datasets. Not only did our model achieve superior performance on these challenging tasks, which are designed to approximate real-world viewing conditions with noise and occlusion, but it also better accounted for human behavioral reaction times and error patterns than a standard feedforward Convolutional Neural Network. Our model suggests that a complete understanding of object perception and recognition requires integrating top-down and attention feedback, which we propose is an object reconstruction.
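The loop described in the abstract (reconstruct each candidate object, keep the hypothesis with the lowest reconstruction error, and use that reconstruction to bias the input) can be illustrated with a toy sketch. This is not the authors' model: the auto-encoder decoders are replaced here by a least-squares fit of fixed class prototypes, and the multiplicative mask, the function names `reconstruct` and `attend_and_recognize`, and the `n_iters` parameter are all assumptions made for illustration.

```python
import numpy as np


def reconstruct(image, prototype):
    """Hypothetical stand-in for a decoder: fit a stored class prototype
    to the image by a non-negative least-squares scale factor."""
    p = prototype.ravel()
    scale = float(image.ravel() @ p) / float(p @ p)
    return max(scale, 0.0) * prototype


def attend_and_recognize(image, prototypes, n_iters=3):
    """Pick the object hypothesis with the lowest reconstruction error and
    use its reconstruction as a top-down mask on the original input."""
    attended = image.copy()
    for _ in range(n_iters):
        # One reconstruction per object hypothesis.
        recons = [reconstruct(attended, p) for p in prototypes]
        # Mean squared reconstruction error for each hypothesis.
        errors = np.array([np.mean((attended - r) ** 2) for r in recons])
        best = int(np.argmin(errors))
        # Assumed form of top-down biasing: normalize the best reconstruction
        # to [0, 1] and multiply it into the original image, which suppresses
        # features the winning hypothesis does not explain.
        mask = recons[best] / (recons[best].max() + 1e-8)
        attended = image * mask
    return best, errors, attended


# Toy usage: a vertical bar, partly occluded, with one clutter pixel.
vertical = np.zeros((5, 5))
vertical[:, 2] = 1.0
horizontal = np.zeros((5, 5))
horizontal[2, :] = 1.0

image = vertical.copy()
image[3:, 2] = 0.0   # occlude the lower part of the bar
image[0, 0] = 0.5    # clutter outside the object

best, errors, attended = attend_and_recognize(image, [vertical, horizontal])
print(best, errors)
```

In this toy case the vertical-bar hypothesis gives the lower error, and the mask built from its reconstruction removes the clutter pixel from the attended input. A learned decoder would also have to infer object location and appearance, which this sketch does not attempt.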

Journal: PLoS computational biology
Publication Date: Jun 1, 2024
License type: cc-by

Similar Papers

Multimodal deep learning for robust RGB-D object recognition
Andreas Eitel ... Luciano Spinello
01 Sep 2015

Decision letter: Causal neural mechanisms of context-based object recognition
Redmond G O'Connell ... Joshua I Gold
03 Jun 2021

Does training with blurred images bring convolutional neural networks closer to humans with respect to robust object recognition and internal representations?
Sou Yoshihara ... Shin'Ya Nishida
Frontiers in psychology | VOL. 14
15 Feb 2023

Robust deep learning object recognition models rely on low frequency information in natural images.
Zhe Li ... Xaq Pitkow
PLoS computational biology | VOL. 19
27 Mar 2023
