Abstract

Our ability to perceive a stable visual world in the presence of continuous movements of the body, head, and eyes has long puzzled neuroscientists. We reformulated this problem in the context of hierarchical convolutional neural networks (CNNs)—whose architectures have been inspired by the hierarchical signal processing of the mammalian visual system—and examined perceptual stability as an optimization process that identifies image-defining features for accurate image classification in the presence of movements. Movement signals, multiplexed with visual inputs along overlapping convolutional layers, aided classification invariance for shifted images by making the classification faster to learn and more robust to input noise. Classification invariance was reflected in activity manifolds associated with image categories emerging in late CNN layers, and in network units acquiring movement-associated activity modulations like those observed experimentally during saccadic eye movements. Our findings provide a computational framework that unifies a multitude of biological observations on perceptual stability under optimality principles for image classification in artificial neural networks.
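As an illustration of the architecture described above, the following sketch (in PyTorch) shows one way a movement signal could be multiplexed with visual inputs along convolutional layers of a classifier trained on shifted images. The layer sizes, the 2-D shift vector, and the choice to broadcast and concatenate it onto intermediate feature maps are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch (not the authors' code): a small CNN classifier in which a
# 2-D movement signal (the shift applied to the input image) is broadcast and
# concatenated to intermediate convolutional feature maps, so that "what"
# (image category) and "where" (self-movement) signals share layers.
import torch
import torch.nn as nn


class MovementMultiplexedCNN(nn.Module):
    def __init__(self, n_classes: int = 10):
        super().__init__()
        self.conv1 = nn.Conv2d(1, 16, kernel_size=3, padding=1)
        # +2 input channels: the (dx, dy) movement signal broadcast over space.
        self.conv2 = nn.Conv2d(16 + 2, 32, kernel_size=3, padding=1)
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.readout = nn.Linear(32, n_classes)

    def forward(self, image: torch.Tensor, movement: torch.Tensor) -> torch.Tensor:
        # image: (B, 1, H, W); movement: (B, 2) shift that generated this view.
        h = torch.relu(self.conv1(image))
        b, _, height, width = h.shape
        # Broadcast the movement vector into two constant feature maps and
        # concatenate them with the visual features (multiplexing).
        mov_maps = movement.view(b, 2, 1, 1).expand(b, 2, height, width)
        h = torch.relu(self.conv2(torch.cat([h, mov_maps], dim=1)))
        return self.readout(self.pool(h).flatten(1))


# Usage: classify randomly shifted images while providing the shift as input.
model = MovementMultiplexedCNN()
shifted_images = torch.randn(8, 1, 28, 28)         # placeholder shifted views
movements = torch.randint(-3, 4, (8, 2)).float()   # (dx, dy) per sample
logits = model(shifted_images, movements)           # (8, 10) class scores
```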

Highlights

  • When you read this paper while sitting still at your desk, unperceived head and body adjustments, along with continuous eye movements—fixational eye movements [1]—jitter the visual image across arrays of photoreceptors in the retinas of the eyes

  • We explore the hypothesis that perception equates to the activity states of networks trained to classify “features” in the visual scene, and perceptual stability equates to robust classification of these features relative to self-generated movements, that is, a “what” type of information processing

  • We demonstrate in convolutional neural networks (CNNs) that neural signals related to eye and body movements support accurate image classification by making “where” type of computations—localization invariances—faster to learn and more robust to input perturbations



Introduction

When you read this paper while sitting still at your desk, unperceived head and body adjustments, along with continuous eye movements—fixational eye movements [1]—jitter the visual image across arrays of photoreceptors in the retinas of the eyes. One line of modeling work has linked the ability to accurately recognize objects during movements—which could support perceptual stability—to invariances for translations, rotations, and expansions learned directly from the statistics of the visual inputs. This class of models, e.g., unsupervised temporal learning models [7,8] and slow feature analysis models [9–14], has found supporting evidence in psychophysical [15,16] and physiological studies [7,17,18] and has inspired deep learning approaches that use unsupervised rules to learn coherent visual representations in the presence of moving stimuli, e.g., contrastive embedding [19,20]. These models are agnostic with respect to whether retinal activations are due to objects moving in the environment or to movements of the organism, with the latter characteristically defining the phenomenon of perceptual stability. Another line of work has hypothesized that extra-retinal signals produced during body movements, corollary discharges [21–24], could be used by brain networks for perceptual stabilization when retinal activations are due to movements of the eyes, head, and body, without affecting the percept of movement during changes in the environment.
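To make the idea of learning invariances directly from input statistics concrete, below is a minimal slowness-style objective in PyTorch, loosely in the spirit of the unsupervised temporal learning and slow feature analysis models cited above. The encoder, the variance term used to prevent collapse, and the one-pixel shift are illustrative assumptions, not any of the specific cited models.

```python
# Minimal sketch: an encoder is penalized when its representation changes
# between consecutive frames of the same (moving) stimulus, encouraging
# features that stay stable under small image shifts.
import torch
import torch.nn as nn

encoder = nn.Sequential(
    nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(16, 8),
)

def slowness_loss(frame_t: torch.Tensor, frame_t1: torch.Tensor) -> torch.Tensor:
    z_t, z_t1 = encoder(frame_t), encoder(frame_t1)
    slow = ((z_t1 - z_t) ** 2).mean()                      # penalize fast-changing features
    spread = (1.0 - z_t.var(dim=0)).clamp(min=0).mean()    # keep features from collapsing to a constant
    return slow + spread

# Usage on two consecutive, slightly shifted views of the same scene.
frames_t = torch.randn(8, 1, 28, 28)
frames_t1 = torch.roll(frames_t, shifts=1, dims=-1)        # a one-pixel horizontal shift
loss = slowness_loss(frames_t, frames_t1)
loss.backward()
```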

