Abstract
When we open our eyes, we do not see a jumble of light or colorful patterns. A great distance separates the raw inputs sensed at our retinas from what we experience as the contents of our perception. How does the brain transform incoming sensory inputs into rich, discrete structures that we can think about and plan with? These "world models" include representations of objects with kinematic and dynamical properties, scenes with navigational affordances, and events with temporally demarcated dynamics. Real-world scenes are complex, but given a momentary task, only a fraction of this complexity is relevant to the observer. Attention allows us to form these world models selectively, as task-driven, simulatable state spaces that drive flexible action. How, in the mind and brain, do we build and use such internal models of the world from raw visual inputs? In this talk, I will begin to address this question by presenting two new computational modeling frameworks. First, in high-level vision, I will show how we can reverse-engineer population-level neural activity in the macaque visual cortex in the language of three-dimensional objects and computer graphics, by combining generative models with deep neural networks. Second, I will present a novel account of attention based on adaptive computation that situates vision in the broader context of an agent with goals, and show how it explains internal representations and implicit goals underlying the selectivity of scene perception.
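To make the first framework concrete, the sketch below is a minimal toy illustration of the general generate-and-compare ("analysis by synthesis") idea the abstract alludes to: a graphics-like generative model renders candidate scenes from 3D latents, a stand-in feature function plays the role of a deep network's representation, and inference searches for latents whose renders match the target in that feature space. Every function and parameter here (`render`, `features`, the hill-climbing loop) is a hypothetical stand-in for exposition, not the speaker's actual models.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical "graphics" generative model: renders a coarse image from a
# toy 3D scene description (object position x, y and size r).
def render(latents, size=16):
    x, y, r = latents
    yy, xx = np.mgrid[0:size, 0:size]
    return (((xx - x) ** 2 + (yy - y) ** 2) < r ** 2).astype(float)

# Stand-in for a deep network's feature space; a real model would compare
# renders to neural population data in a learned representation.
def features(img):
    return np.array([img.mean(), img[:8].mean(), img[:, :8].mean()])

def score(latents, target_feats):
    # Negative squared error between rendered and target features.
    return -np.sum((features(render(latents)) - target_feats) ** 2)

# Target scene that inference must recover from features alone.
true_latents = np.array([10.0, 6.0, 4.0])
target_feats = features(render(true_latents))

# Generate-and-compare inference: propose perturbed scene latents and keep
# those whose renders better match the target in feature space.
latents = np.array([8.0, 8.0, 3.0])
for step in range(2000):
    proposal = latents + rng.normal(scale=0.5, size=3)
    if score(proposal, target_feats) > score(latents, target_feats):
        latents = proposal

print("recovered latents:", np.round(latents, 2))  # approaches [10, 6, 4]
```

In a full model, the random-perturbation search would be replaced by a trained recognition network or MCMC over scene latents; the sketch only shows the inference-by-rendering loop itself.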