Abstract

What motivates an action in the absence of a definite reward? Taking the case of visuomotor control, we consider a minimal control problem: how to select the next saccade, in a sequence of discrete eye movements, when the final objective is to better interpret the current visual scene. The visual scene is modeled here as a partially observed environment, with a generative model explaining how the visual data are shaped by action. This makes it possible to interpret different action selection metrics proposed in the literature, including the Salience, the Infomax and the Variational Free Energy, under a single information-theoretic construct, namely the view-based Information Gain. Pursuing this analytic track, two original action selection metrics, named the Information Gain Lower Bound (IGLB) and the Information Gain Upper Bound (IGUB), are then proposed. Showing either a conservative or an optimistic bias with respect to the Information Gain, they strongly simplify its calculation. An original fovea-based visual scene decoding setup is then proposed, with numerical experiments highlighting different facets of artificial fovea-based vision. A first and principal result is that state-of-the-art recognition rates are obtained with fovea-based saccadic exploration, using less than 10% of the original image's data. These satisfactory results illustrate the advantage of combining predictive control with accurate state-of-the-art predictors, namely a deep neural network. A second result is the sub-optimality of some classical action-selection metrics widely used in the literature, which is not manifest with finely-tuned inference models but becomes patent when coarse or faulty models are used. Last, a computationally-effective predictive model is developed using the IGLB objective, with pre-processed visual scan-paths read out from memory, bypassing computationally-demanding predictive calculations. This last simplified setting is shown to be effective in our case, exhibiting both competitive accuracy and good robustness to model flaws.

Highlights

  • In complement to goal-oriented activity, animal motor control relates to the search for sensory cues that allow the animal to better interpret its sensory environment and improve action efficacy

  • We take advantage here of the viewpoint-based variational encoding setup to propose a new quantification of the mutual information shared across different sensory fields, locally estimated with a view-based Information Gain metric

  • View-based mutual information and information gain: the sharing of information between two sensory fields x|u and x′|u′ should be quantified by their Mutual Information, as written out below
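
For reference, and using the view-based notation above, this is the standard conditional mutual information between the two views (a textbook identity, stated here for illustration rather than as the paper's exact derivation):

    I(x; x′ | u, u′) = H(x | u, u′) − H(x | x′, u, u′)

i.e., the expected reduction in uncertainty about the view x gathered at viewpoint u once the view x′ gathered at viewpoint u′ is known. The view-based Information Gain metric mentioned above can then be read as a local, per-observation estimate of this shared information.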


Summary

INTRODUCTION

In complement to goal-oriented activity, animal motor control relates to the search for sensory cues that allow the animal to better interpret its sensory environment and improve action efficacy. The “maximum effect” principle encourages actions that are well discriminated, i.e., that have a visible effect on the sensors. This is formally quantified by the “empowerment” information gain objective (Klyubin et al., 2005; Tishby and Polani, 2011), or by more informal measures of surprise, like the “Salience” metric (Itti and Baldi, 2005), or the different “curiosity” metrics, like the ones proposed in Schmidhuber (1991), Oudeyer and Kaplan (2008), and Pathak et al. (2017). An actual implementation of a sequential fovea-based scene decoding setup is developed in section 3.2, making it possible to quantitatively compare those different metrics and to propose new avenues toward parsimonious active vision through computationally-effective model-based prediction.
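
As a concrete illustration of such a sequential decoding loop, the next saccade can be chosen greedily as the fixation that maximizes a predicted action-selection score, after which the foveal view is read and the belief over scene interpretations is updated. The sketch below is a minimal, one-step-ahead scheme given for illustration only; the function and variable names (predict_score, observe_patch, update_posterior, candidate_fixations) are hypothetical placeholders, not the paper's actual implementation.

    # Minimal sketch of a greedy, one-step-ahead saccade selection loop.
    # All names below (predict_score, observe_patch, update_posterior,
    # candidate_fixations) are hypothetical placeholders, not the paper's API.
    import numpy as np

    def decode_scene(posterior, candidate_fixations, predict_score,
                     observe_patch, update_posterior, n_saccades=10):
        """Select saccades one at a time, each maximizing a predicted
        action-selection metric, and refine the belief after each view."""
        scanpath = []
        for _ in range(n_saccades):
            # Score every candidate fixation under the current belief.
            scores = [predict_score(posterior, u) for u in candidate_fixations]
            # Greedy choice of the next fixation point.
            u_next = candidate_fixations[int(np.argmax(scores))]
            # Execute the saccade and read the low-bandwidth foveal view.
            patch = observe_patch(u_next)
            # Update the posterior over scene interpretations with the new view.
            posterior = update_posterior(posterior, u_next, patch)
            scanpath.append(u_next)
        return posterior, scanpath

In this sketch, any of the metrics mentioned above (Salience, curiosity, empowerment, or the view-based Information Gain and its bounds) would play the role of predict_score, which is what a quantitative comparison between metrics amounts to.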

PRINCIPLES AND METHODS
A Mixed Generative Model
Active Vision and Predictive Control
Accuracy-Based Action Selection
View-Based Information Gain Metrics
Fovea-Based Visual Scene Decoding
Metrics Comparison
CONCLUSION