Contextual working memory for trans-saccadic object recognition using reinforcement learning and informative local descriptors

Lucas Paletta ,Christin Seifert ,Gerald Fritz

doi:10.1068/v050402

Abstract

Previous research on behavioural modelling of saccade-driven image interpretation (Henderson, 1982 Psychological Science 8 51 ^ 55) has emphasised the sampling of informative parts under visual attention to guide visual perception. We propose two major innovations to trans-saccadic object recognition: first, we model contextual tuning at the early visual processing stage. Salience in pre-processing is determined from descriptors in terms of local gradient histogram patterns - SIFT features (Lowe, 2004 International Journal of Computer Vision 60 91 ^ 110). SIFT features are scale-, rotation-, and to a high degree illumination-tolerant, in a substantial extension to previously used edge features (Rybak et al, 1998 Vision Research 38 2387 ^ 2400) or appearance patterns (Paletta et al, 2004 Perception 33 Supplement, 126). Descriptors that are informative with respect to an information theoretic framework (Fritz et al, 2004, in Proceedings of the International Conference on Pattern Recognition volume 2, pp 15 ^ 18) are selected and weighted according to contextual salience. Second, we develop a behavioural strategy for saccade-driven information access, operating on contextually selected features and attention shifts, being performed in terms of a partially observable Markovian decision process and represented by a short-term working memory generating discriminative perception ^ action sequences. It is developed under exploration and reinforcement feedback using Q-learning, a machine-learning methodology representing operant conditioning. Saccadic targets are selected for attention only in a local neighbourhood of a currently focused descriptor. The learned strategy proposes next actions that support expected maximisation of reward, eg minimisation of entropy in posterior object discrimination. We demonstrate the performance of using the sensory ^motor context of trans-saccadic outdoor object recognition, efficiently identifying building facades from different viewpoints, distances, and varying illumination conditions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Contextual working memory for trans-saccadic object recognition using reinforcement learning and informative local descriptors

Abstract

Talk to us

Similar Papers

More From: Perception

Lead the way for us

Similar Papers

Binocular Coordination of the Eyes during Reading
Simon P Liversedge ... Eugene Mcsorley
Current Biology | VOL. 16
Simon P Liversedge, et. al.Simon P Liversedge ... Eugene Mcsorley
01 Sep 2006
Current Biology | VOL. 16

Robust training attenuates TBI-induced deficits in reference and working memory on the radial 8-arm maze.
Veronica Sebastian ... Aissatou Diallo
Frontiers in Behavioral Neuroscience | VOL. 7
Veronica Sebastian, et. al.Veronica Sebastian ... Aissatou Diallo
01 Jan 2013
Frontiers in Behavioral Neuroscience | VOL. 7

利用 RGB-D 影像串流建構 3D 場景貼合圖之研究

-

01 Jan 2013
01 Jan 2013

The neural correlates of impaired collision avoidance in hemianopic patients
Eleni Papageorgiou ... Horst Wiethölter
Acta Ophthalmologica | VOL. 90
Eleni Papageorgiou, et. al.Eleni Papageorgiou ... Horst Wiethölter
16 Dec 2011
Acta Ophthalmologica | VOL. 90

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Contextual working memory for trans-saccadic object recognition using reinforcement learning and informative local descriptors

Abstract

Talk to us

Similar Papers

More From: Perception