Understanding Eye Movements: Psychophysics and a Model of Primary Visual Cortex

David Berga

doi:10.5565/rev/elcvia.1193

Abstract

Humans move their eyes in order to learn visual representations of the world. These eye movements depend on distinct factors, either by the scene that we perceive or by our own decisions. To select what is relevant to attend is part of our survival mechanisms and the way we build reality, as we constantly react both consciously and unconsciously to all the stimuli that is projected into our eyes. In this thesis we try to explain (1) how we move our eyes, (2) how to build machines that understand visual information and deploy eye movements, and (3) how to make these machines understand tasks in order to decide for eye movements.(1) We provided the analysis of eye movement behavior elicited by low-level feature distinctiveness with a dataset of 230 synthetically-generated image patterns. A total of 15 types of stimuli has been generated (e.g. orientation, brightness, color, size, etc.), with 7 feature contrasts for each feature category. Eye-tracking data was collected from 34 participants during the viewing of the dataset, using Free-Viewing and Visual Search task instructions. Results showed that saliency is predominantly and distinctively influenced by: 1. feature type, 2. feature contrast, 3. temporality of fixations, 4. task difficulty and 5. center bias. From such dataset (SID4VAM), we have computed a benchmark of saliency models by testing performance using psychophysical patterns. Model performance has been evaluated considering model inspiration and consistency with human psychophysics. Our study reveals that state-of-the-art Deep Learning saliency models do not perform well with synthetic pattern images, instead, models with Spectral/Fourier inspiration outperform others in saliency metrics and are more consistent with human psychophysical experimentation.(2) Computations in the primary visual cortex (area V1 or striate cortex) have long been hypothesized to be responsible, among several visual processing mechanisms, of bottom-up visual attention (also named saliency). In order to validate this hypothesis, images from eye tracking datasets have been processed with a biologically plausible model of V1 (named Neurodynamic Saliency Wavelet Model or NSWAM). Following Li's neurodynamic model, we define V1's lateral connections with a network of firing rate neurons, sensitive to visual features such as brightness, color, orientation and scale. Early subcortical processes (i.e. retinal and thalamic) are functionally simulated. The resulting saliency maps are generated from the model output, representing the neuronal activity of V1 projections towards brain areas involved in eye movement control. We want to pinpoint that our unified computational architecture is able to reproduce several visual processes (i.e. brightness, chromatic induction and visual discomfort) without applying any type of training or optimization and keeping the same parametrization. The model has been extended (NSWAM-CM) with an implementation of the cortical magnification function to define the retinotopical projections towards V1, processing neuronal activity for each distinct view during scene observation. Novel computational definitions of top-down inhibition (in terms of inhibition of return and selection mechanisms), are also proposed to predict attention in Free-Viewing and Visual Search conditions. Results show that our model outpeforms other biologically-inpired models of saliency prediction as well as to predict visual saccade sequences, specifically for nature and synthetic images. We also show how temporal and spatial characteristics of inhibition of return can improve prediction of saccades, as well as how distinct search strategies (in terms of feature-selective or category-specific inhibition) predict attention at distinct image contexts.(3) Although previous scanpath models have been able to efficiently predict saccades during Free-Viewing, it is well known that stimulus and task instructions can strongly affect eye movement patterns. In particular, task priming has been shown to be crucial to the deployment of eye movements, involving interactions between brain areas related to goal-directed behavior, working and long-term memory in combination with stimulus-driven eye movement neuronal correlates. In our latest study we proposed an extension of the Selective Tuning Attentive Reference Fixation Controller Model based on task demands (STAR-FCT), describing novel computational definitions of Long-Term Memory, Visual Task Executive and Task Working Memory. With these modules we are able to use textual instructions in order to guide the model to attend to specific categories of objects and/or places in the scene. We have designed our memory model by processing a visual hierarchy of low- and high-level features. The relationship between the executive task instructions and the memory representations has been specified using a tree of semantic similarities between the learned features and the object category labels. Results reveal that by using this model, the resulting object localization maps and predicted saccades have a higher probability to fall inside the salient regions depending on the distinct task instructions compared to saliency.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: ELCVIA Electronic Letters on Computer Vision and Image Analysis	Publication Date: Feb 7, 2020
Citations: 2	License type: CC BY-NC-ND 4.0

R Discovery Prime

R Discovery Prime

Understanding Eye Movements: Psychophysics and a Model of Primary Visual Cortex

Abstract

Talk to us

Similar Papers

More From: ELCVIA Electronic Letters on Computer Vision and Image Analysis

Lead the way for us

Similar Papers

A Neurodynamic Model of Saliency Prediction in V1.
David Berga ... Xavier Otazu
Neural computation | VOL. 34
David Berga, et. al.David Berga ... Xavier Otazu
14 Jan 2022
Neural computation | VOL. 34

SID4VAM: A Benchmark Dataset With Synthetic Images for Visual Attention Modeling
David Berga ... Xose M Pardo
-
David Berga, et. al.David Berga ... Xose M Pardo
01 Oct 2019
01 Oct 2019

Neuroanatomical Structures in Extraocular Muscles and Their Potential Implication in the Management of Strabismus
Jan Richard Bruenech
Advances in Ophthalmology and Optometry | VOL. 6
Jan Richard BruenechJan Richard Bruenech
04 Jun 2021
Advances in Ophthalmology and Optometry | VOL. 6

Saccade, search and orient – the neural control of saccadic eye movements
Douglas P Munoz ... Brian C Coe
European Journal of Neuroscience | VOL. 33
Douglas P Munoz, et. al.Douglas P Munoz ... Brian C Coe
01 Jun 2011
European Journal of Neuroscience | VOL. 33

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Understanding Eye Movements: Psychophysics and a Model of Primary Visual Cortex

Abstract

Talk to us

Similar Papers

More From: ELCVIA Electronic Letters on Computer Vision and Image Analysis