Saliency Model Research Articles

With the rapid development of 5G technologies, it is possible to perform high-speed transmission of virtual reality (VR) visual contents. As one of techniques related to VR visual contents, omnidirectional image/video quality assessment has been becoming one of important issues. In this paper, based on latitude characteristics of omnidirectional image and binocular perception of human visual system, we propose a new blind stereoscopic omnidirectional image quality assessment (BSOIQA) method. Firstly, with the latitude characteristics of omnidirectional image, an improved omnidirectional image saliency model is established to estimate the visual behavior of users. Secondly, to describe the perceptual characteristics of the left and right views of stereoscopic omnidirectional image (SOI), Weber local descriptor is used to obtain differential excitation image and orientation image. The differential excitation image is combined with saliency map to obtain the statistical features of quantized orientation image, and multi-scale statistical features are extracted from the left and right views of SOI. Thirdly, tensor decomposition is used to fuse the left and right views of SOI to extract binocular perception features. Dual-tree dual-density complex wavelet transform, image information entropy and singular value decomposition are used to extract binocular fusion features, and the latitude characteristics of SOI are taken into account in image blocking. The gradient map of the absolute difference image of the left and right views of SOI is calculated, and its histogram statistical features are extracted to represent binocular rivalry. In addition, the statistical characteristics of the binocular product image are used to represent the local structure information of binocular perception. Finally, the feature vector of SOI is created with feature screening and used to predict the objective quality of SOI, and an SOI subjective assessment dataset (NBU-SOID) is also established to evaluate the proposed BSOIQA method. Experimental results on NBU-SOID and other databases show that the proposed BSOIQA method is in better consistence with human visual perception, compared with the other state-of-the-art methods.

Humans move their eyes in order to learn visual representations of the world. These eye movements depend on distinct factors, either by the scene that we perceive or by our own decisions. To select what is relevant to attend is part of our survival mechanisms and the way we build reality, as we constantly react both consciously and unconsciously to all the stimuli that is projected into our eyes. In this thesis we try to explain (1) how we move our eyes, (2) how to build machines that understand visual information and deploy eye movements, and (3) how to make these machines understand tasks in order to decide for eye movements.(1) We provided the analysis of eye movement behavior elicited by low-level feature distinctiveness with a dataset of 230 synthetically-generated image patterns. A total of 15 types of stimuli has been generated (e.g. orientation, brightness, color, size, etc.), with 7 feature contrasts for each feature category. Eye-tracking data was collected from 34 participants during the viewing of the dataset, using Free-Viewing and Visual Search task instructions. Results showed that saliency is predominantly and distinctively influenced by: 1. feature type, 2. feature contrast, 3. temporality of fixations, 4. task difficulty and 5. center bias. From such dataset (SID4VAM), we have computed a benchmark of saliency models by testing performance using psychophysical patterns. Model performance has been evaluated considering model inspiration and consistency with human psychophysics. Our study reveals that state-of-the-art Deep Learning saliency models do not perform well with synthetic pattern images, instead, models with Spectral/Fourier inspiration outperform others in saliency metrics and are more consistent with human psychophysical experimentation.(2) Computations in the primary visual cortex (area V1 or striate cortex) have long been hypothesized to be responsible, among several visual processing mechanisms, of bottom-up visual attention (also named saliency). In order to validate this hypothesis, images from eye tracking datasets have been processed with a biologically plausible model of V1 (named Neurodynamic Saliency Wavelet Model or NSWAM). Following Li's neurodynamic model, we define V1's lateral connections with a network of firing rate neurons, sensitive to visual features such as brightness, color, orientation and scale. Early subcortical processes (i.e. retinal and thalamic) are functionally simulated. The resulting saliency maps are generated from the model output, representing the neuronal activity of V1 projections towards brain areas involved in eye movement control. We want to pinpoint that our unified computational architecture is able to reproduce several visual processes (i.e. brightness, chromatic induction and visual discomfort) without applying any type of training or optimization and keeping the same parametrization. The model has been extended (NSWAM-CM) with an implementation of the cortical magnification function to define the retinotopical projections towards V1, processing neuronal activity for each distinct view during scene observation. Novel computational definitions of top-down inhibition (in terms of inhibition of return and selection mechanisms), are also proposed to predict attention in Free-Viewing and Visual Search conditions. Results show that our model outpeforms other biologically-inpired models of saliency prediction as well as to predict visual saccade sequences, specifically for nature and synthetic images. We also show how temporal and spatial characteristics of inhibition of return can improve prediction of saccades, as well as how distinct search strategies (in terms of feature-selective or category-specific inhibition) predict attention at distinct image contexts.(3) Although previous scanpath models have been able to efficiently predict saccades during Free-Viewing, it is well known that stimulus and task instructions can strongly affect eye movement patterns. In particular, task priming has been shown to be crucial to the deployment of eye movements, involving interactions between brain areas related to goal-directed behavior, working and long-term memory in combination with stimulus-driven eye movement neuronal correlates. In our latest study we proposed an extension of the Selective Tuning Attentive Reference Fixation Controller Model based on task demands (STAR-FCT), describing novel computational definitions of Long-Term Memory, Visual Task Executive and Task Working Memory. With these modules we are able to use textual instructions in order to guide the model to attend to specific categories of objects and/or places in the scene. We have designed our memory model by processing a visual hierarchy of low- and high-level features. The relationship between the executive task instructions and the memory representations has been specified using a tree of semantic similarities between the learned features and the object category labels. Results reveal that by using this model, the resulting object localization maps and predicted saccades have a higher probability to fall inside the salient regions depending on the distinct task instructions compared to saliency.

Saliency Model Research Articles

Related Topics

Articles published on Saliency Model

Priorização de Stakeholders em Empresas Familiares: Um Ensaio Teórico

Saliency and priority modulation in a pop-out paradigm: Pupil size and microsaccades

Ship Target Automatic Detection Based on Hypercomplex Flourier Transform Saliency Model in High Spatial Resolution Remote-Sensing Images.

Mobile Interface Attentional Priority Model

Explore double-opponency and skin color for saliency detection

Viewport-Dependent Saliency Prediction in 360° Video

A brief survey of visual saliency detection

Towards Making Videos Accessible for Low Vision Screen Magnifier Users.

Latitude and binocular perception based blind stereoscopic omnidirectional image quality assessment for VR system

CMP-based saliency model for stereoscopic omnidirectional images

Exploiting Surroundedness and Superpixel cues for salient region detection

Robust image hashing with visual attention model and invariant moments

Big Decisions in Small Business Ecosystems: Stakeholder Priority

Detecting visually salient scene areas and deriving their relative spatial relations from continuous street-view panoramas

Rule-based aggregation driven by similar images for visual saliency detection

Understanding Eye Movements: Psychophysics and a Model of Primary Visual Cortex

Research on Generating an Indoor Landmark Salience Model for Self-Location and Spatial Orientation from Eye-Tracking Data

Attention-guided RGBD saliency detection using appearance information

A novel image recognition approach using multiscale saliency model and GoogLeNet

Saliency detection using adversarial learning networks

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Saliency Model Research Articles

Related Topics

Articles published on Saliency Model

Priorização de Stakeholders em Empresas Familiares: Um Ensaio Teórico

Saliency and priority modulation in a pop-out paradigm: Pupil size and microsaccades

Ship Target Automatic Detection Based on Hypercomplex Flourier Transform Saliency Model in High Spatial Resolution Remote-Sensing Images.

Mobile Interface Attentional Priority Model

Explore double-opponency and skin color for saliency detection

Viewport-Dependent Saliency Prediction in 360° Video

A brief survey of visual saliency detection

Towards Making Videos Accessible for Low Vision Screen Magnifier Users.

Latitude and binocular perception based blind stereoscopic omnidirectional image quality assessment for VR system

CMP-based saliency model for stereoscopic omnidirectional images

Exploiting Surroundedness and Superpixel cues for salient region detection

Robust image hashing with visual attention model and invariant moments

Big Decisions in Small Business Ecosystems: Stakeholder Priority

Detecting visually salient scene areas and deriving their relative spatial relations from continuous street-view panoramas

Rule-based aggregation driven by similar images for visual saliency detection

Understanding Eye Movements: Psychophysics and a Model of Primary Visual Cortex

Research on Generating an Indoor Landmark Salience Model for Self-Location and Spatial Orientation from Eye-Tracking Data

Attention-guided RGBD saliency detection using appearance information

A novel image recognition approach using multiscale saliency model and GoogLeNet

Saliency detection using adversarial learning networks