Abstract

In the first seconds of observation of an image, several visual attention processes are involved in the identification of the visual targets that pop-out from the scene to our eyes. Saliency is the quality that makes certain regions of an image stand out from the visual field and grab our attention. Saliency detection models, inspired by visual cortex mechanisms, employ both colour and luminance features. Furthermore, both locations of pixels and presence of objects influence the Visual Attention processes. In this paper, we propose a new saliency method based on the combination of the distribution of interest points in the image with multiscale analysis, a centre bias module and a machine learning approach. We use perceptually uniform colour spaces to study how colour impacts on the extraction of saliency. To investigate eye-movements and assess the performances of saliency methods over object-based images, we conduct experimental sessions on our dataset ETTO (Eye Tracking Through Objects). Experiments show our approach to be accurate in the detection of saliency concerning state-of-the-art methods and accessible eye-movement datasets. The performances over object-based images are excellent and consistent on generic pictures. Besides, our work reveals interesting findings on some relationships between saliency and perceptually uniform colour spaces.

Highlights

  • The human visual process starts outside the brain with the projection of the light onto the retina

  • We studied the impact of perceptually uniform colour spaces such as CIE lightness values (L*)a*b* and CIE L*u*v* in the extraction of saliency maps

  • As it is observed in tables 1, 2, 3, 4 and 5 we compared the performance of our proposed method against several state-of-the-art methods, which are based on different approaches and principles

Read more

Summary

Introduction

The human visual process starts outside the brain with the projection of the light onto the retina. The lack of storage capacity of our brain concerning the huge amount of information going from our eyes towards the cerebral cortex to be processed at much higher levels. Due to the limits of our brain, we cannot simultaneously perform complex analysis on all the input visual information [1]. The detection of the most critical visual subset occurs as one of the most important tasks of the Human Visual System (HVS). When a person performs any visual task (watching TV, driving a car) the eyes flick rapidly from place to place to inspect the visual scene.

Objectives
Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call