Abstract

The human attention mechanism can be understood and simulated by closely associating the saliency prediction task to neuroscience and psychology. Furthermore, saliency prediction is widely used in computer vision and interdisciplinary subjects. In recent years, with the rapid development of deep learning, deep models have made amazing achievements in saliency prediction. Deep learning models can automatically learn features, thus solving many drawbacks of the classic models, such as handcrafted features and task settings, among others. Nevertheless, the deep models still have some limitations, for example in tasks involving multi-modality and semantic understanding. This study focuses on summarizing the relevant achievements in the field of saliency prediction, including the early neurological and psychological mechanisms and the guiding role of classic models, followed by the development process and data comparison of classic and deep saliency prediction models. This study also discusses the relationship between the model and human vision, as well as the factors that cause the semantic gaps, the influences of attention in cognitive research, the limitations of the saliency model, and the emerging applications, to provide new saliency predictions for follow-up work and the necessary help and advice.

Highlights

  • 80% of the information that humans receive every day comes from vision

  • The development of visual saliency prediction tasks has produced numerous methods, and all of them have played an important role in various research directions

  • Performance can be significantly improved with respect to the classic model that uses handcrafted features

Read more

Summary

Introduction

80% of the information that humans receive every day comes from vision. Visual saliency prediction is a mechanism that imitates human visual attention, including relevant knowledge such as neurobiological, psychological, and computer vision. The significance of the research on visual saliency detection lies in two aspects: first, as a verifiable prediction, it can be used as a model-based hypothesis test to understand human attention mechanisms at the behavioral and neural levels. The current research on saliency detection mainly involves two types of tasks, namely, saliency prediction (or eye fixation prediction) and Salient Object Detection (SOD). The limitations of the current deep learning model were analyzed, the possible directions for improvement were proposed, new application areas based on the latest progress of deep learning were discussed, and the contribution and significance of saliency prediction with respect to future development trends were presented

Psychological and Neurobiological Basis of Visual Saliency
Classic Visual Saliency Models
Bottom-Up Visual Saliency Models
Top-Down Visual Saliency Models
Deep Visual Saliency Models
Static Models
Dynamic Models
Visual Saliency Prediction Datasets
Static Datasets
Dynamic Datasets
Evaluation Measures for Visual Saliency Prediction
Performance of Visual Saliency Prediction Models
Commonalities and Limitations of the Deep Saliency Models
Multi-modal approaches
Visualization
Understand high-level semantics
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call