SeqRank: Sequential Ranking of Salient Objects

Huankang Guan,Rynson W.H Lau

doi:10.1609/aaai.v38i3.27964

Abstract

Salient Object Ranking (SOR) is the process of predicting the order of an observer's attention to objects when viewing a complex scene. Existing SOR methods primarily focus on ranking various scene objects simultaneously by exploring their spatial and semantic properties. However, their solutions of simultaneously ranking all salient objects do not align with human viewing behavior, and may result in incorrect attention shift predictions. We observe that humans view a scene through a sequential and continuous process involving a cycle of foveating to objects of interest with our foveal vision while using peripheral vision to prepare for the next fixation location. For instance, when we see a flying kite, our foveal vision captures the kite itself, while our peripheral vision can help us locate the person controlling it such that we can smoothly divert our attention to it next. By repeatedly carrying out this cycle, we can gain a thorough understanding of the entire scene. Based on this observation, we propose to model the dynamic interplay between foveal and peripheral vision to predict human attention shifts sequentially. To this end, we propose a novel SOR model, SeqRank, which reproduces foveal vision to extract high-acuity visual features for accurate salient instance segmentation while also modeling peripheral vision to select the object that is likely to grab the viewer’s attention next. By incorporating both types of vision, our model can mimic human viewing behavior better and provide a more faithful ranking among various scene objects. Most notably, our model improves the SA-SOR/MAE scores by +6.1%/-13.0% on IRSR, compared with the state-of-the-art. Extensive experiments show the superior performance of our model on the SOR benchmarks. Code is available at https://github.com/guanhuankang/SeqRank.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SeqRank: Sequential Ranking of Salient Objects

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Similar Papers

Position jitter and undersampling in pattern perception
Dennis M Levi ... Vineeta Sharma
Vision Research | VOL. 39
Dennis M Levi, et. al.Dennis M Levi ... Vineeta Sharma
09 Dec 1998
Vision Research | VOL. 39

Visual search in naturalistic scenes from foveal to peripheral vision: A comparison between dynamic and static displays.
Antje Nuthmann ... Teresa Canas-Bajo
Journal of Vision | VOL. 22
Antje Nuthmann, et. al.Antje Nuthmann ... Teresa Canas-Bajo
19 Jan 2022
Journal of Vision | VOL. 22

Binocular summation for grating detection and resolution in foveal and peripheral vision
Margarita B Zlatkova ... Fergal A Ennis
Vision Research | VOL. 41
Margarita B Zlatkova, et. al.Margarita B Zlatkova ... Fergal A Ennis
01 Nov 2001
Vision Research | VOL. 41

How do the regions of the visual field contribute to object search in real-world scenes? Evidence from eye movements.
Antje Nuthmann
Journal of Experimental Psychology: Human Perception and Performance | VOL. 40
Antje NuthmannAntje Nuthmann
01 Feb 2014
Journal of Experimental Psychology: Human Perception and Performance | VOL. 40

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SeqRank: Sequential Ranking of Salient Objects

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence