Abstract

We introduce a new task-independent framework to model top-down overt visual attention based on graphical models for probabilistic inference and reasoning. We describe a Dynamic Bayesian Network (DBN) that infers probability distributions over attended objects and spatial locations directly from observed data. Probabilistic inference in our model is performed over object-related functions, which are fed by manual annotations of objects in video scenes or by state-of-the-art object detection models. Evaluating on ∼3 hours (approx. 315,000 eye fixations and 12,600 saccades) of data from observers playing 3 video games (time-scheduling, driving, and flight combat), we show that our approach is significantly more predictive of eye fixations than: 1) simpler classifier-based models, also developed here, that map a signature of a scene (multi-modal information from gist, bottom-up saliency, physical actions, and events) to eye positions, 2) 14 state-of-the-art bottom-up saliency models, and 3) brute-force algorithms such as mean eye position. Our results show that the proposed model is more effective in employing and reasoning over spatio-temporal visual data.
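The abstract does not specify the DBN's structure or parameters, but the core operation it describes, inferring a distribution over attended objects from per-frame object evidence, can be illustrated with standard forward filtering. The sketch below is a minimal, hypothetical example (all variable names, matrices, and numbers are invented for illustration): a discrete hidden state indexes the currently attended object, a transition matrix encodes attentional persistence, and detector-style scores supply the per-frame likelihoods.

```python
import numpy as np

def forward_filter(prior, transition, likelihoods):
    """Forward filtering in a discrete-state DBN (HMM-style sketch).

    prior:       (K,)   initial belief P(X_0) over K candidate objects
    transition:  (K, K) P(X_t = j | X_{t-1} = i) at [i, j]
    likelihoods: (T, K) P(evidence_t | X_t) for T frames
    Returns (T, K) filtered beliefs P(X_t | evidence_{1:t}).
    """
    beliefs = np.zeros_like(likelihoods, dtype=float)
    belief = prior
    for t, lik in enumerate(likelihoods):
        predicted = transition.T @ belief   # one-step prediction through the dynamics
        belief = predicted * lik            # weight by the current frame's evidence
        belief /= belief.sum()              # renormalize to a proper distribution
        beliefs[t] = belief
    return beliefs

# Toy run: K = 3 candidate objects over T = 4 frames (all numbers made up).
K = 3
prior = np.full(K, 1.0 / K)
transition = np.array([[0.8, 0.1, 0.1],
                       [0.1, 0.8, 0.1],
                       [0.1, 0.1, 0.8]])    # attention tends to persist on one object
likelihoods = np.array([[0.7, 0.2, 0.1],
                        [0.6, 0.3, 0.1],
                        [0.2, 0.7, 0.1],
                        [0.1, 0.8, 0.1]])   # per-frame detector-style scores
print(forward_filter(prior, transition, likelihoods))
```

In the paper's setting, the filtered belief over objects would then be mapped to a spatial distribution over the frame (e.g., via the annotated or detected object locations) to produce the predicted fixation map; that mapping is not shown here.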
