Toward Improving the Evaluation of Visual Attention Models: a Crowdsourcing Approach

Dario Zanca, Stefano Melacci, Marco Gori

doi:10.1109/ijcnn48605.2020.9207438

Abstract

Human visual attention is a complex phenomenon. A computational model of this phenomenon must take into account where people look, in order to determine which locations are salient (spatial distribution of the fixations); when they look at those locations, in order to understand the temporal development of the exploration (temporal order of the fixations); and how they move from one location to another with respect to the dynamics of the scene and the mechanics of the eyes (dynamics). State-of-the-art models focus on learning saliency maps from human data, a process that only takes into account the spatial component of the phenomenon and ignores its temporal and dynamical counterparts. In this work we focus on the evaluation methodology of models of human visual attention. We underline the limits of the current metrics for saliency prediction and scanpath similarity, and we introduce a statistical measure for the evaluation of the dynamics of the simulated eye movements. While deep learning models achieve astonishing performance in saliency prediction, our analysis shows their limitations in capturing the dynamics of the process. We find that unsupervised gravitational models, despite their simplicity, outperform all competitors. Finally, exploiting a crowdsourcing platform, we present a study aimed at evaluating how plausible the scanpaths generated with the unsupervised gravitational models appear to naive and expert human observers.
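
The three components named in the abstract (where, when, how) can be made concrete with a small sketch. This is only an illustration under stated assumptions, not the evaluation measure introduced in the paper: the scanpath data, the grid-based fixation count, and the function names fixation_grid, fixation_onsets and saccade_amplitudes are all hypothetical.

```python
from math import hypot

# Hypothetical scanpath (assumption): fixations as (x, y, duration_ms), in viewing order.
scanpath = [(120, 80, 210), (300, 90, 180), (310, 240, 260), (150, 230, 190)]


def fixation_grid(fixations, width, height, cell):
    """Where: count fixations per grid cell (a crude stand-in for a saliency map)."""
    cols, rows = width // cell, height // cell
    grid = [[0] * cols for _ in range(rows)]
    for x, y, _ in fixations:
        # Clamp to the last row/column so border fixations stay inside the grid.
        grid[min(y // cell, rows - 1)][min(x // cell, cols - 1)] += 1
    return grid


def fixation_onsets(fixations):
    """When: onset time of each fixation in ms, ignoring saccade durations."""
    onsets, t = [], 0
    for _, _, duration in fixations:
        onsets.append(t)
        t += duration
    return onsets


def saccade_amplitudes(fixations):
    """How: Euclidean distance in pixels between consecutive fixations."""
    return [
        hypot(x2 - x1, y2 - y1)
        for (x1, y1, _), (x2, y2, _) in zip(fixations, fixations[1:])
    ]


grid = fixation_grid(scanpath, width=400, height=300, cell=100)
onsets = fixation_onsets(scanpath)
amplitudes = saccade_amplitudes(scanpath)
```

A saliency-only evaluation would compare something like grid against human data; the onsets and amplitudes carry the temporal and dynamical information that such a comparison discards.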
