Scanpath and saliency prediction on 360 degree images

Marc Assens,Xavier Giro-I-Nieto,Kevin Mcguinness,Noel E O’Connor

doi:10.1016/j.image.2018.06.006

Marc Assens, Xavier Giro-I-Nieto + Show 2 more

Open Access

https://doi.org/10.1016/j.image.2018.06.006

Copy DOI

Abstract

We introduce deep neural networks for scanpath and saliency prediction trained on 360-degree images. The scanpath prediction model called SaltiNet is based on a temporal-aware novel representation of saliency information named the saliency volume. The first part of the network consists of a model trained to generate saliency volumes, whose parameters are fit by back-propagation using a binary cross entropy (BCE) loss over downsampled versions of the saliency volumes. Sampling strategies over these volumes are used to generate scanpaths over the 360-degree images. Our experiments show the advantages of using saliency volumes, and how they can be used for related tasks. We also show how a similar architecture achieves state-of-the-art performance for the related task of saliency map prediction. Our source code and trained models available at https://github.com/massens/saliency-360salient-2017.

Full Text