Visual Attention Prediction for Stereoscopic Video by Multi-Module Fully Convolutional Network.

Yuming Fang,Jianjun Lei,Hanqin Huang,Chi Zhang

doi:10.1109/tip.2019.2916766

Abstract

Visual attention is an important mechanism in the human visual system (HVS) and there have been numerous saliency detection algorithms designed for 2D images/video recently. However, the research for fixation detection of stereoscopic video is still limited and challenging due to the complicated depth and motion information. In this paper, we design a novel multi-module fully convolutional network (MM-FCN) for fixation detection of stereoscopic video. Specifically, we design a fully convolutional network for spatial saliency prediction (S-FCN), where the initial spatial saliency map of stereoscopic video is learned by image database of object detection. Furthermore, the fully convolutional network for temporal saliency prediction (T-FCN) is constructed by combining saliency results from S-FCN and motion information from video frames. Finally, the fully convolutional network for depth fixation prediction (D-FCN) is designed to compute the final fixation map of stereoscopic video by learning depth features with spatiotemporal features from T-FCN. The experimental results show that the proposed MM-FCN can predict fixation results for stereoscopic video more effectively and efficiently than other related fixation prediction methods.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Visual Attention Prediction for Stereoscopic Video by Multi-Module Fully Convolutional Network.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

Lead the way for us

Journal: IEEE transactions on image processing : a publication of the IEEE Signal Processing Society	Publication Date: May 20, 2019
Citations: 87

Similar Papers

Superpixel-Based Spatiotemporal Saliency Detection
Zhi Liu ... Olivier Le Meur
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 24
Zhi Liu, et. al.Zhi Liu ... Olivier Le Meur
19 May 2014
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 24

Superpixel-Based Stereoscopic Video Saliency Detection Using Support Vector Regression Learning
Ting-Yu Chou ... Jin-Jang Leou
-
Ting-Yu Chou, et. al.Ting-Yu Chou ... Jin-Jang Leou
01 Jan 2020
01 Jan 2020

DeepGaze IIE: Calibrated prediction in and out-of-domain for state-of-the-art saliency modeling
Akis Linardos ... Ori Press
-
Akis Linardos, et. al.Akis Linardos ... Ori Press
01 Oct 2021
01 Oct 2021

Visual Attention Model Aided Non-Uniform Asymmetric Coding of Stereoscopic Video
Erhan Ekmekcioglu ... Peter Tho Pesch
IEEE Journal of Selected Topics in Signal Processing | VOL. 8
Erhan Ekmekcioglu, et. al.Erhan Ekmekcioglu ... Peter Tho Pesch
01 Jun 2014
IEEE Journal of Selected Topics in Signal Processing | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Visual Attention Prediction for Stereoscopic Video by Multi-Module Fully Convolutional Network.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on image processing : a publication of the IEEE Signal Processing Society