A Spatial-Temporal Recurrent Neural Network for Video Saliency Prediction.

Kao Zhang,Shan Liu,Zhenzhong Chen

doi:10.1109/tip.2020.3036749

Abstract

In this paper, a recurrent neural network is designed for video saliency prediction considering spatial-temporal features. In our work, video frames are routed through the static network for spatial features and the dynamic network for temporal features. For the spatial-temporal feature integration, a novel select and re-weight fusion model is proposed which can learn and adjust the fusion weights based on the spatial and temporal features in different scenes automatically. Finally, an attention-aware convolutional long short term memory (ConvLSTM) network is developed to predict salient regions based on the features extracted from consecutive frames and generate the ultimate saliency map for each video frame. The proposed method is compared with state-of-the-art saliency models on five public video saliency benchmark datasets. The experimental results demonstrate that our model can achieve advanced performance on video saliency prediction.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Spatial-Temporal Recurrent Neural Network for Video Saliency Prediction.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Image Processing

Lead the way for us

Journal: IEEE Transactions on Image Processing	Publication Date: Nov 18, 2020
Citations: 102

Similar Papers

Video saliency prediction with optimized optical flow and gravity center bias
Zhe Wu ... Bo Wu
-
Zhe Wu, et. al.Zhe Wu ... Bo Wu
01 Jul 2016
01 Jul 2016

TwinLSTM: Two-channel LSTM Network for Online Action Detection
Yunfei Han ... Shan Tan
-
Yunfei Han, et. al.Yunfei Han ... Shan Tan
21 Aug 2022
21 Aug 2022

DeepVS: A Deep Learning Based Video Saliency Prediction Approach
Lai Jiang ... Mai Xu
-
Lai Jiang, et. al.Lai Jiang ... Mai Xu
01 Jan 2018
01 Jan 2018

Tropical Cyclone Track Prediction with an Encoding-to-Forecasting Deep Learning Model
Pingping Dong ... Yuping Zhang
Weather and Forecasting | VOL. 37
Pingping Dong, et. al.Pingping Dong ... Yuping Zhang
01 Jun 2022
Weather and Forecasting | VOL. 37

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Spatial-Temporal Recurrent Neural Network for Video Saliency Prediction.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Image Processing