Abstract

Predicting saliency in videos is a challenging problem because it requires modeling complex interactions between spatial and temporal information, especially given the ever-changing, dynamic nature of video content. Recently, researchers have proposed large-scale data sets and deep learning models to understand what is important for video saliency. These approaches, however, learn to combine spatial and temporal features in a static manner and do not adapt to changes in the video content. In this article, we introduce the gated fusion network for dynamic saliency (GFSalNet), the first deep saliency model capable of making predictions in a dynamic way via a gated fusion mechanism. Our model also exploits spatial and channel-wise attention within a multiscale architecture, which further improves prediction accuracy. We evaluate the proposed approach on a number of data sets, and our experimental analysis demonstrates that it outperforms or is highly competitive with the state of the art. Importantly, we show that it generalizes well and exploits temporal information more effectively via its adaptive fusion scheme.
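To make the core idea concrete, the sketch below shows one common way a gated fusion of spatial and temporal feature maps can be realized: a sigmoid gate, predicted from both streams, weights their combination per pixel and per channel, so the mixing adapts to the input rather than being fixed. This is a minimal illustrative sketch in PyTorch, not the authors' GFSalNet implementation; the class name, the 1x1-convolution gate, and all shapes and hyperparameters are assumptions made for the example.

```python
import torch
import torch.nn as nn


class GatedFusion(nn.Module):
    """Illustrative gated fusion of spatial and temporal feature maps.

    A 1x1 convolution predicts a per-pixel, per-channel gate from the
    concatenated streams; the sigmoid-bounded gate then adaptively
    weights the two streams. (A sketch of the general idea, not the
    paper's exact architecture.)
    """

    def __init__(self, channels: int):
        super().__init__()
        # Gate is computed from both streams; sigmoid keeps it in [0, 1].
        self.gate_conv = nn.Conv2d(2 * channels, channels, kernel_size=1)

    def forward(self, spatial: torch.Tensor, temporal: torch.Tensor) -> torch.Tensor:
        g = torch.sigmoid(self.gate_conv(torch.cat([spatial, temporal], dim=1)))
        # Content-dependent mixing: the weighting changes with every input,
        # unlike a static, learned-once combination of the two streams.
        return g * spatial + (1.0 - g) * temporal


# Example: fuse 64-channel feature maps from two hypothetical streams.
fusion = GatedFusion(channels=64)
spatial_feats = torch.randn(1, 64, 28, 28)   # e.g., appearance features
temporal_feats = torch.randn(1, 64, 28, 28)  # e.g., motion features
fused = fusion(spatial_feats, temporal_feats)
print(fused.shape)  # torch.Size([1, 64, 28, 28])
```

The convex combination g * spatial + (1 - g) * temporal is one natural design choice here: because the gate depends on the current frame's features, the model can lean on motion cues in dynamic scenes and on appearance cues in static ones, which is the adaptivity the abstract contrasts with static fusion.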
