Learning multiscale hierarchical attention for video summarization

Wencheng Zhu, Jiwen Lu, Yucheng Han, Jie Zhou

doi:10.1016/j.patcog.2021.108312

Abstract

In this paper, we propose a multiscale hierarchical attention approach for supervised video summarization. Unlike most existing supervised methods, which employ bidirectional long short-term memory networks, our method exploits the underlying hierarchical structure of video sequences and learns both short-range and long-range temporal representations via intra-block and inter-block attention. Specifically, we first separate each video sequence into blocks of equal length and employ intra-block and inter-block attention to learn local and global information, respectively. Then, we integrate the frame-level, block-level, and video-level representations to predict frame-level importance scores. Next, we conduct shot segmentation and compute shot-level importance scores. Finally, we perform key shot selection to produce video summaries. Moreover, we extend our method into a two-stream framework that leverages both appearance and motion information. Experimental results on the SumMe and TVSum datasets validate the effectiveness of our method against state-of-the-art methods.
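
The block-wise attention described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the parameter-free dot-product attention, the mean pooling used to form block-level and video-level vectors, the concatenation of the three levels, and the names `self_attention` and `hierarchical_features` are all assumptions made for illustration. The abstract only states that frames are split into equal-length blocks, that intra-block and inter-block attention capture local and global information, and that frame-level, block-level, and video-level representations are integrated.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def self_attention(x):
    """Scaled dot-product self-attention without learned weights (an assumption).

    x has shape (..., length, dim); attention runs over the `length` axis.
    """
    d = x.shape[-1]
    scores = x @ np.swapaxes(x, -1, -2) / np.sqrt(d)
    return softmax(scores, axis=-1) @ x


def hierarchical_features(frames, block_len):
    """Combine frame-, block-, and video-level features for every frame.

    frames: array of shape (n_frames, dim); n_frames must be a multiple of
    block_len so that all blocks have equal length.
    Returns an array of shape (n_frames, 3 * dim).
    """
    n, d = frames.shape
    if n % block_len != 0:
        raise ValueError("n_frames must be a multiple of block_len")

    # Split the sequence into equal-length blocks: (num_blocks, block_len, dim).
    blocks = frames.reshape(-1, block_len, d)

    # Intra-block attention: each frame attends only to frames in its own block.
    local = self_attention(blocks)

    # One vector per block (mean pooling is an illustrative choice).
    block_repr = local.mean(axis=1)

    # Inter-block attention: each block attends to all other blocks.
    global_blocks = self_attention(block_repr)

    # One vector for the whole video (again mean pooling, an assumption).
    video_repr = global_blocks.mean(axis=0)

    frame_level = local.reshape(n, d)
    block_level = np.repeat(global_blocks, block_len, axis=0)
    video_level = np.broadcast_to(video_repr, (n, d))

    # Integrate the three levels by concatenation (an illustrative choice).
    return np.concatenate([frame_level, block_level, video_level], axis=1)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    features = hierarchical_features(rng.normal(size=(12, 8)), block_len=4)
    print(features.shape)
```

In a full model, the combined features would feed a learned regressor that outputs one importance score per frame; that step, along with shot segmentation and key shot selection, is omitted here because the abstract does not specify how it is done.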

Journal: Pattern Recognition
Publication Date: Sep 20, 2021
Citations: 37
