Abstract
This paper addresses supervised video summarization by formulating it as a sequence-to-sequence learning problem, where the input is a sequence of original video frames and the output is a keyshot sequence. Our key idea is to learn a deep summarization network with an attention mechanism that mimics how humans select keyshots. To this end, we propose a novel video summarization framework, attentive encoder-decoder networks for video summarization (AVS), in which the encoder uses a bidirectional long short-term memory (BiLSTM) network to encode the contextual information among the input video frames. For the decoder, two attention-based LSTM networks are explored, using additive and multiplicative objective functions, respectively. Extensive experiments are conducted on two video summarization benchmark datasets, SumMe and TVSum. The results demonstrate the superiority of the proposed AVS-based approaches over state-of-the-art methods, with remarkable improvements on both datasets.
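To make the described architecture concrete, the following is a minimal PyTorch sketch of an attentive encoder-decoder of the kind the abstract outlines: a BiLSTM encoder over frame features and an LSTM decoder whose context vector is computed with either additive or multiplicative attention. The hidden sizes, feature dimension, sigmoid scoring head, and class names are illustrative assumptions, not the paper's exact configuration.

```python
import torch
import torch.nn as nn


class AdditiveAttention(nn.Module):
    """Additive (Bahdanau-style) score: v^T tanh(W_e h_e + W_d s_t)."""
    def __init__(self, enc_dim, dec_dim, attn_dim=128):
        super().__init__()
        self.W_e = nn.Linear(enc_dim, attn_dim, bias=False)
        self.W_d = nn.Linear(dec_dim, attn_dim, bias=False)
        self.v = nn.Linear(attn_dim, 1, bias=False)

    def forward(self, enc_states, dec_state):           # (T, enc_dim), (dec_dim,)
        e = self.v(torch.tanh(self.W_e(enc_states) + self.W_d(dec_state)))
        return torch.softmax(e.squeeze(-1), dim=0)      # attention weights (T,)


class MultiplicativeAttention(nn.Module):
    """Multiplicative (bilinear) score: h_e^T W s_t."""
    def __init__(self, enc_dim, dec_dim):
        super().__init__()
        self.W = nn.Linear(dec_dim, enc_dim, bias=False)

    def forward(self, enc_states, dec_state):
        e = enc_states @ self.W(dec_state)               # (T,)
        return torch.softmax(e, dim=0)


class AttentiveSummarizer(nn.Module):
    """BiLSTM encoder + attention-based LSTM decoder emitting per-frame scores."""
    def __init__(self, feat_dim=1024, hid_dim=256, attention="additive"):
        super().__init__()
        self.encoder = nn.LSTM(feat_dim, hid_dim, bidirectional=True, batch_first=True)
        self.decoder = nn.LSTMCell(2 * hid_dim, hid_dim)
        self.attn = (AdditiveAttention(2 * hid_dim, hid_dim) if attention == "additive"
                     else MultiplicativeAttention(2 * hid_dim, hid_dim))
        self.out = nn.Linear(hid_dim, 1)                 # frame-importance score

    def forward(self, frames):                           # frames: (1, T, feat_dim)
        enc, _ = self.encoder(frames)                    # (1, T, 2*hid_dim)
        enc = enc.squeeze(0)
        h = torch.zeros(1, self.decoder.hidden_size)
        c = torch.zeros_like(h)
        scores = []
        for _ in range(enc.size(0)):
            alpha = self.attn(enc, h.squeeze(0))         # attend over all frames
            context = (alpha.unsqueeze(-1) * enc).sum(0, keepdim=True)
            h, c = self.decoder(context, (h, c))
            scores.append(torch.sigmoid(self.out(h)))
        return torch.cat(scores).squeeze(-1)             # (T,) importance scores
```

In a sketch like this, the predicted per-frame importance scores would then be converted into keyshots (e.g., by segmenting the video and selecting high-scoring segments under a length budget), which is the keyshot-sequence output the abstract refers to.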