Abstract

Video summarization aims to condense video content by extracting pivotal frames or shots. Most existing methods focus on maximizing the overlap between the predicted summary and the ground truth, overlooking whether users can infer the content of the original video from the summary. In addition, these approaches rely heavily on annotated data, which limits their applicability. We therefore propose a reconstructive network under contrastive graph rewards for video summarization, comprising a summary generator and a video reconstructor. The summary generator employs graph contrastive learning to distill essential video information and generate the summary. The video reconstructor, in turn, uses reinforcement learning within an unsupervised training framework to optimize the summary generator, addressing the shortage of annotated video data in summarization tasks. By leveraging a reconstruction loss, our approach ensures that the predicted summary captures the main video content and inter-shot dependencies. Notably, we devise a mutual information maximization reconstruction reward that preserves the information shared between the summary and the original video, helping users comprehend the original video content from the summary. We conduct extensive experiments on the TVSum and SumMe datasets, where our network achieves F1 scores of 58.8% and 48.0%, respectively. The results validate the superiority of our method over state-of-the-art unsupervised methods and many supervised video summarization techniques.
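
To illustrate the idea of a mutual-information-maximization reconstruction reward, the sketch below shows one plausible formulation under assumptions not stated in the abstract: it uses PyTorch and an InfoNCE-style lower bound on the mutual information between original frame features and features reconstructed from the predicted summary. All names, dimensions, and the temperature parameter are illustrative, not taken from the paper.

```python
# Minimal sketch (PyTorch assumed): an InfoNCE-style lower bound on the mutual
# information between original frame features and frame features reconstructed
# from the summary, used as a reward signal for the summary generator.
import torch
import torch.nn.functional as F

def mi_reconstruction_reward(original_feats: torch.Tensor,
                             reconstructed_feats: torch.Tensor,
                             temperature: float = 0.1) -> torch.Tensor:
    """original_feats, reconstructed_feats: (num_frames, feat_dim)."""
    # L2-normalize so dot products become cosine similarities.
    orig = F.normalize(original_feats, dim=-1)
    recon = F.normalize(reconstructed_feats, dim=-1)

    # Pairwise similarity between every original frame and every reconstruction.
    logits = orig @ recon.t() / temperature          # (N, N)

    # InfoNCE: each original frame should match its own reconstruction
    # (the diagonal) rather than reconstructions of other frames.
    targets = torch.arange(orig.size(0), device=orig.device)
    mi_lower_bound = -F.cross_entropy(logits, targets)

    # Higher reward when the summary preserves more shared information
    # with the original video.
    return mi_lower_bound
```

In a reinforcement learning loop of the kind the abstract describes, this reward would be computed from the reconstructor's output and combined with any other rewards before updating the summary generator; the exact reward composition in the paper may differ.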
