Causal Video Summarizer for Video Exploration

Jia-Hong Huang,Marcel Worring,Chao-Han Huck Yang,Andrew Brown,Pin-Yu Chen

doi:10.1109/icme52920.2022.9859948

Abstract

Recently, video summarization has been proposed as a method to help video exploration. However, traditional video summarization models only generate a fixed video summary which is usually independent of user-specific needs and hence limits the effectiveness of video exploration. Multi-modal video summarization is one of the approaches utilized to address this issue. Multi-modal video summarization has a video input and a text-based query input. Hence, effective modeling of the interaction between a video input and text-based query is essential to multi-modal video summarization. In this work, a new causality-based method named Causal Video Summarizer (CVS) is proposed to effectively capture the interactive information between the video and query to tackle the task of multi-modal video summarization. The proposed method consists of a probabilistic encoder and a probabilistic decoder. Based on the evaluation of the existing multi-modal video summarization dataset, experimental results show that the proposed approach is effective with the increase of +5.4% in accuracy and +4.92% increase of F1-score, compared with the state-of-the-art method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Causal Video Summarizer for Video Exploration

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization
Jia-Hong Huang ... Marcel Worring
-
Jia-Hong Huang, et. al.Jia-Hong Huang ... Marcel Worring
24 Aug 2021
24 Aug 2021

Graph-based Multimodal Ranking Models for Multimodal Summarization
Junnan Zhu ... Yu Zhou
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 20
Junnan Zhu, et. al.Junnan Zhu ... Yu Zhou
26 May 2021
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 20

Query-controllable Video Summarization
Jia-Hong Huang ... Marcel Worring
-
Jia-Hong Huang, et. al.Jia-Hong Huang ... Marcel Worring
08 Jun 2020
08 Jun 2020

Deep Learning Assists Surveillance Experts: Toward Video Data Prioritization
Tanveer Hussain ... Samee Ullah Khan
IEEE Transactions on Industrial Informatics | VOL. 19
Tanveer Hussain, et. al.Tanveer Hussain ... Samee Ullah Khan
01 Jul 2023
IEEE Transactions on Industrial Informatics | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Causal Video Summarizer for Video Exploration

Abstract

Talk to us

Similar Papers