Weakly-supervised content-based video moment retrieval using low-rank video representation

Shuwei Huo, Yuan Zhou, Wei Xiang, Sun-Yuan Kung

doi:10.1016/j.knosys.2023.110776

Open Access

https://doi.org/10.1016/j.knosys.2023.110776

Abstract

Content-based video moment retrieval (CVMR) aims to localize a contiguous sequence of frames in an untrimmed reference video, called the target moment, that semantically corresponds to a given query video. Current state-of-the-art CVMR methods are mainly developed using frame-level annotations, which are often expensive to collect. In this paper, we develop a weakly-supervised CVMR method that uses only coarse-grained video-level annotations during training. Under weak supervision, video localizers require more discriminative frame-level video features. To this end, we propose a novel prior, termed the low-rank prior, based on the observation that the frame-level features of a video should have low-rank properties. We demonstrate that low-rank features are more discriminative and help localize action boundaries accurately. To produce low-rank features, we design a low-rank feature reconstruction (LFR) operator, built on a new matrix decomposition approach that generates a low-rank reconstruction of the input matrix while keeping the decomposition process differentiable. Based on the LFR, we develop a new weakly-supervised CVMR model that produces a low-rank video representation and measures semantic consistency to find the segment of the reference video that semantically matches the query video. Extensive experiments demonstrate that our method consistently outperforms state-of-the-art weakly-supervised methods and even achieves performance competitive with fully-supervised baselines.
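To make the idea of a low-rank reconstruction of frame-level features concrete, here is a minimal sketch. It is not the authors' LFR operator: the abstract does not describe their differentiable decomposition, so a plain truncated SVD is swapped in as a stand-in. The function name `low_rank_reconstruct`, the toy dimensions, and the noise level are all hypothetical choices made for illustration.

```python
import numpy as np


def low_rank_reconstruct(features: np.ndarray, rank: int) -> np.ndarray:
    """Return the best rank-`rank` approximation (in Frobenius norm)
    of a (frames x dims) feature matrix, using truncated SVD.

    Assumption: this stands in for the paper's LFR operator, whose
    actual decomposition is not specified in the abstract.
    """
    u, s, vt = np.linalg.svd(features, full_matrices=False)
    k = min(rank, s.shape[0])
    # Keep only the k largest singular components.
    return (u[:, :k] * s[:k]) @ vt[:k, :]


rng = np.random.default_rng(0)

# Hypothetical toy "video": 40 frames with 16-dim features that are
# mixtures of 2 underlying patterns, plus small per-frame noise.
basis = rng.standard_normal((2, 16))
weights = rng.standard_normal((40, 2))
clean = weights @ basis
noisy = clean + 0.05 * rng.standard_normal((40, 16))

recon = low_rank_reconstruct(noisy, rank=2)

print("rank of noisy features:", np.linalg.matrix_rank(noisy))
print("rank of reconstruction:", np.linalg.matrix_rank(recon))
print("error before:", np.linalg.norm(noisy - clean))
print("error after: ", np.linalg.norm(recon - clean))
```

In this toy setting the reconstruction discards the directions dominated by noise, which is one intuition for why low-rank features might be more discriminative. A truncated SVD is used here only because it is simple and well understood; the paper's operator is designed to be differentiable so that it can be trained end to end.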

Journal: Knowledge-Based Systems
Publication Date: Jul 5, 2023
Citations: 3
License type: cc-by

Similar Papers

Weakly-Supervised Video Re-Localization with Multiscale Attention Model
Yung-Han Huang ... Kuang-Jui Hsu
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34
03 Apr 2020

Enhanced low-rank representation via sparse manifold adaption for semi-supervised learning
Yong Peng ... Suhang Wang
Neural Networks | VOL. 65
10 Jan 2015

LR-Net: Low-Rank Spatial-Spectral Network for Hyperspectral Image Denoising.
Hongyan Zhang ... Guangyi Yang
IEEE Transactions on Image Processing | VOL. 30
01 Jan 2020

Robust GBM hyperspectral image unmixing with superpixel segmentation based low rank and sparse representation
Xiaoguang Mei ... Jiayi Ma
Neurocomputing | VOL. 275
05 Dec 2017
