Weakly Supervised Video Moment Localization with Contrastive Negative Sample Mining

Minghang Zheng,Yang Liu,Qingchao Chen,Yanjie Huang

doi:10.1609/aaai.v36i3.20263

Abstract

Video moment localization aims at localizing the video segments which are most related to the given free-form natural language query. The weakly supervised setting, where only video level description is available during training, is getting more and more attention due to its lower annotation cost. Prior weakly supervised methods mainly use sliding windows to generate temporal proposals, which are independent of video content and low quality, and train the model to distinguish matched video-query pairs and unmatched ones collected from different videos, while neglecting what the model needs is to distinguish the unaligned segments within the video. In this work, we propose a novel weakly supervised solution by introducing Contrastive Negative sample Mining (CNM). Specifically, we use a learnable Gaussian mask to generate positive samples, highlighting the video frames most related to the query, and consider other frames of the video and the whole video as easy and hard negative samples respectively. We then train our network with the Intra-Video Contrastive loss to make our positive and negative samples more discriminative. Our method has two advantages: (1) Our proposal generation process with a learnable Gaussian mask is more efficient and makes our positive sample higher quality. (2) The more difficult intra-video negative samples enable our model to distinguish highly confusing scenes. Experiments on two datasets show the effectiveness of our method. Code can be found at https://github.com/minghangz/cnm.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Weakly Supervised Video Moment Localization with Contrastive Negative Sample Mining

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Jun 28, 2022
Citations: 32

Similar Papers

Generating Counterfactual Hard Negative Samples for Graph Contrastive Learning
Haoran Yang ... Xiangyu Zhao
-
Haoran Yang, et. al.Haoran Yang ... Xiangyu Zhao
30 Apr 2023
30 Apr 2023

Distance learning by mining hard and easy negative samples for person re-identification
Xiaoke Zhu ... Xiang Cui
Pattern Recognition | VOL. 95
Xiaoke Zhu, et. al.Xiaoke Zhu ... Xiang Cui
11 Jun 2019
Pattern Recognition | VOL. 95

Hard Sample Aware Network for Contrastive Deep Graph Clustering
Yue Liu ... Xinwang Liu
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 37
Yue Liu, et. al.Yue Liu ... Xinwang Liu
26 Jun 2023
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 37

Enhancing Biomedical ReQA With Adversarial Hard In-Batch Negative Samples.
Bo Zhao ... Wenge Rong
IEEE/ACM Transactions on Computational Biology and Bioinformatics | VOL. 20
Bo Zhao, et. al.Bo Zhao ... Wenge Rong
01 Sep 2023
IEEE/ACM Transactions on Computational Biology and Bioinformatics | VOL. 20

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Weakly Supervised Video Moment Localization with Contrastive Negative Sample Mining

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence