Semantic-driven dual consistency learning for weakly supervised video anomaly detection

Yong Su,Yuyu Tan,Simin An,Meng Xing,Zhiyong Feng

doi:10.1016/j.patcog.2024.110898

Abstract

Video anomaly detection presents a significant challenge in computer vision, with the aim of distinguishing various anomaly events from numerous normal ones. Weakly supervised video anomaly detection has recently emerged as a promising solution, enabling the detection of anomaly snippets with only video-level annotations. However, knowledge about anomaly annotation remains underutilized, resulting in a gap between visual space and semantic understanding of anomalies, thus failing to capture the clear boundary between anomalies and normalities. Therefore, we propose a weakly supervised paradigm of cross-modal detection and consistency learning, leveraging dual consistency to provide discriminative representations for anomalies at both the semantic-to-target and target-to-snippet levels. Specifically, we introduce a cross-modal detection network, which detects the targets in each frame according to given semantic rules, to derive semantic-consistent visual embeddings. To depict the clear boundary between anomalies and normalities, a cross-domain alignment module is proposed to enhance the discriminative representation of abnormal targets by learning the contextual consistency between the target and snippet embeddings. Our architecture integrates the detection of semantic-consistent targets based on variable semantic rules, ensuring transferable deployment across scenarios and enabling comprehensive identification, localization, and recognition of abnormal events through a “when-where-which” pipeline. The evaluation of our approach is conducted on four widely used public benchmarks: ShanghaiTech, UCSD Ped2, CUHK Avenue, and UBnormal through extensive qualitative and quantitative analyzes. The results demonstrate the remarkable performance of our approach in dealing with the VAD task.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Semantic-driven dual consistency learning for weakly supervised video anomaly detection

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Similar Papers

VPE-WSVAD: Visual prompt exemplars for weakly-supervised video anomaly detection
Yong Su ... Simin An
Knowledge-Based Systems | VOL. 299
Yong Su, et. al.Yong Su ... Simin An
31 May 2024
Knowledge-Based Systems | VOL. 299

Cross-Epoch Learning for Weakly Supervised Anomaly Detection in Surveillance Videos
Shenghao Yu ... Qiaomei Mao
IEEE Signal Processing Letters | VOL. 28
Shenghao Yu, et. al.Shenghao Yu ... Qiaomei Mao
01 Jan 2020
IEEE Signal Processing Letters | VOL. 28

Pose Estimation and Detection for Event Recognition using Sense-Aware Features and Adaboost Classifier
Israr Akhter ... Ahmad Jalal
-
Israr Akhter, et. al.Israr Akhter ... Ahmad Jalal
12 Jan 2021
12 Jan 2021

MIST: Multiple Instance Self-Training Framework for Video Anomaly Detection
Jia-Chang Feng ... Fa-Ting Hong
-
Jia-Chang Feng, et. al.Jia-Chang Feng ... Fa-Ting Hong
01 Jun 2021
01 Jun 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Semantic-driven dual consistency learning for weakly supervised video anomaly detection

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition