Video Anomaly Detection Utilizing Efficient Spatiotemporal Feature Fusion with 3D Convolutions and Long Short‐Term Memory Modules

Sareer Ul Amin,Sangoh Park,Bumsoo Kim,Sanghyun Seo,Yonghoon Jung

doi:10.1002/aisy.202300706

Abstract

Surveillance cameras produce vast amounts of video data, posing a challenge for analysts due to the infrequent occurrence of unusual events. To address this, intelligent surveillance systems leverage AI and computer vision to automatically detect anomalies. This study proposes an innovative method combining 3D convolutions and long short‐term memory (LSTM) modules to capture spatiotemporal features in video data. Notably, a structured coarse‐level feature fusion mechanism enhances generalization and mitigates the issue of vanishing gradients. Unlike traditional convolutional neural networks, the approach employs depth‐wise feature stacking, reducing computational complexity and enhancing the architecture. Additionally, it integrates microautoencoder blocks for downsampling, eliminates the computational load of ConvLSTM2D layers, and employs frequent feature concatenation blocks during upsampling to preserve temporal information. Integrating a Conv‐LSTM module at the down‐ and upsampling stages enhances the model's ability to capture short‐ and long‐term temporal features, resulting in a 42‐layer network while maintaining robust performance. Experimental results demonstrate significant reductions in false alarms and improved accuracy compared to contemporary methods, with enhancements of 2.7%, 0.6%, and 3.4% on the UCSDPed1, UCSDPed2, and Avenue datasets, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Video Anomaly Detection Utilizing Efficient Spatiotemporal Feature Fusion with 3D Convolutions and Long Short‐Term Memory Modules

Abstract

Talk to us

Similar Papers

More From: Advanced Intelligent Systems

Lead the way for us

Journal: Advanced Intelligent Systems	Publication Date: Jun 19, 2024
License type: CC BY 4.0

Similar Papers

Deep LSTM-Based Sequence Learning Approaches for Action and Activity Recognition
Amin Ullah ... Khan Muhammad
-
Amin Ullah, et. al.Amin Ullah ... Khan Muhammad
23 Mar 2020
23 Mar 2020

The Real-time Big Data Processing Method Based on LSTM for the Intelligent Workshop Production Process
Wenbo Du ... Zhixiang Zhu
-
Wenbo Du, et. al.Wenbo Du ... Zhixiang Zhu
01 May 2020
01 May 2020

The real-time big data processing method based on LSTM or GRU for the smart job shop production process
Chuang Wang ... Wenbo Du
Journal of Algorithms & Computational Technology | VOL. 14
Chuang Wang, et. al.Chuang Wang ... Wenbo Du
01 Jan 2020
Journal of Algorithms & Computational Technology | VOL. 14

Improved two-stream model for human action recognition
Yuxuan Zhao ... Sheng-Uei Guan
EURASIP Journal on Image and Video Processing | VOL. 2020
Yuxuan Zhao, et. al.Yuxuan Zhao ... Sheng-Uei Guan
17 Jun 2020
EURASIP Journal on Image and Video Processing | VOL. 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Video Anomaly Detection Utilizing Efficient Spatiotemporal Feature Fusion with 3D Convolutions and Long Short‐Term Memory Modules

Abstract

Talk to us

Similar Papers

More From: Advanced Intelligent Systems