Violence Detection In Videos Research Articles

AbstractThe violence detection is mostly achieved through handcrafted feature descriptors, while some researchers have also employed deep learning-based representation models for violent activity recognition. Deep learning-based models have achieved encouraging results for fight activity recognition on benchmark data sets such as hockey and movies. However, these models have limitations in learning discriminating features for violence activity classification with abrupt camera motion. This research work investigated deep representation models using transfer learning for handling the issue of abrupt camera motion. Consequently, a novel deep multi-net (DMN) architecture based on AlexNet and GoogleNet is proposed for violence detection in videos. AlexNet and GoogleNet are top-ranked pre-trained models for image classification with distinct pre-learnt potential features. The fusion of these models can yield superior performance. The proposed DMN unleashed the integrated potential by concurrently coalescing both networks. The results confirmed that DMN outperformed state-of-the-art methods by learning finest discriminating features and achieved 99.82% and 100% accuracy on hockey and movies data sets, respectively. Moreover, DMN has faster learning capability i.e. 1.33 and 2.28 times faster than AlexNet and GoogleNet, which makes it an effective learning architecture on images and videos.

Action recognition is an active research area in computer vision as it has enormous applications in today’s world, out of which, recognizing violent action is of great importance since it is closely related to our safety and security. An intelligent surveillance system is the idea of automatically recognizing suspicious activities in surveillance videos and thereby supporting security personals to take up right action on the right time. Under this area, most of the researchers were focused on people detection and tracking, loitering, etc., whereas detecting violent actions or fights is comparatively a less studied area. Previous works considered the local spatiotemporal feature extractors; however, it accompanies the overhead of complex optical flow estimation. Even though the temporal derivative is a fast alternative to optical flow, it alone gives very low accuracy and scales-dependent result. Hence, here we propose a cascaded method of violence detection based on motion boundary SIFT (MoBSIFT) and movement filtering. In this method, the surveillance videos are checked through a movement filtering algorithm based on temporal derivative and avoid most of the nonviolent actions from going through feature extraction. Only the filtered frames may allow going through feature extraction. In addition to scale-invariant feature transform (SIFT) and histogram of optical flow feature, motion boundary histogram is also extracted and combined to form MoBSIFT descriptor. The experimental results show that the proposed MoBSIFT outperforms the existing methods in accuracy by its high tolerance to camera movements. Time complexity has also proved to be reduced by the use of movement filtering along with MoBSIFT.

Violence Detection In Videos Research Articles

Articles published on Violence Detection In Videos

An ensemble based approach for violence detection in videos using deep transfer learning

A shallow 3D convolutional neural network for violence detection in videos

A Combination of Light Pre-trained Convolutional Neural Networks and Long Short-Term Memory for Real-Time Violence Detection in Videos

Towards evaluating robustness of violence detection in videos using cross-domain transferability

CrimeNet: Neural Structured Learning using Vision Transformer for violence detection

FTCF: Full temporal cross fusion network for violence detection in videos

Violence detection in videos using interest frame extraction and 3D convolutional neural network

Harnessing high-level concepts, visual, and auditory features for violence detection in videos

Fast Learning Through Deep Multi-Net CNN Model For Violence Recognition In Video Surveillance

Violence Detection in Videos by Combining 3D Convolutional Neural Networks and Support Vector Machines

Violence detection in videos for an intelligent surveillance system using MoBSIFT and movement filtering algorithm

Violent activity detection with transfer learning method

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Violence Detection In Videos Research Articles

Articles published on Violence Detection In Videos

An ensemble based approach for violence detection in videos using deep transfer learning

A shallow 3D convolutional neural network for violence detection in videos

A Combination of Light Pre-trained Convolutional Neural Networks and Long Short-Term Memory for Real-Time Violence Detection in Videos

Towards evaluating robustness of violence detection in videos using cross-domain transferability

CrimeNet: Neural Structured Learning using Vision Transformer for violence detection

FTCF: Full temporal cross fusion network for violence detection in videos

Violence detection in videos using interest frame extraction and 3D convolutional neural network

Harnessing high-level concepts, visual, and auditory features for violence detection in videos

Fast Learning Through Deep Multi-Net CNN Model For Violence Recognition In Video Surveillance

Violence Detection in Videos by Combining 3D Convolutional Neural Networks and Support Vector Machines

Violence detection in videos for an intelligent surveillance system using MoBSIFT and movement filtering algorithm

Violent activity detection with transfer learning method