Inflated 3D ConvNet context analysis for violence detection

David Freire-Obregón, Maria De Marsico, Modesto Castrillón-Santana, Paola Barra

doi:10.1007/s00138-021-01264-9

Open Access

https://doi.org/10.1007/s00138-021-01264-9

Abstract

According to the Wall Street Journal, one billion surveillance cameras will be deployed around the world by 2021. This amount of information can hardly be managed by humans. Using an Inflated 3D ConvNet as backbone, this paper introduces a novel automatic violence detection approach that outperforms existing state-of-the-art proposals. Most of those proposals include a pre-processing step that focuses only on regions of interest in the scene, i.e., those actually containing a human subject. In this regard, this paper also reports the results of an extensive analysis of whether and how context affects the performance of the adopted classifier. The experiments show that context-free footage substantially degrades classifier performance (by 2% to 5%) on publicly available datasets. However, they also demonstrate that performance stabilizes in context-free settings, regardless of the level of context restriction applied. Finally, a cross-dataset experiment investigates whether results obtained in a single-collection experiment (same dataset used for training and testing) generalize to cross-collection settings (different datasets used for training and testing).
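The idea of "context-free" footage mentioned in the abstract can be illustrated with a minimal sketch. It assumes that person bounding boxes are already available from some tracker and that context is removed by blanking every pixel outside those boxes. The function name `remove_context`, the box format, and the fill value are hypothetical choices for illustration; the excerpt does not state how the paper actually restricts context.

```python
from typing import List, Tuple

Frame = List[List[int]]          # grayscale frame as rows of pixel intensities
Box = Tuple[int, int, int, int]  # (top, left, bottom, right); bottom/right exclusive


def remove_context(frame: Frame, boxes: List[Box], fill: int = 0) -> Frame:
    """Return a copy of `frame` in which pixels outside every box are set to `fill`.

    Hypothetical helper: the masking strategy is an assumption, not the paper's method.
    """
    height = len(frame)
    width = len(frame[0]) if height else 0
    # Mark which pixels fall inside at least one person box.
    keep = [[False] * width for _ in range(height)]
    for top, left, bottom, right in boxes:
        for r in range(max(0, top), min(height, bottom)):
            for c in range(max(0, left), min(width, right)):
                keep[r][c] = True
    # Copy kept pixels, blank the rest (the "context").
    return [
        [frame[r][c] if keep[r][c] else fill for c in range(width)]
        for r in range(height)
    ]


# Toy 3x4 frame with one person box covering rows 0-1 and columns 1-2.
frame = [[10 * r + c for c in range(4)] for r in range(3)]
masked = remove_context(frame, [(0, 1, 2, 3)])
```

Applying the same function to every frame of a clip, with tighter or looser boxes, would give footage at different levels of context restriction.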

Highlights

– Continuous monitoring of visual streams for the timely detection of emergency/anomalous situations is critical for effective intervention whenever two or more persons can interact, especially in public spaces
– We introduce a violence classifier built on top of a pretrained deep neural network that reports highly competitive results in action recognition
– The 3D ConvNet consists of a 2D convolutional neural network that takes grayscale frames as input, with temporal information as the third dimension
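As a rough illustration of what "inflating" a 2D network means, the sketch below follows the kernel inflation commonly associated with Inflated 3D ConvNets: a 2D kernel is repeated along the time axis and rescaled by the temporal depth, so that a clip made of one repeated frame produces the same response as the original 2D kernel on that frame. The helper names (`inflate_kernel`, `response_2d`, `response_3d`) are hypothetical, and the excerpt does not confirm that the paper's backbone was built exactly this way.

```python
from typing import List

Kernel2D = List[List[float]]
Kernel3D = List[Kernel2D]  # one 2D slice per time step


def inflate_kernel(kernel_2d: Kernel2D, depth: int) -> Kernel3D:
    """Repeat a 2D kernel `depth` times along time and rescale by 1/depth."""
    if depth < 1:
        raise ValueError("depth must be at least 1")
    return [[[w / depth for w in row] for row in kernel_2d] for _ in range(depth)]


def response_2d(patch: Kernel2D, kernel: Kernel2D) -> float:
    """Response of a 2D kernel on a patch of the same size (elementwise product, summed)."""
    return sum(p * w for prow, krow in zip(patch, kernel) for p, w in zip(prow, krow))


def response_3d(clip: List[Kernel2D], kernel: Kernel3D) -> float:
    """Response of a 3D kernel on a clip: sum of per-frame 2D responses."""
    return sum(response_2d(frame, k) for frame, k in zip(clip, kernel))


kernel = [[1.0, 0.0], [0.0, -1.0]]
patch = [[5.0, 2.0], [3.0, 1.0]]
static_clip = [patch] * 4                   # the same frame repeated four times
inflated = inflate_kernel(kernel, depth=4)  # four time slices, each kernel / 4
```

The 1/depth rescaling is what lets pretrained 2D weights serve as a sensible starting point for the 3D filters: on a static clip the inflated filter behaves like the 2D one, and training then adapts it to real motion.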

Summary

Introduction

Continuous monitoring of visual streams for the timely detection of emergency or anomalous situations is critical for effective intervention whenever two or more persons can interact, especially in public spaces. Violence detection stems, in a sense, from action recognition but aims solely at recognizing violent actions. On the one hand, it is more general, since it relies on a purely binary classification; on the other hand, for the same reason, it may prove more complex: it requires training a classifier on a whole class of actions. It is worth clarifying the terms used in the following.

Classical approaches

Deep learning approaches

Violence classification pipeline

People tracking

Two-stream inflated 3D ConvNets for action recognition

Classification approaches

Experimental setup

Datasets

Experimental results

Cross-dataset experiment

Responses to research questions

Conclusions

Journal: Machine Vision and Applications
Publication Date: Dec 31, 2021
Citations: 25
License type: open-access

Similar Papers

Lightweight Violence Detection Model Based on 2D CNN with Bi-Directional Motion Attention
Jingwen Wang ... Haoming Li
Applied Sciences | VOL. 14
05 Jun 2024

A Comprehensive Review on Vision-Based Violence Detection in Surveillance Videos
Fath U Min Ullah ... Amin Ullah
ACM Computing Surveys | VOL. 55
02 Feb 2023

Toward Fast and Accurate Violence Detection for Automated Video Surveillance Applications
Viktor Dènes Huszár ... Imre Négyesi
IEEE Access | VOL. 11
01 Jan 2023

Review of Video Analytics Method for Video Surveillance
M Jayamohan ... S Yuvaraj
11 Feb 2022
