Abstract

Surveillance cameras are being installed in many primary daily living places to maintain public safety. In this video-surveillance context, anomalies occur only for a very short time, and very occasionally. Hence, manual monitoring of such anomalies may be exhaustive and monotonous, resulting in a decrease in reliability and speed in emergency situations due to monitor tiredness. Within this framework, the importance of automatic detection of anomalies is clear, and, therefore, an important amount of research works have been made lately in this topic. According to these earlier studies, supervised approaches perform better than unsupervised ones. However, supervised approaches demand manual annotation, making dependent the system reliability of the different situations used in the training (something difficult to set in anomaly context). In this work, it is proposed an approach for anomaly detection in video-surveillance scenes based on a weakly supervised learning algorithm. Spatio-temporal features are extracted from each surveillance video using a temporal convolutional 3D neural network (T-C3D). Then, a novel ranking loss function increases the distance between the classification scores of anomalous and normal videos, reducing the number of false negatives. The proposal has been evaluated and compared against state-of-art approaches, obtaining competitive performance without fine-tuning, which also validates its generalization capability. In this paper, the proposal design and reliability is presented and analyzed, as well as the aforementioned quantitative and qualitative evaluation in-the-wild scenarios, demonstrating its high sensitivity in anomaly detection in all of them.

Highlights

  • In recent years, surveillance cameras have gained high popularity for taking care of public safety against the increased security threats in various forms, such as robbery, accidents, or illegal and anti-social activities

  • Every video in the training and testing sets is split into 32 non-overlapping video segments, the spatio-temporal features are extracted for each video segment using a temporal convolutional 3D neural network (T-C3D) [30] as a feature extractor

  • Detection is explored in videos captured from surveillance cameras in the wild in real daily living scenarios

Read more

Summary

Introduction

Surveillance cameras have gained high popularity for taking care of public safety against the increased security threats in various forms, such as robbery, accidents, or illegal and anti-social activities. Nowadays, these cameras are commonly installed in public places such as banks, shopping markets, railway and bus stations, crowded streets or heavy traffic areas, which are liable to security threats, in order to guarantee public safety. The task becomes monotonous and boring as the occurrence of abnormal events is very minimal when compared to that of the normal events This drawback may lead to the under-utilization of surveillance cameras. Automation of this anomaly detection task finds its applicability in several industrial contexts such as security guards, traffic security, or crime prevention

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call