Abstract
Unsupervised anomaly detection defines an abnormal event as an event that does not conform to expected behavior. In the field of unsupervised anomaly detection, it is a pioneering work that leverages the difference between a future frame predicted by a generative adversarial network and its ground truth to detect an abnormal event. Based on the work, we improve the ability of video prediction framework to detect abnormal events by enhancing the difference between prediction results for normal and abnormal events. We incorporate super-resolution and self-attention mechanism to design a generative adversarial network. We propose an auto-encoder as a generator, which incorporates dense residual networks and self-attention. Moreover, we propose a new discriminator, which introduces self-attention on the basis of a relativistic discriminator. To predict a future frame with higher quality for normal events, we impose a constraint on the motion in video prediction by fusing optical flow and gradient difference between frames. We also introduce a perception constraint in video prediction to enrich the texture details of a frame. The AUC of our method on CUHK Avenue and Shanghai Tech datasets reaches 89.2% and 75.7% respectively, which is better than most existing methods. In addition, we propose a processing flow that can realize real-time anomaly detection in videos. The average running time of our video prediction framework is 37 frames per second. Among all real-time methods for abnormal event detection in videos, our method is competitive with the state-of-the-art methods.
Highlights
With the development of intelligent security, more and more security cameras are used in various occasions to ensure public safety
We increase the difference between prediction results for normal and abnormal events, thereby improving the accuracy of our framework for abnormal event detection
We leverage the difference between a future frame predicted by a video prediction framework and its ground truth to achieve unsupervised anomaly detection
Summary
With the development of intelligent security, more and more security cameras are used in various occasions to ensure public safety. It will consume a lot of labor and material resources to realize abnormal event detection in videos by only hand. Abnormal event detection in videos is a long-standing and extremely challenging vision problem. Sultani et al [2] proposed an anomaly dataset composed of 1900 real-world surveillance videos, which is currently the largest anomaly dataset. Videos in this dataset are collected from video websites.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.