Abstract

With recent advances in deep learning technologies, many commercialized video surveillance systems have adopted Artificial Intelligence (AI)-powered video analytics technologies as a way to make our life smarter and safer. Nevertheless, there is no robust architecture with an appropriate network model for commercial services considering both high accuracy and low computational cost. Existing deep learning technologies would not be enough to model and represent the dynamics of the real-world scene, so it is difficult to satisfy all environments using a generic model. Appropriate training data from false-alarm and/or missed cases can address this limitation but is rarely available due to legal issues relating to the privacy of personal data and the unpredictability of new incoming data. In this paper, we propose a novel end-to-end hybrid video surveillance architecture for reliable object detection, consisting of front-end and back-end intelligence. For the intelligent front-end, we propose a new object detector with a Multi-scale ResBlock scheme to consider the scalability and flexibility of the system. We are also developing a new domain adaptation method to replace the generic model with each camera&#x2019;s individual personal model by understanding real-time space and context information for intelligent back-end architecture. It is an iterative and continuous process in which new upcoming data and previous models are consistently engaged in a continuous improvement process. We conducted a series of experiments, including an interesting proof-of-concept tests called the <i>Chameleon</i> project, which demonstrated the high accuracy and versatility of the new architecture, while producing robust results that can be implemented in practice.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call