This paper aims to address the problem of profiling human activities captured in surveillance videos for the applications of online normal human activity recognition and anomaly detection. A novel framework is developed for automatic human activity modeling and online anomaly detection without any manual labeling of the training dataset. The framework consists of the following key components: 1) A compact and effective activity representation method is developed based on a stochastic sequence of spatiotemporal actions. 2) The natural grouping of activities is discovered through a novel clustering algorithm with unsupervised model selection. 3) A runtime accumulative anomaly measure is introduced to detect abnormal activities, whereas normal human activities are recognized when sufficient visual evidence has become available based on an online Likelihood Ratio Test (LRT) method. This ensures robust and reliable anomaly detection and normal activity recognition at the shortest possible time. Experimental results demonstrate the effectiveness and robustness of our approach using noisy and sparse datasets collected from a real surveillance scenario.
Read full abstract