Large-scale Video Analysis Research Articles

Intelligent video surveillance (IVS) is always an interesting research topic to utilize visual analysis algorithms for exploring richly structured information from big surveillance data. However, existing IVS systems either struggle to utilize computing resources adequately to improve the efficiency of large-scale video analysis, or present a customized system for specific video analytic functions. It still lacks of a comprehensive computing architecture to enhance efficiency, extensibility and flexibility of IVS system. Moreover, it is also an open problem to study the effect of the combinations of multiple vision modules on the final performance of end applications of IVS system. Motivated by these challenges, we develop an Intelligent Scene Exploration and Evaluation (ISEE) platform based on a heterogeneous CPU-GPU cluster and some distributed computing tools, where Spark Streaming serves as the computing engine for efficient large-scale video processing and Kafka is adopted as a middle-ware message center to decouple different analysis modules flexibly. To validate the efficiency of the ISEE and study the evaluation problem on composable systems, we instantiate the ISEE for an end application on person retrieval with three visual analysis modules, including pedestrian detection with tracking, attribute recognition and re-identification. Extensive experiments are performed on a large-scale surveillance video dataset involving 25 camera scenes, totally 587 hours 720p synchronous videos, where a two-stage question-answering procedure is proposed to measure the performance of execution pipelines composed of multiple visual analysis algorithms based on millions of attribute-based and relationship-based queries. The case study of system-level evaluations may inspire researchers to improve visual analysis algorithms and combining strategies from the view of a scalable and composable system in the future.

Recently, newly invented features (e.g. Fisher vector, VLAD) have achieved state-of-the-art performance in large-scale video analysis systems that aims to understand the contents in videos, such as concept recognition and event detection. However, these features are in high-dimensional representations, which remarkably increases computation costs and correspondingly deteriorates the performance of subsequent learning tasks. Notably, the situation becomes even worse when dealing with large-scale video data where the number of class labels are limited. To address this problem, we propose a novel algorithm to compactly represent huge amounts of unconstrained video data. Specifically, redundant feature dimensions are removed by using our proposed feature selection algorithm. Considering unlabeled videos that are easy to obtain on the web, we apply this feature selection algorithm in a semi-supervised framework coping with a shortage of class information. Different from most of the existing semi-supervised feature selection algorithms, our proposed algorithm does not rely on manifold approximation, i.e. graph Laplacian, which is quite expensive for a large number of data. Thus, it is possible to apply the proposed algorithm to a real large-scale video analysis system. Besides, due to the difficulty of solving the non-smooth objective function, we develop an efficient iterative approach to seeking the global optimum. Extensive experiments are conducted on several real-world video datasets, including KTH, CCV, and HMDB. The experimental results have demonstrated the effectiveness of the proposed algorithm.

Large-scale Video Analysis Research Articles

Related Topics

Articles published on Large-scale Video Analysis

Large-scale Video Analytics with Cloud–Edge Collaborative Continuous Learning

Assessing Cardiac Functions of Zebrafish from Echocardiography Using Deep Learning

Exploring the Temporal Cues to Enhance Video Retrieval on Standardized CDVA

ISEE: An Intelligent Scene Exploration and Evaluation Platform for Large-Scale Visual Surveillance

AI-Oriented Large-Scale Video Management for Smart City: Technologies, Standards, and Beyond

HNIP: Compact Deep Invariant Representations for Video Matching, Localization, and Retrieval

Compact representation for large-scale unconstrained video analysis

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Large-scale Video Analysis Research Articles

Related Topics

Articles published on Large-scale Video Analysis

Large-scale Video Analytics with Cloud–Edge Collaborative Continuous Learning

Assessing Cardiac Functions of Zebrafish from Echocardiography Using Deep Learning

Exploring the Temporal Cues to Enhance Video Retrieval on Standardized CDVA

ISEE: An Intelligent Scene Exploration and Evaluation Platform for Large-Scale Visual Surveillance

AI-Oriented Large-Scale Video Management for Smart City: Technologies, Standards, and Beyond

HNIP: Compact Deep Invariant Representations for Video Matching, Localization, and Retrieval

Compact representation for large-scale unconstrained video analysis