Abstract

Video surveillance image analysis and processing is an important field in computer vision and among the challenging tasks for Person Re-Identification (PRe-ID). The latter aims at finding a target person who has already been identified and appeared on a camera network using a powerful description of their pedestrian images. The success of recent research on person PRe-ID is largely backed into the effective features extraction and representation with a powerful learning of these features to correctly discriminate pedestrian images. To this end, two powerful features, Convolutional Neural Network (CNN) and Local Maximal Occurrence (LOMO) are modeled on a multidimensional data in the proposed method, High-Dimensional Feature Fusion (HDFF). Specifically, a new tensor fusion scheme is introduced to take advantage and combine two types of features in the same tensor data even if its dimensions are not the same. To improve the accuracy, we use Tensor Cross-View Quadratic Analysis (TXQDA) to perform multilinear subspace learning followed by the Cosine similarity for matching. TXQDA efficiently ensures the learning ability and reduces the high dimensionality resulting from high-order tensor data. The effectiveness of the proposed method is verified through experiments on three challenging widely-used PRe-ID datasets namely, VIPeR, GRID, and PRID450S. Extensive experiments show that the proposed method performs very well when compared with recent state-of-the-art methods.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call