In recent years, person re-identification (re-id) has primarily been studied using visible light (VL) images. However, the challenges of employing VL images in nighttime environments have prompted research into using infrared light (IR) images. Yet, the utilization of both VL and IR images in person re-id has resulted in increased computational cost and processing time in multi-modality systems, leading to studies focusing solely on IR images. Nevertheless, IR images, lacking color and texture information, generally yield lower recognition performance in existing person re-id studies. In addition, previous studies have shown that person re-id performance suffers in the presence of complex background noise. To tackle these challenges, this study proposes a new weak saliency ensemble network (WSE-Net) for person re-id using IR images. WSE-Net incorporates a channel reduction of feature (CRF) method to reduce computational cost in the ensemble network, a technique for converting input images into group of patch images and feeding them into the ensemble model to enhance the reduced feature information, and a grouped convolution ensemble network (GCE-Net) that enables the fusion of features extracted from original and attention-guided ensemble models.The performance of person re-id using WSE-Net was evaluated on the Dongguk body-based person recognition database version 1 (DBPerson-Recog-DB1) and the Sun Yat-sen university multiple modality re-identification version 1 (SYSU-MM01). Experimental results demonstrated that on DBPerson-Recog-DB1, WSE-Net achieved 93.65% in rank 1, 95.28% in mean average precision (mAP), and 93.52% in the harmonic mean of precision and recall. Additionally, on SYSU-MM01, WSE-Net achieved 86.85% in rank 1, 44.58% in mAP, and 40.06% in the harmonic mean of precision and recall. Furthermore, the accuracy of WSE-Net on both datasets surpassed that of state-of-the-art methods.
Read full abstract