Abstract

In general, vehicle images have varying resolutions due to vehicles’ movements and different camera settings. However, most existing vehicle re-identification models are single-resolution deep networks trained with pre-uniformly resizing vehicle images, which underestimates adverse effects of varying resolutions and leads to unsatisfactory performance. A straightforward solution for dealing with varying resolutions is to train multiple vehicle re-identification models. Each model is independently trained with images of a specific resolution. However, this straightforward solution requires significant overhead and ignores intrinsic associations among different resolution images. For that, an efficient multi-resolution network (EMRN) is proposed for vehicle re-identification in this paper. Firstly, EMRN embeds a newly-designed multi-resolution feature dimension uniform module (MR-FDUM) behind a traditional backbone network (i.e., ResNet-50). As a result, the whole model can extract fixed dimensional features from different resolution images so that it can be trained with one loss function of fixed dimensional parameters rather than training multiple models. Secondly, a multi-resolution image randomly feeding strategy is designed to train EMRN, making each mini-batch data of a random resolution during the training process. Consequently, EMRN can implicitly learn collaborative multi-resolution features via only a unitary deep network. Experiments on three large scale datasets, i.e., VeRi776, VehicleID, and VRIC, demonstrate that EMRN is superior to state-of-the-art vehicle re-identification methods.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call