Abstract

Despite significant progress made in image semantic segmentation, most research has primarily focused on daytime scenes. Semantic segmentation of nighttime images is equally critical for autonomous driving; however, this task poses more challenges due to inadequate lighting and difficulties associated with obtaining accurate manual annotations. In this paper, we propose a novel method called Latent Domain Knowledge Distillation (LDKD) for accurate nighttime semantic segmentation. Within a teacher-student framework, our LDKD combines domain alignment and knowledge distillation in a mutually reinforcing manner. First, we introduce a bidirectional photometric alignment module capable of generating daytime-like and nighttime-like latent images in real-time. This module effectively bridges the appearance gap between the source domain (daytime) and the target domain (nighttime). Second, we enhance collaboration between the teacher and student networks. The student network unifies image-level and feature-level domain alignment, while the teacher network generates reliable pseudo-labels by distilling knowledge from the latent domain. Furthermore, to account for potential noise in pseudo-labels, we propose a noise-tolerant learning method aimed at mitigating the risks associated with overreliance on pseudo-labels during model training. Extensive experiments on Dark Zurich and ACDC datasets show that our LDKD achieves state-of-the-art performance, demonstrating the effectiveness of our method for nighttime semantic segmentation.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call