LC-MSM: Language-Conditioned Masked Segmentation Model for unsupervised domain adaptation

Young-Eun Kim,Yu-Won Lee,Seong-Whan Lee

doi:10.1016/j.patcog.2023.110201

Abstract

Unsupervised domain adaptation (UDA) is an important research topic in semantic segmentation tasks, wherein pixel-wise annotations are often difficult to collect in a test environment due to their high labeling costs. Previous UDA-based studies trained their segmentation networks using labeled synthetic data and unlabeled realistic data as source and target domains, respectively. However, they often fail to distinguish semantically similar classes, such as person vs. rider and road vs. sidewalk, because these classes are prone to confusion in domain-shifted environments. In this paper, we introduce a Language-Conditioned Masked Segmentation Model (LC-MSM), which is a new framework for the joint learning of context relations and domain-agnostic information for domain-adaptive semantic segmentation. Specifically, we reconstruct semantic labels with masked image conditions on the generalized text embeddings of the corresponding semantic class from OpenCLIP, which contains domain-invariant knowledge from large-scale data. To this end, we correlate the generalized text embeddings onto the per-pixel image feature of a masked image that learned the spatial context to further append domain-agnostic language information to the semantic decoder. This facilitates the generalization of our model to the target domain via the learning of context information within individual training instances, while considering cross-domain representations spanning the entire dataset. LC-MSM achieves an unprecedented UDA performance of 71.8 and 62.8 mIoU on GTA→Cityscapes and SYNTHIA→Cityscapes, respectively, which corresponds to an improvement of +3.5 and +1.9 percent points over the baseline method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

LC-MSM: Language-Conditioned Masked Segmentation Model for unsupervised domain adaptation

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Similar Papers

Unsupervised Adversarial Domain Adaptation Network for Semantic Segmentation
Wei Liu ... Fulin Su
IEEE Geoscience and Remote Sensing Letters | VOL. 17
Wei Liu, et. al.Wei Liu ... Fulin Su
26 Dec 2019
IEEE Geoscience and Remote Sensing Letters | VOL. 17

Super-resolution domain adaptation networks for semantic segmentation via pixel and output level aligning
Junfeng Wu ... Long Gao
Frontiers in Earth Science | VOL. 10
Junfeng Wu, et. al.Junfeng Wu ... Long Gao
25 Aug 2022
Frontiers in Earth Science | VOL. 10

Multi-Anchor Active Domain Adaptation for Semantic Segmentation
Munan Ning ... Shuang Yu
-
Munan Ning, et. al.Munan Ning ... Shuang Yu
01 Oct 2021
01 Oct 2021

DASRSNet: Multitask Domain Adaptation for Super-Resolution-Aided Semantic Segmentation of Remote Sensing Images
Yuxiang Cai ... Zhengwei Shen
IEEE Transactions on Geoscience and Remote Sensing | VOL. 61
Yuxiang Cai, et. al.Yuxiang Cai ... Zhengwei Shen
01 Jan 2023
IEEE Transactions on Geoscience and Remote Sensing | VOL. 61

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

LC-MSM: Language-Conditioned Masked Segmentation Model for unsupervised domain adaptation

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition