Multi-Granularity Denoising and Bidirectional Alignment for Weakly Supervised Semantic Segmentation.

Tao Chen, Yazhou Yao, Jinhui Tang

doi:10.1109/tip.2023.3275913

Abstract

Weakly supervised semantic segmentation (WSSS) models that rely on class activation maps (CAMs) have achieved desirable performance compared with their non-CAM-based counterparts. However, to make the WSSS task feasible, pseudo labels must be generated by expanding the seeds from CAMs, which is complex and time-consuming and thus hinders the design of efficient end-to-end (single-stage) WSSS approaches. To tackle this dilemma, we resort to off-the-shelf, readily accessible saliency maps to obtain pseudo labels directly, given the image-level class labels. Nevertheless, the salient regions may contain noisy labels and may not fit the target objects seamlessly, and saliency maps can only approximate pseudo labels for simple images containing single-class objects. As a result, a segmentation model trained on these simple images cannot generalize well to complex images containing multi-class objects. To this end, we propose an end-to-end multi-granularity denoising and bidirectional alignment (MDBA) model to alleviate the noisy-label and multi-class generalization issues. Specifically, we propose online noise filtering and progressive noise detection modules to tackle image-level and pixel-level noise, respectively. Moreover, a bidirectional alignment mechanism is proposed to reduce the data distribution gap in both the input and output spaces, with simple-to-complex image synthesis and complex-to-simple adversarial learning. MDBA reaches mIoU of 69.5% and 70.2% on the validation and test sets of the PASCAL VOC 2012 dataset, respectively. The source code and models are available at https://github.com/NUST-Machine-Intelligence-Laboratory/MDBA.
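
As a rough illustration of how a saliency map can stand in for a pseudo label on a simple, single-class image, the following minimal Python sketch thresholds the saliency values and assigns the image-level class to the salient pixels. The function name `saliency_to_pseudo_label`, the 0.5 threshold, and the toy 3x3 map are assumptions made for illustration; they are not taken from the paper or the MDBA repository.

```python
from typing import List

BACKGROUND = 0  # background index, as conventionally used for PASCAL VOC masks


def saliency_to_pseudo_label(
    saliency: List[List[float]],
    class_id: int,
    threshold: float = 0.5,  # hypothetical cut-off, not a value from the paper
) -> List[List[int]]:
    """Label salient pixels with the image's single class, the rest as background."""
    if class_id == BACKGROUND:
        raise ValueError("class_id must be a foreground class")
    return [
        [class_id if value >= threshold else BACKGROUND for value in row]
        for row in saliency
    ]


# Toy saliency map in [0, 1]; the bottom-right region is salient.
saliency_map = [
    [0.1, 0.2, 0.1],
    [0.3, 0.9, 0.8],
    [0.2, 0.7, 0.6],
]

# 15 is an arbitrary foreground class index chosen for this example.
pseudo_label = saliency_to_pseudo_label(saliency_map, class_id=15)
```

Labels produced this way inherit whatever errors the saliency map contains, and the rule only applies when the image holds a single foreground class, which are the two limitations the abstract sets out to address.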

Journal: IEEE Transactions on Image Processing
Publication Date: Jan 1, 2023
Citations: 12

Similar Papers

Inferring the Class Conditional Response Map for Weakly Supervised Semantic Segmentation
Weixuan Sun ... Jing Zhang
01 Jan 2021

Learning class-agnostic masks with cross-task refinement for weakly supervised semantic segmentation
Lian Xu ... Dan Xu
Neural Computing and Applications | VOL. 35
19 Jul 2023

Auxiliary Tasks Enhanced Dual-Affinity Learning for Weakly Supervised Semantic Segmentation
Lian Xu ... Dan Xu
IEEE Transactions on Neural Networks and Learning Systems | VOL. PP
13 Mar 2024

Weakly supervised fine-grained semantic segmentation via spatial correlation-guided learning
Zihao Dong ... Xiuli Shao
Computer Vision and Image Understanding | VOL. 236
04 Sep 2023
