Enhancing Query Formulation for Universal Image Segmentation.

Yipeng Qu,Joohee Kim

doi:10.3390/s24061879

Abstract

Recent advancements in image segmentation have been notably driven by Vision Transformers. These transformer-based models offer one versatile network structure capable of handling a variety of segmentation tasks. Despite their effectiveness, the pursuit of enhanced capabilities often leads to more intricate architectures and greater computational demands. OneFormer has responded to these challenges by introducing a query-text contrastive learning strategy active during training only. However, this approach has not completely addressed the inefficiency issues in text generation and the contrastive loss computation. To solve these problems, we introduce Efficient Query Optimizer (EQO), an approach that efficiently utilizes multi-modal data to refine query optimization in image segmentation. Our strategy significantly reduces the complexity of parameters and computations by distilling inter-class and inter-task information from an image into a single template sentence. Furthermore, we propose a novel attention-based contrastive loss. It is designed to facilitate a one-to-many matching mechanism in the loss computation, which helps object queries learn more robust representations. Beyond merely reducing complexity, our model demonstrates superior performance compared to OneFormer across all three segmentation tasks using the Swin-T backbone. Our evaluations on the ADE20K dataset reveal that our model outperforms OneFormer in multiple metrics: by 0.2% in mean Intersection over Union (mIoU), 0.6% in Average Precision (AP), and 0.8% in Panoptic Quality (PQ). These results highlight the efficacy of our model in advancing the field of image segmentation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Sensors (Basel, Switzerland)	Publication Date: Mar 14, 2024
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Enhancing Query Formulation for Universal Image Segmentation.

Abstract

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)

Lead the way for us

Similar Papers

Monotonic sequences for image enhancement and segmentation
Artyom M Grigoryan ... Sos S Agaian
Digital Signal Processing | VOL. 41
Artyom M Grigoryan, et. al.Artyom M Grigoryan ... Sos S Agaian
05 Mar 2015
Digital Signal Processing | VOL. 41

Pan-cancer image segmentation based on feature pyramids and Mask R-CNN framework.
Juan Wang ... Jian Zhou
Medical Physics | VOL. 51
Juan Wang, et. al.Juan Wang ... Jian Zhou
04 Mar 2024
Medical Physics | VOL. 51

Edge Intelligence Empowered Vehicle Detection and Image Segmentation for Autonomous Vehicles
Chen Chen ... Bin Liu
IEEE Transactions on Intelligent Transportation Systems | VOL. 24
Chen Chen, et. al.Chen Chen ... Bin Liu
01 Nov 2023
IEEE Transactions on Intelligent Transportation Systems | VOL. 24

Chapter 5 - Segmentation: intracardiac echocardiography contouring
Haofu Liao ... S Kevin Zhou
Deep Network Design for Medical Image Computing | VOL. -
Haofu Liao, et. al.Haofu Liao ... S Kevin Zhou
01 Jan 2023
Deep Network Design for Medical Image Computing | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Enhancing Query Formulation for Universal Image Segmentation.

Abstract

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)