Efficient multi-task progressive learning for semantic segmentation and disparity estimation

Hanz Cuevas-Velasquez,Alejandro Galán-Cuenca,Robert B Fisher,Antonio Javier Gallego

doi:10.1016/j.patcog.2024.110601

Abstract

Scene understanding is an important area in robotics and autonomous driving. To accomplish these tasks, the 3D structures in the scene have to be inferred to know what the objects and their locations are. To this end, semantic segmentation and disparity estimation networks are typically used, but running them individually is inefficient since they require high-performance resources. A possible solution is to learn both tasks together using a multi-task approach. Some current methods address this problem by learning semantic segmentation and monocular depth together. However, monocular depth estimation from single images is an ill-posed problem. A better solution is to estimate the disparity between two stereo images and take advantage of this additional information to improve the segmentation. This work proposes an efficient multi-task method that jointly learns disparity and semantic segmentation. Employing a Siamese backbone architecture for multi-scale feature extraction, the method integrates specialized branches for disparity estimation and coarse and refined segmentations, leveraging progressive task-specific feature sharing and attention mechanisms to enhance accuracy for solving both tasks concurrently. The proposal achieves state-of-the-art results for joint segmentation and disparity estimation on three distinct datasets: Cityscapes, TrimBot2020 Garden, and S-ROSeS, using only 1/3 of the parameters of previous approaches.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Efficient multi-task progressive learning for semantic segmentation and disparity estimation

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Journal: Pattern Recognition	Publication Date: May 17, 2024
Citations: 1

Similar Papers

3SP-Net: Semantic Segmentation Network with Stereo Image Pairs for Urban Scene Parsing
Lingli Zhou ... Haofeng Zhang
-
Lingli Zhou, et. al.Lingli Zhou ... Haofeng Zhang
01 Jan 2018
01 Jan 2018

Multi-Scale Multi-Task FCN for Semantic Page Segmentation and Table Detection
Dafang He ... Scott Cohen
-
Dafang He, et. al.Dafang He ... Scott Cohen
01 Nov 2017
01 Nov 2017

Competitive Simplicity for Multi-Task Learning for Real-Time Foggy Scene Understanding via Domain Adaptation
Naif Alshammari ... Toby P Breckon
-
Naif Alshammari, et. al.Naif Alshammari ... Toby P Breckon
11 Jul 2021
11 Jul 2021

An End-to-End Geometric Characterization-aware Semantic Instance Segmentation Network for ALS Point Clouds
Jinhong Wang ... Wei Yao
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences | VOL. XLVIII-2-2024
Jinhong Wang, et. al.Jinhong Wang ... Wei Yao
11 Jun 2024
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences | VOL. XLVIII-2-2024

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient multi-task progressive learning for semantic segmentation and disparity estimation

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition