DBCAN: DFormer-Based Cross-Attention Network for RGB Depth Semantic Segmentation

Aihua Wu,Liuxu Fu

doi:10.3390/app14188329

Abstract

Existing RGB-depth semantic segmentation methods primarily rely on symmetric two-stream Convolutional Neural Networks (CNNs) to extract RGB and spatial features separately. However, these architectures have limitations in incorporating spatial features and efficiently fusing RGB and depth information. In this study, we propose a novel architecture called the DFormer-Based Cross-Attention Network (DBCAN), which utilizes DFormer as an encoder for feature extraction and integrates several modifications to address these challenges. While DFormer is leveraged for its strong feature extraction capabilities, our modifications in the decoder focus on improving cross-modal fusion and spatial feature incorporation. We introduce three modules in the decoding process: the Object-Region Generated Module (ORGM), the Feature-Region Relation Module (FRRM), and the Spatial-Semantic Fusion Module (SSFM), which enhance feature interaction and segmentation accuracy. Experimental results on the NYUDepthv2 and SUN-RGBD datasets demonstrate that DBCAN achieves state-of-the-art performance, highlighting the effectiveness of our architectural enhancements in overcoming the limitations of existing models.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DBCAN: DFormer-Based Cross-Attention Network for RGB Depth Semantic Segmentation

Abstract

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Journal: Applied Sciences	Publication Date: Sep 16, 2024
License type: CC BY 4.0

Similar Papers

Deep Unsupervised Workload Sequence Anomaly Detection with Fusion of Spatial and Temporal Features in the Cloud
Mengqing Wang ... Chunhong Liu
-
Mengqing Wang, et. al.Mengqing Wang ... Chunhong Liu
01 Oct 2020
01 Oct 2020

A Multi-Scale and Multi-Level Spectral-Spatial Feature Fusion Network for Hyperspectral Image Classification
Caihong Mu ... Yi Liu
Remote Sensing | VOL. 12
Caihong Mu, et. al.Caihong Mu ... Yi Liu
01 Jan 2020
Remote Sensing | VOL. 12

BDR6D: Bidirectional Deep Residual Fusion Network for 6D Pose Estimation
Penglei Liu ... Qieshi Zhang
IEEE Transactions on Automation Science and Engineering | VOL. 21
Penglei Liu, et. al.Penglei Liu ... Qieshi Zhang
01 Apr 2024
IEEE Transactions on Automation Science and Engineering | VOL. 21

Graph convolutional network – Long short term memory neural network- multi layer perceptron- Gaussian progress regression model: A new deep learning model for predicting ozone concertation
Mohammad Ehteram ... Ahmed El-Shafie
Atmospheric Pollution Research | VOL. 14
Mohammad Ehteram, et. al.Mohammad Ehteram ... Ahmed El-Shafie
18 Apr 2023
Atmospheric Pollution Research | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DBCAN: DFormer-Based Cross-Attention Network for RGB Depth Semantic Segmentation

Abstract

Talk to us

Similar Papers

More From: Applied Sciences