Abstract

Environmental perception is crucial for unmanned mobile platforms such as autonomous vehicles and robots, and precise, fast semantic segmentation of the surrounding scene is a key task for enhancing this capability. Existing real-time semantic segmentation networks are typically based on convolutional neural networks (CNNs); they have achieved good results but still struggle to capture global context features. In recent years, the Transformer architecture has achieved significant success in capturing global context, which benefits segmentation accuracy. However, Transformers tend to neglect local connections, and their computational complexity makes real-time segmentation challenging. We propose DTMC-Net, a lightweight real-time semantic segmentation network that combines the advantages of CNNs and Transformers. We design a residual convolution module, the Lightweight Multi-layer Separable Convolution Attention (LMSCA) module, which reduces the parameter count and performs multi-scale feature fusion to capture local features effectively. We introduce the Simple Dual-Resolution Transformer (SDR Transformer), which uses a lightweight attention mechanism and residual feed-forward networks to capture and preserve features, with multiple bilateral fusions between its two branches to exchange information. The proposed Anti-artifact Aggregation Pyramid Pooling Module (AAPPM) optimizes the upsampling process, refines features, and performs multi-scale feature fusion once more. DTMC-Net contains only 4.2M parameters and achieves good performance on multiple public datasets covering different scenarios.
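To make the CNN side of this design concrete, the following is a minimal PyTorch sketch of a residual block built from multi-scale depthwise-separable convolutions followed by simple channel attention, in the spirit of the LMSCA module described above. It is an illustrative approximation only: the class name, kernel sizes, reduction ratio, and attention design are our assumptions, not the paper's actual implementation.

```python
import torch
import torch.nn as nn


class SeparableConvAttention(nn.Module):
    """Hypothetical residual block in the spirit of LMSCA: multi-scale
    depthwise-separable convolutions fused by a pointwise convolution,
    reweighted by squeeze-and-excitation style channel attention.
    Details are assumptions, not the paper's actual module."""

    def __init__(self, channels: int):
        super().__init__()
        # Depthwise convolutions at two kernel sizes capture local
        # context at multiple scales with few parameters.
        self.dw3 = nn.Conv2d(channels, channels, 3, padding=1, groups=channels)
        self.dw5 = nn.Conv2d(channels, channels, 5, padding=2, groups=channels)
        # Pointwise convolution fuses the concatenated multi-scale features.
        self.pw = nn.Conv2d(2 * channels, channels, 1)
        self.norm = nn.BatchNorm2d(channels)
        self.act = nn.ReLU(inplace=True)
        # Lightweight channel attention (assumed reduction ratio of 4).
        self.attn = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(channels, channels // 4, 1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // 4, channels, 1),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Multi-scale local features, fused and normalized.
        y = torch.cat([self.dw3(x), self.dw5(x)], dim=1)
        y = self.act(self.norm(self.pw(y)))
        # Reweight channels, then add the residual connection.
        y = y * self.attn(y)
        return x + y


if __name__ == "__main__":
    block = SeparableConvAttention(64)
    out = block(torch.randn(1, 64, 128, 256))
    print(out.shape)  # torch.Size([1, 64, 128, 256])
```

The separable factorization is what makes such a block lightweight: a standard k-by-k convolution costs on the order of k²·C² parameters per layer, whereas a depthwise convolution plus pointwise fusion costs roughly k²·C + C², which is why modules of this kind suit real-time segmentation budgets.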
