Abstract

The fusion of global contextual information with local cropped-patch detail is crucial for segmenting ultra-high-resolution images. In this study, we introduce a novel fusion mechanism, global–local deep fusion (GL-Deep Fusion), based on an enhanced transformer architecture that efficiently integrates global context with local detail. Specifically, we propose the global–local synthesis network (GLSNet), a dual-branch network in which one branch processes the entire original image while the other takes cropped local patches as input. Features from the two branches are fused through GL-Deep Fusion, significantly improving the accuracy of ultra-high-resolution image segmentation; the model is particularly effective at identifying small overlapping objects. The dual-branch architecture is also designed to optimize GPU memory utilization, feeding the features each branch extracts into the enhanced transformer framework of GL-Deep Fusion. Benchmarks on the DeepGlobe and Vaihingen datasets demonstrate the efficiency and accuracy of the proposed model: on DeepGlobe it reduces GPU memory usage by 24.1% while improving segmentation accuracy by 0.8% over the baseline, and on Vaihingen it achieves a mean F1 score of 90.2% and a mIoU of 90.9%, highlighting its exceptional memory efficiency and segmentation precision.
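
The abstract describes GL-Deep Fusion as a transformer-based mechanism that merges a global branch (whole image) with a local branch (cropped patches). As a rough illustration of that idea only (not the authors' released code), the sketch below uses cross-attention in which local-patch tokens query global-context tokens; the module names, feature dimensions, and the stand-in convolutional encoder are all assumptions for the sake of a self-contained example.

```python
import torch
import torch.nn as nn

class GLDeepFusion(nn.Module):
    """Illustrative cross-attention fusion: local-patch tokens (queries)
    attend to global-context tokens (keys/values), then pass through a
    transformer-style feed-forward block. Hyperparameters are assumed."""
    def __init__(self, dim=256, heads=8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm1 = nn.LayerNorm(dim)
        self.norm2 = nn.LayerNorm(dim)
        self.ffn = nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(),
                                 nn.Linear(4 * dim, dim))

    def forward(self, local_feats, global_feats):
        # local_feats: (B, N_local, dim); global_feats: (B, N_global, dim)
        fused, _ = self.attn(local_feats, global_feats, global_feats)
        x = self.norm1(local_feats + fused)   # residual + norm
        return self.norm2(x + self.ffn(x))    # residual + norm

class GLSNet(nn.Module):
    """Dual-branch sketch: a shared encoder is applied to the whole image
    (global branch) and to a cropped patch (local branch); GL-Deep Fusion
    merges the two token streams before a per-pixel classification head."""
    def __init__(self, dim=256, num_classes=7):
        super().__init__()
        self.backbone = nn.Sequential(        # stand-in encoder, not the paper's
            nn.Conv2d(3, dim, 3, stride=4, padding=1), nn.ReLU(),
            nn.Conv2d(dim, dim, 3, stride=2, padding=1))
        self.fusion = GLDeepFusion(dim)
        self.head = nn.Conv2d(dim, num_classes, 1)

    def forward(self, patch, full_image):
        lf = self.backbone(patch)             # (B, C, h, w) local features
        gf = self.backbone(full_image)        # (B, C, H, W) global features
        B, C, h, w = lf.shape
        tokens = self.fusion(lf.flatten(2).transpose(1, 2),
                             gf.flatten(2).transpose(1, 2))
        fused = tokens.transpose(1, 2).reshape(B, C, h, w)
        return self.head(fused)               # per-pixel class logits

if __name__ == "__main__":
    model = GLSNet()
    patch = torch.randn(1, 3, 256, 256)       # cropped local patch
    scene = torch.randn(1, 3, 512, 512)       # whole (possibly downsampled) image
    print(model(patch, scene).shape)          # torch.Size([1, 7, 32, 32])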
