Abstract

Despite the good results achieved in unimodal segmentation, the inherent limitations of single-modality data make further performance breakthroughs difficult. For this reason, multi-modal learning is increasingly being explored in remote sensing. To bridge the nonnegligible domain gap, existing multi-modal methods usually map high-dimensional features to low-dimensional spaces as a preprocessing step before feature extraction, which inevitably leads to information loss. To address this issue, in this paper we present a novel Imbalance Knowledge-Driven Multi-modal Network (IKD-Net) that extracts features directly from the heterogeneous multi-modal data of aerial images and LiDAR. IKD-Net mines imbalance information across modalities and uses the stronger modality to drive the refinement of the weaker modality's feature maps from both global and categorical perspectives, through two plug-and-play modules: the Global Knowledge-Guided (GKG) and Class Knowledge-Guided (CKG) gated modules. The whole network is then optimized with a joint loss function. Alongside IKD-Net, we also built a new dataset, the National Agriculture Imagery Program and 3D Elevation Program Combined dataset in California (N3C-California), which provides a dedicated benchmark for multi-modal joint segmentation. In our experiments, IKD-Net outperformed the benchmarks and state-of-the-art methods on both N3C-California and the small-scale ISPRS Vaihingen dataset. As of this paper's submission, IKD-Net ranked first on the real-time leaderboard of the GRSS DFC 2018 challenge evaluation. Our code and the N3C-California dataset are available at https://github.com/wymqqq/IKDNet-pytorch.
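To make the gating idea concrete, the following is a minimal PyTorch sketch in the spirit of a knowledge-guided gated module: features from the stronger modality (e.g. aerial imagery) produce a sigmoid gate that refines the weaker modality's feature map (e.g. rasterized LiDAR). The class name GlobalGuidedGate and the exact gating form are illustrative assumptions, not taken from the released implementation linked above.

```python
import torch
import torch.nn as nn


class GlobalGuidedGate(nn.Module):
    """Hypothetical sketch of a knowledge-guided gate: the strong-modality
    features produce a per-pixel, per-channel gate that refines the
    weak-modality feature map."""

    def __init__(self, channels: int):
        super().__init__()
        # 1x1 convolution + sigmoid map strong-modality features to a gate in [0, 1].
        self.gate = nn.Sequential(
            nn.Conv2d(channels, channels, kernel_size=1),
            nn.Sigmoid(),
        )

    def forward(self, strong: torch.Tensor, weak: torch.Tensor) -> torch.Tensor:
        # Residual gating: keep the weak-modality features and add the gated
        # refinement, so the module stays plug-and-play.
        return weak + self.gate(strong) * weak


if __name__ == "__main__":
    gate = GlobalGuidedGate(channels=64)
    img_feat = torch.randn(2, 64, 128, 128)    # strong modality (imagery) features
    lidar_feat = torch.randn(2, 64, 128, 128)  # weak modality (LiDAR) features
    refined = gate(img_feat, lidar_feat)
    print(refined.shape)  # torch.Size([2, 64, 128, 128])
```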
