Abstract

Point cloud semantic segmentation, which contributes to scene understanding at different scales, is crucial for three-dimensional reconstruction and digital twin cities. However, most current semantic segmentation methods extract multi-scale features through down-sampling operations, so the feature maps at a given scale have only a single receptive field, which leads to the misclassification of spatially similar objects. To effectively capture both geometric features and the semantic information of different receptive fields, a multi-scale voxel-point adaptive fusion network (MVP-Net) is proposed for point cloud semantic segmentation in urban scenes. First, a multi-scale voxel fusion module with a gating mechanism is designed to exploit the semantic representation ability of different receptive fields. Then, a geometric self-attention module is constructed to deeply fuse fine-grained point features with coarse-grained voxel features. Finally, a pyramid decoder is introduced to aggregate context information at different scales and enhance feature representation. The proposed MVP-Net was evaluated on three datasets, Toronto3D, WHU-MLS, and SensatUrban, and achieved superior performance compared with state-of-the-art (SOTA) methods. On the public Toronto3D and SensatUrban datasets, MVP-Net achieved mIoU scores of 84.14% and 59.40%, and overall accuracies of 98.12% and 93.30%, respectively.
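The abstract does not give implementation details, so the following is only a minimal PyTorch sketch of what a gated multi-scale voxel fusion block of the kind described above could look like: several parallel branches with different receptive fields over the same voxel grid, combined by learned per-scale gates. The class name, the use of dilated 3D convolutions, the number of scales, and all layer sizes are illustrative assumptions, not the authors' published design.

```python
# Hypothetical sketch of gated multi-scale voxel feature fusion (assumptions:
# dense voxel grids, dilation-based receptive fields, softmax gating).
import torch
import torch.nn as nn
import torch.nn.functional as F


class GatedMultiScaleVoxelFusion(nn.Module):
    """Fuse voxel features from several receptive fields with learned gates."""

    def __init__(self, channels: int, num_scales: int = 3):
        super().__init__()
        # One 3D convolution per scale; increasing dilation gives each branch
        # a different receptive field at the same spatial resolution.
        self.branches = nn.ModuleList([
            nn.Conv3d(channels, channels, kernel_size=3, padding=d, dilation=d)
            for d in range(1, num_scales + 1)
        ])
        # Gating head: predicts per-voxel, per-scale weights from the
        # concatenated branch outputs.
        self.gate = nn.Sequential(
            nn.Conv3d(channels * num_scales, num_scales, kernel_size=1),
            nn.Softmax(dim=1),
        )

    def forward(self, voxels: torch.Tensor) -> torch.Tensor:
        # voxels: (B, C, D, H, W) dense voxel feature grid
        feats = [F.relu(branch(voxels)) for branch in self.branches]
        gates = self.gate(torch.cat(feats, dim=1))          # (B, S, D, H, W)
        # Weighted sum of the per-scale features using the predicted gates.
        fused = sum(gates[:, i:i + 1] * f for i, f in enumerate(feats))
        return fused


if __name__ == "__main__":
    block = GatedMultiScaleVoxelFusion(channels=32)
    out = block(torch.randn(2, 32, 16, 16, 16))
    print(out.shape)  # torch.Size([2, 32, 16, 16, 16])
```

In this sketch the gating mechanism lets the network adaptively weight receptive fields per voxel, which is one plausible way to realize the "adaptive fusion" the abstract refers to; the paper's actual formulation may differ.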
