Abstract

Existing state-of-the-art point descriptors rely on structure information only, omitting texture information. However, texture information is crucial for humans to distinguish scene parts. Moreover, current learning-based point descriptors are black boxes, and it is unclear how the original points contribute to the final descriptor. This paper proposes a new multimodal fusion method that generates a point cloud registration descriptor from both structure and texture information. Specifically, a novel attention-fusion module is designed to extract weighted texture information for descriptor extraction. In addition, we propose an interpretable module that explains our neural network by visually showing which original points contribute to the final descriptor. We use the descriptor's channel value as the loss, backpropagate it to the target layer, and treat the gradient as the significance of each point to the final descriptor. This work moves one step further toward explainable deep learning in the registration task. Comprehensive experiments on 3DMatch, 3DLoMatch, and KITTI demonstrate that the multimodal fusion descriptor achieves state-of-the-art accuracy and improves the descriptor's distinctiveness. We also demonstrate the ability of our interpretable module to explain the registration descriptor extraction.
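The gradient-based explanation described above can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a hypothetical PyTorch `encoder` that maps an (N, 3) point cloud to a D-dimensional descriptor, and for brevity it backpropagates the chosen descriptor channel to the input points rather than to an intermediate target layer.

```python
# Minimal sketch (assumed interface, not the paper's code):
# gradient magnitude per input point as its significance for one descriptor channel.
import torch

def point_significance(encoder, points, channel):
    """Estimate each point's contribution to one descriptor channel.

    encoder : nn.Module mapping points of shape (N, 3) to a descriptor of shape (D,)
    points  : torch.Tensor of shape (N, 3)
    channel : index of the descriptor channel used as the scalar "loss"
    """
    points = points.clone().detach().requires_grad_(True)
    descriptor = encoder(points)            # (D,)
    # Use the chosen channel value as the loss and backpropagate it.
    descriptor[channel].backward()
    # Per-point gradient magnitude serves as the significance score.
    significance = points.grad.norm(dim=1)  # (N,)
    return significance
```

In the paper the gradient is taken at a chosen target layer inside the network; the sketch collapses this to the input points only to keep the example self-contained.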
