Abstract

Interest point detection and description are highly challenging in indoor environments with repeated or sparse textures and severe illumination changes (referred to as challenging indoor environments, CIE). In such environments, mismatched or misaligned feature points are a serious problem, often resulting in unsatisfactory accuracy in indoor applications such as SLAM. To address this issue, we propose a self-supervised RGB-D cross-modal fusion network (RDFNet) for feature extraction. In RDFNet, a dual-stream structure is introduced to build a pseudo-Siamese network that processes color and depth images simultaneously, and a new two-stage cross-modal reweighted fusion method (TCRF) is developed to fuse RGB and depth features. The TCRF achieves effective fusion in two steps: (1) introducing the reweighting idea and enhancing RGB features with depth features at both the low-level and high-level stages; (2) concatenating the enhanced RGB and depth features. In addition, we add a uniform distribution loss that encourages uniformly distributed feature points. To verify the performance of the proposed model, a new test dataset of specific indoor scenes is created, on which the model is evaluated and compared to other state-of-the-art methods. Experimental results demonstrate its excellent performance in challenging indoor scenarios.
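As a rough illustration of the two-step fusion described above, the following PyTorch sketch realizes one plausible form of TCRF. The abstract does not give implementation details, so the 1x1-conv + sigmoid gate, the residual reweighting form, and the assumption that the RGB and depth streams have matching channel counts at each stage are all illustrative choices, not details taken from the paper.

```python
import torch
import torch.nn as nn

class ReweightBlock(nn.Module):
    """Reweight RGB features with a gate derived from depth features.

    Hypothetical realization: the abstract only states that depth
    features reweight RGB features at each stage; the gate design
    here is an assumption.
    """
    def __init__(self, channels: int):
        super().__init__()
        self.gate = nn.Sequential(
            nn.Conv2d(channels, channels, kernel_size=1),
            nn.Sigmoid(),  # per-pixel, per-channel weights in (0, 1)
        )

    def forward(self, rgb_feat: torch.Tensor, depth_feat: torch.Tensor) -> torch.Tensor:
        # Residual reweighting keeps the original RGB signal intact.
        return rgb_feat * (1.0 + self.gate(depth_feat))


class TCRFusion(nn.Module):
    """Sketch of the two TCRF steps: (1) depth-guided reweighting of RGB
    features at both low- and high-level stages; (2) channel-wise
    concatenation of the enhanced RGB and depth features."""
    def __init__(self, low_channels: int, high_channels: int):
        super().__init__()
        self.low_reweight = ReweightBlock(low_channels)
        self.high_reweight = ReweightBlock(high_channels)

    def forward(self, rgb_low, depth_low, rgb_high, depth_high):
        # Step 1: composite enhancement at both feature levels.
        rgb_low = self.low_reweight(rgb_low, depth_low)
        rgb_high = self.high_reweight(rgb_high, depth_high)
        # Step 2: concatenate enhanced RGB and depth features.
        fused = torch.cat([rgb_high, depth_high], dim=1)
        return rgb_low, fused
```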
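The abstract likewise does not specify the form of the uniform distribution loss. One common way to encourage spatially uniform detections is to penalize the variance of keypoint score mass over a coarse grid; the sketch below implements that hypothetical variant, with the grid size and variance penalty as assumptions.

```python
import torch
import torch.nn.functional as F

def uniform_distribution_loss(score_map: torch.Tensor, grid: int = 8) -> torch.Tensor:
    """Hypothetical uniformity regularizer (not the paper's definition).

    Splits a keypoint score map of shape (B, 1, H, W) into a grid x grid
    lattice and penalizes the variance of the per-cell score mass, which
    pushes detections to spread evenly over the image.
    """
    cell_mass = F.adaptive_avg_pool2d(score_map, grid)  # (B, 1, grid, grid)
    return cell_mass.flatten(start_dim=1).var(dim=1).mean()
```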
