Learning context-aware local feature descriptors for 3D reconstruction

Jian Yang, Jian Zhou, Hao Fan, Junyu Dong, Hui Yu

doi:10.1016/j.neucom.2024.127793

Abstract

Generating generalizable and discriminative descriptors plays a crucial role in image matching and 3D reconstruction. Many existing solutions concentrate on encoding specific invariances, such as illumination or viewpoint invariance, but they often struggle to achieve robustness and generalization. These challenges arise because such solutions, with their limited information capacity, frequently fail to adapt to diverse and demanding environments. In this paper, we introduce a novel approach that aims to maximize the use of hidden feature informativeness to address these challenges. Specifically, we propose the Hierarchical Context-aware Aggregation Network (HCNet), which employs a hierarchical dense feature constraint in a coarse-to-refinement description manner. In this approach, a coarse-level descriptor represents the overall information, while the refinement descriptor captures the detailed information of the image. Leveraging the strengths of both CNN and Transformer architectures, our hierarchical dense feature constraint encodes both local features and long-range information to efficiently generate dense feature descriptions. To boost descriptor informativeness and enhance matching accuracy, we introduce the Context-aware Attention Aggregation (CAA) model, which adaptively aggregates features from various scales in an efficient coarse-to-refinement manner. Additionally, we design a hierarchical triplet training strategy that considers both the variant and invariant properties of hierarchical features, aiming to enhance descriptor informativeness while preserving strong discriminative qualities. Our experiments, conducted on two popular feature-matching benchmarks as well as a challenging long-term visual localization benchmark, demonstrate that our method significantly improves matching accuracy and outperforms state-of-the-art descriptors. Moreover, our approach exhibits superior generalization capabilities in various 3D reconstruction scenarios.
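
The abstract does not spell out the form of the hierarchical triplet training strategy. As a rough illustration only, the sketch below applies a standard triplet margin loss separately to a coarse-level and a refinement-level descriptor triplet and sums the two with weights. The function names, the weights `w_coarse` and `w_fine`, the margin value, and the weighted-sum combination are all assumptions made for this example, not the authors' formulation.

```python
import math


def l2_normalize(v):
    """Scale a descriptor to unit length (left unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def euclidean(a, b):
    """Euclidean distance between two equal-length descriptors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_margin_loss(anchor, positive, negative, margin=1.0):
    """Standard triplet margin loss on L2-normalized descriptors.

    The loss is zero once the negative is at least `margin` farther
    from the anchor than the positive is.
    """
    a, p, n = (l2_normalize(v) for v in (anchor, positive, negative))
    return max(0.0, euclidean(a, p) - euclidean(a, n) + margin)


def hierarchical_triplet_loss(coarse, fine, w_coarse=0.5, w_fine=0.5, margin=1.0):
    """Hypothetical two-level combination: weighted sum of per-level losses.

    `coarse` and `fine` are (anchor, positive, negative) tuples holding the
    descriptors of the same keypoints at the two levels. The weighting
    scheme is an assumption of this sketch.
    """
    return (w_coarse * triplet_margin_loss(*coarse, margin=margin)
            + w_fine * triplet_margin_loss(*fine, margin=margin))


# Toy 2-D descriptors: the coarse triplet is already well separated,
# while the fine triplet has the negative sitting on top of the anchor.
coarse_triplet = ([1.0, 0.0], [1.0, 0.0], [-1.0, 0.0])
fine_triplet = ([1.0, 0.0], [0.0, 1.0], [1.0, 0.0])
loss = hierarchical_triplet_loss(coarse_triplet, fine_triplet)
```

In this toy case only the fine-level triplet contributes to the loss, which shows how a per-level loss can keep penalizing a descriptor that is discriminative at one level but not at the other.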

More From: Neurocomputing

Similar Papers

DeepMatcher: A deep transformer-based network for robust and accurate local feature matching
Tao Xie ... Lijun Zhao
Expert Systems with Applications | VOL. 237
01 Sep 2023

SADGFeat: Learning local features with layer spatial attention and domain generalization
Wenjing Bai ... Jun Hu
Image and Vision Computing | VOL. 146
21 Apr 2024

Dense Residual Network: Enhancing global dense feature flow for character recognition
Zhao Zhang ... Meng Wang
Neural Networks | VOL. 139
25 Feb 2021

3D Affine: An Embedding of Local Image Features for Viewpoint Invariance Using RGB-D Sensor Data
Hamdi Sahloul ... Jun Ota
Sensors | VOL. 19
12 Jan 2019
