HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer

Shuzhe Wang,Zakaria Laskar,Iaroslav Melekhov,Xiaotian Li,Yi Zhao,Giorgos Tolias,Juho Kannala

doi:10.1007/s11263-023-01982-9

Abstract

Visual localization is critical to many applications in computer vision and robotics. To address single-image RGB localization, state-of-the-art feature-based methods match local descriptors between a query image and a pre-built 3D model. Recently, deep neural networks have been exploited to regress the mapping between raw pixels and 3D coordinates in the scene, and thus the matching is implicitly performed by the forward pass through the network. However, in a large and ambiguous environment, learning such a regression task directly can be difficult for a single network. In this work, we present a new hierarchical scene coordinate network to predict pixel scene coordinates in a coarse-to-fine manner from a single RGB image. The proposed method, which is an extension of HSCNet, allows us to train compact models which scale robustly to large environments. It sets a new state-of-the-art for single-image localization on the 7-Scenes, 12-Scenes, Cambridge Landmarks datasets, and the combined indoor scenes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Computer Vision	Publication Date: Feb 6, 2024
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer

Abstract

Talk to us

Similar Papers

More From: International Journal of Computer Vision

Lead the way for us

Similar Papers

Hierarchical Scene Coordinate Classification and Regression for Visual Localization
Xiaotian Li ... Jakob Verbeek
-
Xiaotian Li, et. al.Xiaotian Li ... Jakob Verbeek
28 Nov 2019
28 Nov 2019

Visual Localization Using Sparse Semantic 3D Map
Tianxin Shi ... Xiang Gao
-
Tianxin Shi, et. al.Tianxin Shi ... Xiang Gao
01 Sep 2019
01 Sep 2019

Visual Localization via Few-Shot Scene Region Classification
Siyan Dong ... Marc Pollefeys
-
Siyan Dong, et. al.Siyan Dong ... Marc Pollefeys
01 Sep 2022
01 Sep 2022

VS-Net: Voting with Segmentation for Visual Localization
Zhaoyang Huang ... Hujun Bao
-
Zhaoyang Huang, et. al.Zhaoyang Huang ... Hujun Bao
01 Jun 2021
01 Jun 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer

Abstract

Talk to us

Similar Papers

More From: International Journal of Computer Vision