In the field of cross-view image geolocation, traditional convolutional neural network (CNN)-based learning models yield unsatisfactory fusion performance because they cannot model global correlations. Transformer-based fusion methods compensate for this limitation; however, the Transformer has quadratic computational complexity and high GPU memory consumption. The recent Mamba model, built on a selective state space, models long sequences well while requiring lower GPU memory occupancy and fewer GFLOPs. Applying Mamba to the cross-view image geolocation task is therefore attractive and worth studying. In addition, during image matching (i.e., fusion of satellite/aerial and street-view data), we found that similarity retrieval based on floating-point features incurs high storage occupancy. Efficiently converting floating-point features into hash codes is a possible solution. In this study, we propose a cross-view image geolocation method (S6HG) based purely on Vision Mamba and hashing. S6HG exploits Vision Mamba's strengths in global information modeling and explicit position encoding, together with the low storage occupancy of hash codes. Our method consists of two stages. In the first stage, we use a Siamese network based purely on Vision Mamba to embed features for street-view images and satellite images, respectively; we call this first-stage model S6G. In the second stage, we construct a cross-view autoencoder to further refine and compress the embedded features, and then map the refined features to hash codes. Comprehensive experiments show that S6G achieves superior results on the CVACT dataset and results comparable to state-of-the-art methods on the CVUSA dataset. Notably, when storing the 90,618-image retrieval gallery, floating-point feature-based methods (4096-dimensional) require 170.59 times more storage than S6HG (768-bit hash codes). Furthermore, the inference efficiency of S6G is higher than that of ViT-based methods.
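As a rough illustration of the reported storage gap, the sketch below binarizes a refined feature into a 768-bit hash code and compares its footprint with a 4096-dimensional float32 descriptor. The abstract does not specify the exact mapping from refined features to hash codes, so the sign-based binarization here is only an assumed stand-in, not the authors' method; the feature dimensions are taken from the text.

```python
import numpy as np

def to_hash_code(refined_feat: np.ndarray) -> np.ndarray:
    """Hypothetical mapping: one bit per dimension via the sign of each component,
    packed into bytes for storage (768 bits -> 96 bytes)."""
    bits = (refined_feat > 0).astype(np.uint8)
    return np.packbits(bits)

rng = np.random.default_rng(0)
float_feat = rng.standard_normal(4096).astype(np.float32)   # baseline floating-point descriptor
refined_feat = rng.standard_normal(768).astype(np.float32)  # assumed 768-d refined feature

code = to_hash_code(refined_feat)
float_bytes = float_feat.nbytes   # 4096 * 4 = 16384 bytes per gallery image
hash_bytes = code.nbytes          # 768 / 8  = 96 bytes per gallery image
print(float_bytes / hash_bytes)   # ~170.7x, close to the reported 170.59x
```

The raw-payload ratio (16384 / 96 ≈ 170.7) matches the paper's 170.59x figure up to rounding and any per-sample storage overhead, which supports reading the reported number as a per-image storage ratio over the 90,618-image gallery.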