Interpreting surface geological elements (such as rocks, minerals, soils, and water bodies) is the main task of geological survey, which plays a crucial role in geological environment remote sensing (GERS). However, the characteristics of geological elements, including high variabilities, various morphology, complicated boundaries and imbalanced class distribution, make it still a challenge for deep learning methods to interpret GERS images. Considering the correlations of geological elements as the regionalized variables in geostatistics, the sensitive features of GERS interpretation mainly include three aspects: tonal, textural and structural characteristics within a singular-class elements, spatial and spectral correlations of adjacent elements, and their global tectonic or spatial distribution. Thus, to simulate the manual interpretation process of geologists from local to global and promote GERS interpretation performance, we propose a local-to-global multi-scale feature fusion network (LGMSFNet). A geological object context represents the intra-class semantic dependencies of pixel sets with the same class. And a local feature aggregation module models the channel and spatial association. Then discriminative features are integrated by a global feature fusion module. For the model optimization, we focus on hard examples during the training process to achieve the balanced optimization of various categories. Two research areas that include large-scale rocks, soils and water exposed on the surface are selected. Massive experiments demonstrate the superiority of the LGMSFNet in GERS interpretation.
Read full abstract