Abstract
In the era of remote sensing big data, the intelligent interpretation of remote sensing images is a key technology for mining the value of remote sensing big data and promoting a number of major applications, mainly including land cover classification and extraction. Among these, the rapid extraction of open-pit mining areas plays a vital role in current practices for refined mineral resources development and management and ecological–environmental protection in China. However, existing methods are not accurate enough for classification, not fine enough for boundary extraction, and poor in terms of multi-scale adaptation. To address these issues, we propose a new semantic segmentation model based on Transformer, which is called Segmentation for Mine—SegMine—and consists of a Vision Transformer-based encoder and a lightweight attention mask decoder. The experimental results show that SegMine enhances the network’s ability to obtain local spatial detail information and improves the problem of disappearing small-scale object features and insufficient information expression. It also better preserves the boundary details of open-pit mining areas. Using the metrics of mIoU, precision, recall, and dice, experimental areas were selected for comparative analysis, and the results show that the new method is significantly better than six other existing major Transformer variants.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.