SwinMin: A mineral recognition model incorporating convolution and multi-scale contexts into swin transformer

Liqin Jia, Feng Chen, Mei Yang, Fang Meng, Mingyue He, Hongmin Liu

doi:10.1016/j.cageo.2024.105532

Abstract

Mineral recognition plays a pivotal role in advancing geological survey methodologies and exploration techniques, serving as a cornerstone of contemporary geoscience research. Recently, Transformer-based neural networks have outperformed ConvNets and have become increasingly prominent in vision models. However, adapting Transformer models to mineral photograph recognition presents two significant challenges. Firstly, mineral photograph recognition heavily relies on low-level features such as color, texture, and edges, which Transformers are not intrinsically optimized for. Secondly, the accurate recognition of small-scale objects within mineral images often poses difficulties. To tackle these challenges, we introduce the SwinMin model, specifically designed for mineral photograph recognition. This model incorporates convolutional information into Transformer sequences, thereby enriching the global representation with finer details. Furthermore, we propose a dynamic feature fusion module, which effectively exploits multi-scale contexts, ensuring a more comprehensive representation. Extensive experiments on the mineral photograph datasets demonstrated that SwinMin achieves state-of-the-art performance compared to existing mineral image recognition methods, underlining its potential for reliable and precise mineral image identification.
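The abstract names a dynamic feature fusion module over multi-scale contexts but does not specify it. As a rough illustration of what input-dependent fusion can look like, the following minimal sketch scores each scale's feature vector against a gating vector, turns the scores into softmax weights, and takes the weighted sum. Everything here is an assumption for illustration: the function names `softmax` and `dynamic_fusion`, the gating vectors, and the requirement that all scales are already pooled to the same length are hypothetical and are not taken from the SwinMin paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dynamic_fusion(scale_features, gate_weights):
    """Fuse same-length feature vectors from several scales.

    scale_features: one feature vector per scale (hypothetical input,
        assumed to be pooled to a common length beforehand).
    gate_weights: one gating vector per scale (hypothetical stand-in
        for learned parameters).
    Each scale's score is the dot product of its features with its gate,
    so the mixing weights depend on the input itself.
    """
    scores = [
        sum(f * g for f, g in zip(feat, gate))
        for feat, gate in zip(scale_features, gate_weights)
    ]
    weights = softmax(scores)
    dim = len(scale_features[0])
    fused = [
        sum(w * feat[i] for w, feat in zip(weights, scale_features))
        for i in range(dim)
    ]
    return fused, weights


# Three scales, two channels each; zero gates give every scale equal weight.
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
gates = [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]]
fused, weights = dynamic_fusion(features, gates)
```

With non-zero gates, a scale whose features align with its gate receives a larger weight, which is one simple way a model could emphasize fine-scale detail for small objects. The published module may differ substantially from this sketch.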

Journal: Computers & Geosciences
Publication Date: Jan 11, 2024
Citations: 2

