Abstract
Geologically representative feature engineering is a crucial component in geoscientific applications of machine learning. Many commonly applied feature engineering techniques used to produce input variables for machine learning apply geological knowledge to generic data science techniques, which can lead to ambiguity, geological oversimplification, and/or compounding subjective bias. Workflows that utilize minimally processed input variables attempt to overcome these issues, but often lead to convoluted and uninterpretable results. To address these challenges, new and enhanced feature engineering methods were developed by combining geological knowledge, understanding of data limitations, and a variety of data science techniques. These include non-Euclidean fluid pre-deformation path distance, rheological and chemical contrast, geologically constrained interpolation of characteristic host rock geochemistry, interpolation of mobile element gain/loss, assemblages, magnetic intensity, structural complexity, host rock physical properties. These methods were applied to compiled open-source and new field observations from Archean greenstone terranes in the Abitibi and western Wabigoon sub-provinces of the Superior Province near Timmins and Dryden, Ontario, respectively. Resulting feature maps represent conceptually significant components in magmatic, volcanogenic, and orogenic mineral systems. A comparison of ranked feature importance from random forests to conceptual mineral system models show that the feature maps adequately represent system components, with a few exceptions attributed to biased training data or limited constraint data. The study also highlights the shared importance of several highly ranked features for the three mineral systems, indicating that spatially related mineral systems exploit the same features when available. Comparing feature importance when classifying orogenic Au mineralization in Timmins and Dryden provides insights into the possible cause of contrasting endowment being related to fluid source. The study demonstrates that integrative studies leveraging multi-disciplinary data and methodology have the potential to advance geological understanding, maximize data utility, and generate robust exploration targets.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.