Development and application of feature engineered geological layers for ranking magmatic, volcanogenic, and orogenic system components in Archean greenstone belts

Bibliographic Details
Main Authors: R.M. Montsion, S. Perrouty, M.D. Lindsay, M.W. Jessell, R. Sherlock
Format: Article
Language: English
Published: Elsevier 2024-03-01
Series: Geoscience Frontiers
Online Access: http://www.sciencedirect.com/science/article/pii/S1674987123002268
Description
Summary: Geologically representative feature engineering is a crucial component in geoscientific applications of machine learning. Many commonly applied feature engineering techniques used to produce input variables for machine learning apply geological knowledge to generic data science techniques, which can lead to ambiguity, geological oversimplification, and/or compounding subjective bias. Workflows that utilize minimally processed input variables attempt to overcome these issues, but often lead to convoluted and uninterpretable results. To address these challenges, new and enhanced feature engineering methods were developed by combining geological knowledge, understanding of data limitations, and a variety of data science techniques. These include non-Euclidean fluid pre-deformation path distance, rheological and chemical contrast, geologically constrained interpolation of characteristic host rock geochemistry, interpolation of mobile element gain/loss, assemblages, magnetic intensity, structural complexity, and host rock physical properties. These methods were applied to compiled open-source and new field observations from Archean greenstone terranes in the Abitibi and western Wabigoon sub-provinces of the Superior Province near Timmins and Dryden, Ontario, respectively. Resulting feature maps represent conceptually significant components in magmatic, volcanogenic, and orogenic mineral systems. A comparison of ranked feature importance from random forests to conceptual mineral system models shows that the feature maps adequately represent system components, with a few exceptions attributed to biased training data or limited constraint data. The study also highlights the shared importance of several highly ranked features for the three mineral systems, indicating that spatially related mineral systems exploit the same features when available. Comparing feature importance when classifying orogenic Au mineralization in Timmins and Dryden suggests that the contrasting endowment of the two areas may be related to fluid source. The study demonstrates that integrative studies leveraging multi-disciplinary data and methodology have the potential to advance geological understanding, maximize data utility, and generate robust exploration targets.
ISSN: 1674-9871
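
Illustrative note: one feature named in the summary, a non-Euclidean path distance, measures distance along permeable pathways and not in a straight line. A generic way to approximate such a quantity is a least-cost (Dijkstra) accumulation over a gridded cost surface. The sketch below is only an assumption about how this kind of feature could be computed and is not the authors' implementation; the function name `path_distance`, the toy cost grid, and the 8-connected neighbourhood are all hypothetical choices made for illustration.

```python
import heapq
import math


def path_distance(cost, sources):
    """Accumulated least-cost distance from the nearest source cell.

    cost    : 2D list of per-cell traversal costs (math.inf = impermeable).
    sources : iterable of (row, col) cells where the distance is zero.
    Hypothetical helper; uses Dijkstra's algorithm on an 8-connected grid.
    """
    rows, cols = len(cost), len(cost[0])
    dist = [[math.inf] * cols for _ in range(rows)]
    heap = []
    for r, c in sources:
        dist[r][c] = 0.0
        heapq.heappush(heap, (0.0, r, c))

    steps = [(-1, -1), (-1, 0), (-1, 1), (0, -1),
             (0, 1), (1, -1), (1, 0), (1, 1)]
    while heap:
        d, r, c = heapq.heappop(heap)
        if d > dist[r][c]:
            continue  # stale queue entry, a shorter route was already found
        for dr, dc in steps:
            nr, nc = r + dr, c + dc
            if not (0 <= nr < rows and 0 <= nc < cols):
                continue
            if math.isinf(cost[nr][nc]):
                continue  # impermeable cell blocks the path
            # Step length (1 or sqrt(2)) weighted by the mean cost of both cells.
            nd = d + math.hypot(dr, dc) * 0.5 * (cost[r][c] + cost[nr][nc])
            if nd < dist[nr][nc]:
                dist[nr][nc] = nd
                heapq.heappush(heap, (nd, nr, nc))
    return dist


# Toy grid: a vertical impermeable barrier with a gap in the bottom row.
INF = math.inf
cost = [
    [1.0, INF, 1.0],
    [1.0, INF, 1.0],
    [1.0, 1.0, 1.0],
]
dist = path_distance(cost, sources=[(0, 0)])
straight_line = 2.0  # Euclidean distance from (0, 0) to (0, 2), in cell units
print(dist[0][2], straight_line)
```

In this toy case the path has to detour through the gap in the barrier, so the accumulated distance to the top-right cell exceeds the straight-line distance. A grid of such distances could then serve as one input layer to a classifier.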