Development and application of feature engineered geological layers for ranking magmatic, volcanogenic, and orogenic system components in Archean greenstone belts

Geologically representative feature engineering is a crucial component in geoscientific applications of machine learning. Many commonly applied feature engineering techniques used to produce input variables for machine learning apply geological knowledge to generic data science techniques, which can...

Full description

Bibliographic Details
Main Authors: R.M. Montsion, S. Perrouty, M.D. Lindsay, M.W. Jessell, R. Sherlock
Format: Article
Language:English
Published: Elsevier 2024-03-01
Series:Geoscience Frontiers
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1674987123002268
_version_ 1797311618031288320
author R.M. Montsion
S. Perrouty
M.D. Lindsay
M.W. Jessell
R. Sherlock
author_facet R.M. Montsion
S. Perrouty
M.D. Lindsay
M.W. Jessell
R. Sherlock
author_sort R.M. Montsion
collection DOAJ
description Geologically representative feature engineering is a crucial component in geoscientific applications of machine learning. Many commonly applied feature engineering techniques used to produce input variables for machine learning apply geological knowledge to generic data science techniques, which can lead to ambiguity, geological oversimplification, and/or compounding subjective bias. Workflows that utilize minimally processed input variables attempt to overcome these issues, but often lead to convoluted and uninterpretable results. To address these challenges, new and enhanced feature engineering methods were developed by combining geological knowledge, understanding of data limitations, and a variety of data science techniques. These include non-Euclidean fluid pre-deformation path distance, rheological and chemical contrast, geologically constrained interpolation of characteristic host rock geochemistry, interpolation of mobile element gain/loss, assemblages, magnetic intensity, structural complexity, host rock physical properties. These methods were applied to compiled open-source and new field observations from Archean greenstone terranes in the Abitibi and western Wabigoon sub-provinces of the Superior Province near Timmins and Dryden, Ontario, respectively. Resulting feature maps represent conceptually significant components in magmatic, volcanogenic, and orogenic mineral systems. A comparison of ranked feature importance from random forests to conceptual mineral system models show that the feature maps adequately represent system components, with a few exceptions attributed to biased training data or limited constraint data. The study also highlights the shared importance of several highly ranked features for the three mineral systems, indicating that spatially related mineral systems exploit the same features when available. Comparing feature importance when classifying orogenic Au mineralization in Timmins and Dryden provides insights into the possible cause of contrasting endowment being related to fluid source. The study demonstrates that integrative studies leveraging multi-disciplinary data and methodology have the potential to advance geological understanding, maximize data utility, and generate robust exploration targets.
first_indexed 2024-03-08T02:02:21Z
format Article
id doaj.art-f728661e4ffb4a6e944689bb0ffad537
institution Directory Open Access Journal
issn 1674-9871
language English
last_indexed 2024-03-08T02:02:21Z
publishDate 2024-03-01
publisher Elsevier
record_format Article
series Geoscience Frontiers
spelling doaj.art-f728661e4ffb4a6e944689bb0ffad5372024-02-14T05:14:14ZengElsevierGeoscience Frontiers1674-98712024-03-01152101759Development and application of feature engineered geological layers for ranking magmatic, volcanogenic, and orogenic system components in Archean greenstone beltsR.M. Montsion0S. Perrouty1M.D. Lindsay2M.W. Jessell3R. Sherlock4Mineral Exploration Research Centre, Harquail School of Earth Sciences, Goodman School of Mines, Laurentian University, Sudbury, Ontario P3E 2C6, Canada; Centre for Exploration Targeting, School of Earth Sciences, The University of Western Australia, 35 Stirling Highway, Crawley 6009, Australia; Commonwealth Scientific and Industrial Research Organization, Mineral Resources, 26 Dick Perry Ave, Kensington, WA 6151, Australia; Corresponding author.Mineral Exploration Research Centre, Harquail School of Earth Sciences, Goodman School of Mines, Laurentian University, Sudbury, Ontario P3E 2C6, CanadaCentre for Exploration Targeting, School of Earth Sciences, The University of Western Australia, 35 Stirling Highway, Crawley 6009, Australia; Commonwealth Scientific and Industrial Research Organization, Mineral Resources, 26 Dick Perry Ave, Kensington, WA 6151, Australia; ARC Centre for Data Analytics for Resources and Environments (DARE), Perth and Sydney, AustraliaCentre for Exploration Targeting, School of Earth Sciences, The University of Western Australia, 35 Stirling Highway, Crawley 6009, Australia; ARC Centre for Data Analytics for Resources and Environments (DARE), Perth and Sydney, AustraliaMineral Exploration Research Centre, Harquail School of Earth Sciences, Goodman School of Mines, Laurentian University, Sudbury, Ontario P3E 2C6, CanadaGeologically representative feature engineering is a crucial component in geoscientific applications of machine learning. Many commonly applied feature engineering techniques used to produce input variables for machine learning apply geological knowledge to generic data science techniques, which can lead to ambiguity, geological oversimplification, and/or compounding subjective bias. Workflows that utilize minimally processed input variables attempt to overcome these issues, but often lead to convoluted and uninterpretable results. To address these challenges, new and enhanced feature engineering methods were developed by combining geological knowledge, understanding of data limitations, and a variety of data science techniques. These include non-Euclidean fluid pre-deformation path distance, rheological and chemical contrast, geologically constrained interpolation of characteristic host rock geochemistry, interpolation of mobile element gain/loss, assemblages, magnetic intensity, structural complexity, host rock physical properties. These methods were applied to compiled open-source and new field observations from Archean greenstone terranes in the Abitibi and western Wabigoon sub-provinces of the Superior Province near Timmins and Dryden, Ontario, respectively. Resulting feature maps represent conceptually significant components in magmatic, volcanogenic, and orogenic mineral systems. A comparison of ranked feature importance from random forests to conceptual mineral system models show that the feature maps adequately represent system components, with a few exceptions attributed to biased training data or limited constraint data. The study also highlights the shared importance of several highly ranked features for the three mineral systems, indicating that spatially related mineral systems exploit the same features when available. Comparing feature importance when classifying orogenic Au mineralization in Timmins and Dryden provides insights into the possible cause of contrasting endowment being related to fluid source. The study demonstrates that integrative studies leveraging multi-disciplinary data and methodology have the potential to advance geological understanding, maximize data utility, and generate robust exploration targets.http://www.sciencedirect.com/science/article/pii/S1674987123002268Machine learningRandom forestsMineral systemsMagmatic Ni-Cu-PGEVolcanogenic Massive Sulfide (VMS) Cu-Zn-Pb-Ag(-Au)Orogenic Au
spellingShingle R.M. Montsion
S. Perrouty
M.D. Lindsay
M.W. Jessell
R. Sherlock
Development and application of feature engineered geological layers for ranking magmatic, volcanogenic, and orogenic system components in Archean greenstone belts
Geoscience Frontiers
Machine learning
Random forests
Mineral systems
Magmatic Ni-Cu-PGE
Volcanogenic Massive Sulfide (VMS) Cu-Zn-Pb-Ag(-Au)
Orogenic Au
title Development and application of feature engineered geological layers for ranking magmatic, volcanogenic, and orogenic system components in Archean greenstone belts
title_full Development and application of feature engineered geological layers for ranking magmatic, volcanogenic, and orogenic system components in Archean greenstone belts
title_fullStr Development and application of feature engineered geological layers for ranking magmatic, volcanogenic, and orogenic system components in Archean greenstone belts
title_full_unstemmed Development and application of feature engineered geological layers for ranking magmatic, volcanogenic, and orogenic system components in Archean greenstone belts
title_short Development and application of feature engineered geological layers for ranking magmatic, volcanogenic, and orogenic system components in Archean greenstone belts
title_sort development and application of feature engineered geological layers for ranking magmatic volcanogenic and orogenic system components in archean greenstone belts
topic Machine learning
Random forests
Mineral systems
Magmatic Ni-Cu-PGE
Volcanogenic Massive Sulfide (VMS) Cu-Zn-Pb-Ag(-Au)
Orogenic Au
url http://www.sciencedirect.com/science/article/pii/S1674987123002268
work_keys_str_mv AT rmmontsion developmentandapplicationoffeatureengineeredgeologicallayersforrankingmagmaticvolcanogenicandorogenicsystemcomponentsinarcheangreenstonebelts
AT sperrouty developmentandapplicationoffeatureengineeredgeologicallayersforrankingmagmaticvolcanogenicandorogenicsystemcomponentsinarcheangreenstonebelts
AT mdlindsay developmentandapplicationoffeatureengineeredgeologicallayersforrankingmagmaticvolcanogenicandorogenicsystemcomponentsinarcheangreenstonebelts
AT mwjessell developmentandapplicationoffeatureengineeredgeologicallayersforrankingmagmaticvolcanogenicandorogenicsystemcomponentsinarcheangreenstonebelts
AT rsherlock developmentandapplicationoffeatureengineeredgeologicallayersforrankingmagmaticvolcanogenicandorogenicsystemcomponentsinarcheangreenstonebelts