Machine Learning Algorithms for Lithofacies Classification of the Gulong Shale from the Songliao Basin, China

Lithofacies identification and classification are critical for characterizing the hydrocarbon potential of unconventional resources. Although extensive applications of machine learning models in predicting lithofacies have been applied to conventional reservoir systems, the effectiveness of machine...

Full description

Bibliographic Details
Main Authors: Mingqiu Hou, Yuxiang Xiao, Zhengdong Lei, Zhi Yang, Yihuai Lou, Yuming Liu
Format: Article
Language:English
Published: MDPI AG 2023-03-01
Series:Energies
Subjects:
Online Access:https://www.mdpi.com/1996-1073/16/6/2581
_version_ 1797612152045961216
author Mingqiu Hou
Yuxiang Xiao
Zhengdong Lei
Zhi Yang
Yihuai Lou
Yuming Liu
author_facet Mingqiu Hou
Yuxiang Xiao
Zhengdong Lei
Zhi Yang
Yihuai Lou
Yuming Liu
author_sort Mingqiu Hou
collection DOAJ
description Lithofacies identification and classification are critical for characterizing the hydrocarbon potential of unconventional resources. Although extensive applications of machine learning models in predicting lithofacies have been applied to conventional reservoir systems, the effectiveness of machine learning models in predicting clay-rich, lacustrine shale lithofacies has yet to be tackled. Here, we apply machine learning models to conventional well log data to automatically identify the shale lithofacies of Gulong Shale in the Songliao Basin. The shale lithofacies were classified into six types based on total organic carbon and mineral composition data from core analysis and geochemical logs. We compared the accuracy of Multilayer Perceptron (MLP), Support Vector Machine (SVM), Extreme Gradient Boosting (XGBoost), and Random Forest models. We mitigated the bias of imbalanced data by applying oversampling algorithms. Our results show that ensemble methods (XGBoost and Random Forest) have a better performance in shale lithofacies identification than the other models do, with accuracies of 0.868 and 0.884, respectively. The organic siliceous shale proposed to have the best hydrocarbon potential in Gulong Shale can be identified with F1 scores of 0.853 by XGBoost and 0.877 by Random Forest. Our study suggests that ensemble machine learning models can effectively identify the lithofacies of clay-rich shale from conventional well logs, providing insight into the sweet spot prediction of unconventional reservoirs. Further improvements in model performances can be achieved by adding domain knowledge and employing advanced well log data.
first_indexed 2024-03-11T06:38:18Z
format Article
id doaj.art-e8e881ca5a9a48089da3e0f46dd02afb
institution Directory Open Access Journal
issn 1996-1073
language English
last_indexed 2024-03-11T06:38:18Z
publishDate 2023-03-01
publisher MDPI AG
record_format Article
series Energies
spelling doaj.art-e8e881ca5a9a48089da3e0f46dd02afb2023-11-17T10:48:05ZengMDPI AGEnergies1996-10732023-03-01166258110.3390/en16062581Machine Learning Algorithms for Lithofacies Classification of the Gulong Shale from the Songliao Basin, ChinaMingqiu Hou0Yuxiang Xiao1Zhengdong Lei2Zhi Yang3Yihuai Lou4Yuming Liu5Research Institute of Petroleum Exploration and Development, PetroChina, Beijing 100083, ChinaResearch Institute of Petroleum Exploration and Development, PetroChina, Beijing 100083, ChinaResearch Institute of Petroleum Exploration and Development, PetroChina, Beijing 100083, ChinaResearch Institute of Petroleum Exploration and Development, PetroChina, Beijing 100083, ChinaCenter for Hypergravity Experimental and Interdisciplinary Research, Zhejiang University, Hangzhou 310058, ChinaCollege of Geosciences, China University of Petroleum, Beijing 102249, ChinaLithofacies identification and classification are critical for characterizing the hydrocarbon potential of unconventional resources. Although extensive applications of machine learning models in predicting lithofacies have been applied to conventional reservoir systems, the effectiveness of machine learning models in predicting clay-rich, lacustrine shale lithofacies has yet to be tackled. Here, we apply machine learning models to conventional well log data to automatically identify the shale lithofacies of Gulong Shale in the Songliao Basin. The shale lithofacies were classified into six types based on total organic carbon and mineral composition data from core analysis and geochemical logs. We compared the accuracy of Multilayer Perceptron (MLP), Support Vector Machine (SVM), Extreme Gradient Boosting (XGBoost), and Random Forest models. We mitigated the bias of imbalanced data by applying oversampling algorithms. Our results show that ensemble methods (XGBoost and Random Forest) have a better performance in shale lithofacies identification than the other models do, with accuracies of 0.868 and 0.884, respectively. The organic siliceous shale proposed to have the best hydrocarbon potential in Gulong Shale can be identified with F1 scores of 0.853 by XGBoost and 0.877 by Random Forest. Our study suggests that ensemble machine learning models can effectively identify the lithofacies of clay-rich shale from conventional well logs, providing insight into the sweet spot prediction of unconventional reservoirs. Further improvements in model performances can be achieved by adding domain knowledge and employing advanced well log data.https://www.mdpi.com/1996-1073/16/6/2581machine learning modelsensemble methodsXGBoostrandom forestshale lithofacieswell log
spellingShingle Mingqiu Hou
Yuxiang Xiao
Zhengdong Lei
Zhi Yang
Yihuai Lou
Yuming Liu
Machine Learning Algorithms for Lithofacies Classification of the Gulong Shale from the Songliao Basin, China
Energies
machine learning models
ensemble methods
XGBoost
random forest
shale lithofacies
well log
title Machine Learning Algorithms for Lithofacies Classification of the Gulong Shale from the Songliao Basin, China
title_full Machine Learning Algorithms for Lithofacies Classification of the Gulong Shale from the Songliao Basin, China
title_fullStr Machine Learning Algorithms for Lithofacies Classification of the Gulong Shale from the Songliao Basin, China
title_full_unstemmed Machine Learning Algorithms for Lithofacies Classification of the Gulong Shale from the Songliao Basin, China
title_short Machine Learning Algorithms for Lithofacies Classification of the Gulong Shale from the Songliao Basin, China
title_sort machine learning algorithms for lithofacies classification of the gulong shale from the songliao basin china
topic machine learning models
ensemble methods
XGBoost
random forest
shale lithofacies
well log
url https://www.mdpi.com/1996-1073/16/6/2581
work_keys_str_mv AT mingqiuhou machinelearningalgorithmsforlithofaciesclassificationofthegulongshalefromthesongliaobasinchina
AT yuxiangxiao machinelearningalgorithmsforlithofaciesclassificationofthegulongshalefromthesongliaobasinchina
AT zhengdonglei machinelearningalgorithmsforlithofaciesclassificationofthegulongshalefromthesongliaobasinchina
AT zhiyang machinelearningalgorithmsforlithofaciesclassificationofthegulongshalefromthesongliaobasinchina
AT yihuailou machinelearningalgorithmsforlithofaciesclassificationofthegulongshalefromthesongliaobasinchina
AT yumingliu machinelearningalgorithmsforlithofaciesclassificationofthegulongshalefromthesongliaobasinchina