Advanced Machine Learning Techniques to Improve Hydrological Prediction: A Comparative Analysis of Streamflow Prediction Models

The management of water resources depends heavily on hydrological prediction, and advances in machine learning (ML) present prospects for improving predictive modelling capabilities. This study investigates the use of a variety of widely used machine learning algorithms, such as CatBoost, ElasticNet...

Full description

Bibliographic Details
Main Authors: Vijendra Kumar, Naresh Kedam, Kul Vaibhav Sharma, Darshan J. Mehta, Tommaso Caloiero
Format: Article
Language:English
Published: MDPI AG 2023-07-01
Series:Water
Subjects:
Online Access:https://www.mdpi.com/2073-4441/15/14/2572
_version_ 1797587190891413504
author Vijendra Kumar
Naresh Kedam
Kul Vaibhav Sharma
Darshan J. Mehta
Tommaso Caloiero
author_facet Vijendra Kumar
Naresh Kedam
Kul Vaibhav Sharma
Darshan J. Mehta
Tommaso Caloiero
author_sort Vijendra Kumar
collection DOAJ
description The management of water resources depends heavily on hydrological prediction, and advances in machine learning (ML) present prospects for improving predictive modelling capabilities. This study investigates the use of a variety of widely used machine learning algorithms, such as CatBoost, ElasticNet, k-Nearest Neighbors (KNN), Lasso, Light Gradient Boosting Machine Regressor (LGBM), Linear Regression (LR), Multilayer Perceptron (MLP), Random Forest (RF), Ridge, Stochastic Gradient Descent (SGD), and the Extreme Gradient Boosting Regression Model (XGBoost), to predict the river inflow of the Garudeshwar watershed, a key element in planning for flood control and water supply. The substantial engineering feature used in the study, which incorporates temporal lag and contextual data based on Indian seasons, leads it distinctiveness. The study concludes that the CatBoost method demonstrated remarkable performance across various metrics, including Mean Absolute Error (MAE), Root Mean Square Error (RMSE), and R-squared (R<sup>2</sup>) values, for both training and testing datasets. This was accomplished by an in-depth investigation and model comparison. In contrast to CatBoost, XGBoost and LGBM demonstrated a higher percentage of data points with prediction errors exceeding 35% for moderate inflow numbers above 10,000. CatBoost established itself as a reliable method for hydrological time-series modelling, easily managing both categorical and continuous variables, and thereby greatly enhancing prediction accuracy. The results of this study highlight the value and promise of widely used machine learning algorithms in hydrology and offer valuable insights for academics and industry professionals.
first_indexed 2024-03-11T00:33:39Z
format Article
id doaj.art-d55fd250f8df42349bd8f0b510a9c69d
institution Directory Open Access Journal
issn 2073-4441
language English
last_indexed 2024-03-11T00:33:39Z
publishDate 2023-07-01
publisher MDPI AG
record_format Article
series Water
spelling doaj.art-d55fd250f8df42349bd8f0b510a9c69d2023-11-18T21:47:14ZengMDPI AGWater2073-44412023-07-011514257210.3390/w15142572Advanced Machine Learning Techniques to Improve Hydrological Prediction: A Comparative Analysis of Streamflow Prediction ModelsVijendra Kumar0Naresh Kedam1Kul Vaibhav Sharma2Darshan J. Mehta3Tommaso Caloiero4Department of Civil Engineering, Dr. Vishwanath Karad MIT World Peace University, Pune 411038, Maharashtra, IndiaDepartment of Thermal Engineering and Thermal Engines, Samara National Research University, Moskovskoye Shosse, 34, Samara 443086, RussiaDepartment of Civil Engineering, Dr. Vishwanath Karad MIT World Peace University, Pune 411038, Maharashtra, IndiaDepartment of Civil Engineering, Dr. S. & S. S. Ghandhy Government Engineering College, Surat 395001, Gujarat, IndiaNational Research Council of Italy, Institute for Agricultural and Forest Systems in Mediterranean (CNR-ISAFOM), 87036 Cosenza, ItalyThe management of water resources depends heavily on hydrological prediction, and advances in machine learning (ML) present prospects for improving predictive modelling capabilities. This study investigates the use of a variety of widely used machine learning algorithms, such as CatBoost, ElasticNet, k-Nearest Neighbors (KNN), Lasso, Light Gradient Boosting Machine Regressor (LGBM), Linear Regression (LR), Multilayer Perceptron (MLP), Random Forest (RF), Ridge, Stochastic Gradient Descent (SGD), and the Extreme Gradient Boosting Regression Model (XGBoost), to predict the river inflow of the Garudeshwar watershed, a key element in planning for flood control and water supply. The substantial engineering feature used in the study, which incorporates temporal lag and contextual data based on Indian seasons, leads it distinctiveness. The study concludes that the CatBoost method demonstrated remarkable performance across various metrics, including Mean Absolute Error (MAE), Root Mean Square Error (RMSE), and R-squared (R<sup>2</sup>) values, for both training and testing datasets. This was accomplished by an in-depth investigation and model comparison. In contrast to CatBoost, XGBoost and LGBM demonstrated a higher percentage of data points with prediction errors exceeding 35% for moderate inflow numbers above 10,000. CatBoost established itself as a reliable method for hydrological time-series modelling, easily managing both categorical and continuous variables, and thereby greatly enhancing prediction accuracy. The results of this study highlight the value and promise of widely used machine learning algorithms in hydrology and offer valuable insights for academics and industry professionals.https://www.mdpi.com/2073-4441/15/14/2572hydrological forecastingmachine learningstreamflow predictionCatBoostXGBoostriver inflow prediction
spellingShingle Vijendra Kumar
Naresh Kedam
Kul Vaibhav Sharma
Darshan J. Mehta
Tommaso Caloiero
Advanced Machine Learning Techniques to Improve Hydrological Prediction: A Comparative Analysis of Streamflow Prediction Models
Water
hydrological forecasting
machine learning
streamflow prediction
CatBoost
XGBoost
river inflow prediction
title Advanced Machine Learning Techniques to Improve Hydrological Prediction: A Comparative Analysis of Streamflow Prediction Models
title_full Advanced Machine Learning Techniques to Improve Hydrological Prediction: A Comparative Analysis of Streamflow Prediction Models
title_fullStr Advanced Machine Learning Techniques to Improve Hydrological Prediction: A Comparative Analysis of Streamflow Prediction Models
title_full_unstemmed Advanced Machine Learning Techniques to Improve Hydrological Prediction: A Comparative Analysis of Streamflow Prediction Models
title_short Advanced Machine Learning Techniques to Improve Hydrological Prediction: A Comparative Analysis of Streamflow Prediction Models
title_sort advanced machine learning techniques to improve hydrological prediction a comparative analysis of streamflow prediction models
topic hydrological forecasting
machine learning
streamflow prediction
CatBoost
XGBoost
river inflow prediction
url https://www.mdpi.com/2073-4441/15/14/2572
work_keys_str_mv AT vijendrakumar advancedmachinelearningtechniquestoimprovehydrologicalpredictionacomparativeanalysisofstreamflowpredictionmodels
AT nareshkedam advancedmachinelearningtechniquestoimprovehydrologicalpredictionacomparativeanalysisofstreamflowpredictionmodels
AT kulvaibhavsharma advancedmachinelearningtechniquestoimprovehydrologicalpredictionacomparativeanalysisofstreamflowpredictionmodels
AT darshanjmehta advancedmachinelearningtechniquestoimprovehydrologicalpredictionacomparativeanalysisofstreamflowpredictionmodels
AT tommasocaloiero advancedmachinelearningtechniquestoimprovehydrologicalpredictionacomparativeanalysisofstreamflowpredictionmodels