Machine learning, Water Quality Index, and GIS-based analysis of groundwater quality

Water is essential for life, as it supports bodily functions, nourishes crops, and maintains ecosystems. Drinking water is crucial for maintaining good health and can also contribute to economic development by reducing healthcare costs and improving productivity. In this study, we employed five diff...

Full description

Bibliographic Details
Main Authors: Ghulam Shabir Solangi, Zouhaib Ali, Muhammad Bilal, Muhammad Junaid, Sallahuddin Panhwar, Hareef Ahmed Keerio, Iftikhar Hussain Sohu, Sheeraz Gul Shahani, Noor Zaman
Format: Article
Language:English
Published: IWA Publishing 2024-02-01
Series:Water Practice and Technology
Subjects:
Online Access:http://wpt.iwaponline.com/content/19/2/384
_version_ 1797267979195383808
author Ghulam Shabir Solangi
Zouhaib Ali
Muhammad Bilal
Muhammad Junaid
Sallahuddin Panhwar
Hareef Ahmed Keerio
Iftikhar Hussain Sohu
Sheeraz Gul Shahani
Noor Zaman
author_facet Ghulam Shabir Solangi
Zouhaib Ali
Muhammad Bilal
Muhammad Junaid
Sallahuddin Panhwar
Hareef Ahmed Keerio
Iftikhar Hussain Sohu
Sheeraz Gul Shahani
Noor Zaman
author_sort Ghulam Shabir Solangi
collection DOAJ
description Water is essential for life, as it supports bodily functions, nourishes crops, and maintains ecosystems. Drinking water is crucial for maintaining good health and can also contribute to economic development by reducing healthcare costs and improving productivity. In this study, we employed five different machine learning algorithms – logistic regression (LR), decision tree classifier (DTC), extreme gradient boosting (XGB), random forest (RF), and K-nearest neighbors (KNN) – to analyze the dataset, and their prediction performance were evaluated using four metrics: accuracy, precision, recall, and F1 score. Physiochemical parameters of 30 groundwater samples were analyzed to determine the Water Quality Index (WQI) of Pano Aqil city, Pakistan. The samples were categorized into the following four classes based on their WQI values: excellent water, good water, poor water, and unfit for drinking. The WQI scores showed that only 43.33% of the samples were deemed acceptable for drinking, indicating that the majority (56.67%) were unsuitable. The findings suggest that the DTC and XGB algorithms outperform all other algorithms, achieving overall accuracies of 100% each. In contrast, RF, KNN, and LR exhibit overall accuracies of 88, 75, and 50%, respectively. Researchers seeking to enhance water quality using machine learning can benefit from the models described in this study for water quality prediction. HIGHLIGHTS Groundwater quality is evaluated using the Water Quality Index method.; Machine learning algorithms are used for forecasting groundwater quality.; The predictive capabilities of decision tree classifier, extreme gradient boosting, logistic regression, random forest, and K-nearest neighbors models have been evaluated and compared.;
first_indexed 2024-04-25T01:25:11Z
format Article
id doaj.art-dc498a6f8fa9468da65ff3b287f25ad4
institution Directory Open Access Journal
issn 1751-231X
language English
last_indexed 2024-04-25T01:25:11Z
publishDate 2024-02-01
publisher IWA Publishing
record_format Article
series Water Practice and Technology
spelling doaj.art-dc498a6f8fa9468da65ff3b287f25ad42024-03-09T05:48:59ZengIWA PublishingWater Practice and Technology1751-231X2024-02-0119238440010.2166/wpt.2024.014014Machine learning, Water Quality Index, and GIS-based analysis of groundwater qualityGhulam Shabir Solangi0Zouhaib Ali1Muhammad Bilal2Muhammad Junaid3Sallahuddin Panhwar4Hareef Ahmed Keerio5Iftikhar Hussain Sohu6Sheeraz Gul Shahani7Noor Zaman8 Department of Civil Engineering, Mehran University of Engineering & Technology, Shaheed Zulfiqar Ali Bhutto Campus, Khairpur Mirs, Pakistan Department of Civil Engineering, National University of Sciences and Technology, Baluchistan Campus, Quetta, Pakistan Department of Civil Engineering, National University of Sciences and Technology, Baluchistan Campus, Quetta, Pakistan Department of Civil Engineering, National University of Sciences and Technology, Baluchistan Campus, Quetta, Pakistan Department of Civil Engineering, National University of Sciences and Technology, Baluchistan Campus, Quetta, Pakistan Faculty of Engineering & Quantity Surveying, INTI International University, Persiaran Perdana BBN 1800, Putra Nilai, Nilai, Negeri Sembilan, Malaysia Department of Civil Engineering, Mehran University of Engineering & Technology, Shaheed Zulfiqar Ali Bhutto Campus, Khairpur Mirs, Pakistan Department of Civil Engineering, Mehran University of Engineering & Technology, Shaheed Zulfiqar Ali Bhutto Campus, Khairpur Mirs, Pakistan Department of Civil Engineering, Mehran University of Engineering & Technology, Shaheed Zulfiqar Ali Bhutto Campus, Khairpur Mirs, Pakistan Water is essential for life, as it supports bodily functions, nourishes crops, and maintains ecosystems. Drinking water is crucial for maintaining good health and can also contribute to economic development by reducing healthcare costs and improving productivity. In this study, we employed five different machine learning algorithms – logistic regression (LR), decision tree classifier (DTC), extreme gradient boosting (XGB), random forest (RF), and K-nearest neighbors (KNN) – to analyze the dataset, and their prediction performance were evaluated using four metrics: accuracy, precision, recall, and F1 score. Physiochemical parameters of 30 groundwater samples were analyzed to determine the Water Quality Index (WQI) of Pano Aqil city, Pakistan. The samples were categorized into the following four classes based on their WQI values: excellent water, good water, poor water, and unfit for drinking. The WQI scores showed that only 43.33% of the samples were deemed acceptable for drinking, indicating that the majority (56.67%) were unsuitable. The findings suggest that the DTC and XGB algorithms outperform all other algorithms, achieving overall accuracies of 100% each. In contrast, RF, KNN, and LR exhibit overall accuracies of 88, 75, and 50%, respectively. Researchers seeking to enhance water quality using machine learning can benefit from the models described in this study for water quality prediction. HIGHLIGHTS Groundwater quality is evaluated using the Water Quality Index method.; Machine learning algorithms are used for forecasting groundwater quality.; The predictive capabilities of decision tree classifier, extreme gradient boosting, logistic regression, random forest, and K-nearest neighbors models have been evaluated and compared.;http://wpt.iwaponline.com/content/19/2/384gisgroundwatermachine learningprediction modelswqi
spellingShingle Ghulam Shabir Solangi
Zouhaib Ali
Muhammad Bilal
Muhammad Junaid
Sallahuddin Panhwar
Hareef Ahmed Keerio
Iftikhar Hussain Sohu
Sheeraz Gul Shahani
Noor Zaman
Machine learning, Water Quality Index, and GIS-based analysis of groundwater quality
Water Practice and Technology
gis
groundwater
machine learning
prediction models
wqi
title Machine learning, Water Quality Index, and GIS-based analysis of groundwater quality
title_full Machine learning, Water Quality Index, and GIS-based analysis of groundwater quality
title_fullStr Machine learning, Water Quality Index, and GIS-based analysis of groundwater quality
title_full_unstemmed Machine learning, Water Quality Index, and GIS-based analysis of groundwater quality
title_short Machine learning, Water Quality Index, and GIS-based analysis of groundwater quality
title_sort machine learning water quality index and gis based analysis of groundwater quality
topic gis
groundwater
machine learning
prediction models
wqi
url http://wpt.iwaponline.com/content/19/2/384
work_keys_str_mv AT ghulamshabirsolangi machinelearningwaterqualityindexandgisbasedanalysisofgroundwaterquality
AT zouhaibali machinelearningwaterqualityindexandgisbasedanalysisofgroundwaterquality
AT muhammadbilal machinelearningwaterqualityindexandgisbasedanalysisofgroundwaterquality
AT muhammadjunaid machinelearningwaterqualityindexandgisbasedanalysisofgroundwaterquality
AT sallahuddinpanhwar machinelearningwaterqualityindexandgisbasedanalysisofgroundwaterquality
AT hareefahmedkeerio machinelearningwaterqualityindexandgisbasedanalysisofgroundwaterquality
AT iftikharhussainsohu machinelearningwaterqualityindexandgisbasedanalysisofgroundwaterquality
AT sheerazgulshahani machinelearningwaterqualityindexandgisbasedanalysisofgroundwaterquality
AT noorzaman machinelearningwaterqualityindexandgisbasedanalysisofgroundwaterquality