Machine learning, Water Quality Index, and GIS-based analysis of groundwater quality
Water is essential for life, as it supports bodily functions, nourishes crops, and maintains ecosystems. Drinking water is crucial for maintaining good health and can also contribute to economic development by reducing healthcare costs and improving productivity. In this study, we employed five diff...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IWA Publishing
2024-02-01
|
Series: | Water Practice and Technology |
Subjects: | |
Online Access: | http://wpt.iwaponline.com/content/19/2/384 |
_version_ | 1797267979195383808 |
---|---|
author | Ghulam Shabir Solangi Zouhaib Ali Muhammad Bilal Muhammad Junaid Sallahuddin Panhwar Hareef Ahmed Keerio Iftikhar Hussain Sohu Sheeraz Gul Shahani Noor Zaman |
author_facet | Ghulam Shabir Solangi Zouhaib Ali Muhammad Bilal Muhammad Junaid Sallahuddin Panhwar Hareef Ahmed Keerio Iftikhar Hussain Sohu Sheeraz Gul Shahani Noor Zaman |
author_sort | Ghulam Shabir Solangi |
collection | DOAJ |
description | Water is essential for life, as it supports bodily functions, nourishes crops, and maintains ecosystems. Drinking water is crucial for maintaining good health and can also contribute to economic development by reducing healthcare costs and improving productivity. In this study, we employed five different machine learning algorithms – logistic regression (LR), decision tree classifier (DTC), extreme gradient boosting (XGB), random forest (RF), and K-nearest neighbors (KNN) – to analyze the dataset, and their prediction performance were evaluated using four metrics: accuracy, precision, recall, and F1 score. Physiochemical parameters of 30 groundwater samples were analyzed to determine the Water Quality Index (WQI) of Pano Aqil city, Pakistan. The samples were categorized into the following four classes based on their WQI values: excellent water, good water, poor water, and unfit for drinking. The WQI scores showed that only 43.33% of the samples were deemed acceptable for drinking, indicating that the majority (56.67%) were unsuitable. The findings suggest that the DTC and XGB algorithms outperform all other algorithms, achieving overall accuracies of 100% each. In contrast, RF, KNN, and LR exhibit overall accuracies of 88, 75, and 50%, respectively. Researchers seeking to enhance water quality using machine learning can benefit from the models described in this study for water quality prediction.
HIGHLIGHTS
Groundwater quality is evaluated using the Water Quality Index method.;
Machine learning algorithms are used for forecasting groundwater quality.;
The predictive capabilities of decision tree classifier, extreme gradient boosting, logistic regression, random forest, and K-nearest neighbors models have been evaluated and compared.; |
first_indexed | 2024-04-25T01:25:11Z |
format | Article |
id | doaj.art-dc498a6f8fa9468da65ff3b287f25ad4 |
institution | Directory Open Access Journal |
issn | 1751-231X |
language | English |
last_indexed | 2024-04-25T01:25:11Z |
publishDate | 2024-02-01 |
publisher | IWA Publishing |
record_format | Article |
series | Water Practice and Technology |
spelling | doaj.art-dc498a6f8fa9468da65ff3b287f25ad42024-03-09T05:48:59ZengIWA PublishingWater Practice and Technology1751-231X2024-02-0119238440010.2166/wpt.2024.014014Machine learning, Water Quality Index, and GIS-based analysis of groundwater qualityGhulam Shabir Solangi0Zouhaib Ali1Muhammad Bilal2Muhammad Junaid3Sallahuddin Panhwar4Hareef Ahmed Keerio5Iftikhar Hussain Sohu6Sheeraz Gul Shahani7Noor Zaman8 Department of Civil Engineering, Mehran University of Engineering & Technology, Shaheed Zulfiqar Ali Bhutto Campus, Khairpur Mirs, Pakistan Department of Civil Engineering, National University of Sciences and Technology, Baluchistan Campus, Quetta, Pakistan Department of Civil Engineering, National University of Sciences and Technology, Baluchistan Campus, Quetta, Pakistan Department of Civil Engineering, National University of Sciences and Technology, Baluchistan Campus, Quetta, Pakistan Department of Civil Engineering, National University of Sciences and Technology, Baluchistan Campus, Quetta, Pakistan Faculty of Engineering & Quantity Surveying, INTI International University, Persiaran Perdana BBN 1800, Putra Nilai, Nilai, Negeri Sembilan, Malaysia Department of Civil Engineering, Mehran University of Engineering & Technology, Shaheed Zulfiqar Ali Bhutto Campus, Khairpur Mirs, Pakistan Department of Civil Engineering, Mehran University of Engineering & Technology, Shaheed Zulfiqar Ali Bhutto Campus, Khairpur Mirs, Pakistan Department of Civil Engineering, Mehran University of Engineering & Technology, Shaheed Zulfiqar Ali Bhutto Campus, Khairpur Mirs, Pakistan Water is essential for life, as it supports bodily functions, nourishes crops, and maintains ecosystems. Drinking water is crucial for maintaining good health and can also contribute to economic development by reducing healthcare costs and improving productivity. In this study, we employed five different machine learning algorithms – logistic regression (LR), decision tree classifier (DTC), extreme gradient boosting (XGB), random forest (RF), and K-nearest neighbors (KNN) – to analyze the dataset, and their prediction performance were evaluated using four metrics: accuracy, precision, recall, and F1 score. Physiochemical parameters of 30 groundwater samples were analyzed to determine the Water Quality Index (WQI) of Pano Aqil city, Pakistan. The samples were categorized into the following four classes based on their WQI values: excellent water, good water, poor water, and unfit for drinking. The WQI scores showed that only 43.33% of the samples were deemed acceptable for drinking, indicating that the majority (56.67%) were unsuitable. The findings suggest that the DTC and XGB algorithms outperform all other algorithms, achieving overall accuracies of 100% each. In contrast, RF, KNN, and LR exhibit overall accuracies of 88, 75, and 50%, respectively. Researchers seeking to enhance water quality using machine learning can benefit from the models described in this study for water quality prediction. HIGHLIGHTS Groundwater quality is evaluated using the Water Quality Index method.; Machine learning algorithms are used for forecasting groundwater quality.; The predictive capabilities of decision tree classifier, extreme gradient boosting, logistic regression, random forest, and K-nearest neighbors models have been evaluated and compared.;http://wpt.iwaponline.com/content/19/2/384gisgroundwatermachine learningprediction modelswqi |
spellingShingle | Ghulam Shabir Solangi Zouhaib Ali Muhammad Bilal Muhammad Junaid Sallahuddin Panhwar Hareef Ahmed Keerio Iftikhar Hussain Sohu Sheeraz Gul Shahani Noor Zaman Machine learning, Water Quality Index, and GIS-based analysis of groundwater quality Water Practice and Technology gis groundwater machine learning prediction models wqi |
title | Machine learning, Water Quality Index, and GIS-based analysis of groundwater quality |
title_full | Machine learning, Water Quality Index, and GIS-based analysis of groundwater quality |
title_fullStr | Machine learning, Water Quality Index, and GIS-based analysis of groundwater quality |
title_full_unstemmed | Machine learning, Water Quality Index, and GIS-based analysis of groundwater quality |
title_short | Machine learning, Water Quality Index, and GIS-based analysis of groundwater quality |
title_sort | machine learning water quality index and gis based analysis of groundwater quality |
topic | gis groundwater machine learning prediction models wqi |
url | http://wpt.iwaponline.com/content/19/2/384 |
work_keys_str_mv | AT ghulamshabirsolangi machinelearningwaterqualityindexandgisbasedanalysisofgroundwaterquality AT zouhaibali machinelearningwaterqualityindexandgisbasedanalysisofgroundwaterquality AT muhammadbilal machinelearningwaterqualityindexandgisbasedanalysisofgroundwaterquality AT muhammadjunaid machinelearningwaterqualityindexandgisbasedanalysisofgroundwaterquality AT sallahuddinpanhwar machinelearningwaterqualityindexandgisbasedanalysisofgroundwaterquality AT hareefahmedkeerio machinelearningwaterqualityindexandgisbasedanalysisofgroundwaterquality AT iftikharhussainsohu machinelearningwaterqualityindexandgisbasedanalysisofgroundwaterquality AT sheerazgulshahani machinelearningwaterqualityindexandgisbasedanalysisofgroundwaterquality AT noorzaman machinelearningwaterqualityindexandgisbasedanalysisofgroundwaterquality |