Water quality prediction and classification based on principal component regression and gradient boosting classifier approach

Estimating water quality has been one of the significant challenges faced by the world in recent decades. This paper presents a water quality prediction model utilizing the principal component regression technique. Firstly, the water quality index (WQI) is calculated using the weighted arithmetic in...

Full description

Bibliographic Details
Main Authors: Md. Saikat Islam Khan, Nazrul Islam, Jia Uddin, Sifatul Islam, Mostofa Kamal Nasir
Format: Article
Language:English
Published: Elsevier 2022-09-01
Series:Journal of King Saud University: Computer and Information Sciences
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1319157821001361
_version_ 1798037842413223936
author Md. Saikat Islam Khan
Nazrul Islam
Jia Uddin
Sifatul Islam
Mostofa Kamal Nasir
author_facet Md. Saikat Islam Khan
Nazrul Islam
Jia Uddin
Sifatul Islam
Mostofa Kamal Nasir
author_sort Md. Saikat Islam Khan
collection DOAJ
description Estimating water quality has been one of the significant challenges faced by the world in recent decades. This paper presents a water quality prediction model utilizing the principal component regression technique. Firstly, the water quality index (WQI) is calculated using the weighted arithmetic index method. Secondly, the principal component analysis (PCA) is applied to the dataset, and the most dominant WQI parameters have been extracted. Thirdly, to predict the WQI, different regression algorithms are used to the PCA output. Finally, the Gradient Boosting Classifier is utilized to classify the water quality status. The proposed system is experimentally evaluated on a Gulshan Lake-related dataset. The results demonstrate 95% prediction accuracy for the principal component regression method and 100% classification accuracy for the Gradient Boosting Classifier method, which show credible performance compared with the state-of-art models.
first_indexed 2024-04-11T21:32:04Z
format Article
id doaj.art-40b81dc141584d4abb56a644ebd75099
institution Directory Open Access Journal
issn 1319-1578
language English
last_indexed 2024-04-11T21:32:04Z
publishDate 2022-09-01
publisher Elsevier
record_format Article
series Journal of King Saud University: Computer and Information Sciences
spelling doaj.art-40b81dc141584d4abb56a644ebd750992022-12-22T04:01:55ZengElsevierJournal of King Saud University: Computer and Information Sciences1319-15782022-09-0134847734781Water quality prediction and classification based on principal component regression and gradient boosting classifier approachMd. Saikat Islam Khan0Nazrul Islam1Jia Uddin2Sifatul Islam3Mostofa Kamal Nasir4Department of Computer Science and Engineering, Santosh, Tangail-1902, Bangladesh; Mawlana Bhashani Science and Technology University, Santosh, Tangail 1902, BangladeshDepartment of Information and Communication and Technology, Santosh, Tangail-1902, Bangladesh; Mawlana Bhashani Science and Technology University, Santosh, Tangail 1902, Bangladesh; Corresponding author at: Department of Information and Communication and Technology, Mawlana Bhashani Science and Technology University, Santosh, Tangail 1902, Bangladesh.Department of Technology Studies, Endicott College, Woosong University, Daejeon, South KoreaDepartment of Computer Science and Engineering, Santosh, Tangail-1902, Bangladesh; Mawlana Bhashani Science and Technology University, Santosh, Tangail 1902, BangladeshDepartment of Computer Science and Engineering, Santosh, Tangail-1902, Bangladesh; Mawlana Bhashani Science and Technology University, Santosh, Tangail 1902, BangladeshEstimating water quality has been one of the significant challenges faced by the world in recent decades. This paper presents a water quality prediction model utilizing the principal component regression technique. Firstly, the water quality index (WQI) is calculated using the weighted arithmetic index method. Secondly, the principal component analysis (PCA) is applied to the dataset, and the most dominant WQI parameters have been extracted. Thirdly, to predict the WQI, different regression algorithms are used to the PCA output. Finally, the Gradient Boosting Classifier is utilized to classify the water quality status. The proposed system is experimentally evaluated on a Gulshan Lake-related dataset. The results demonstrate 95% prediction accuracy for the principal component regression method and 100% classification accuracy for the Gradient Boosting Classifier method, which show credible performance compared with the state-of-art models.http://www.sciencedirect.com/science/article/pii/S1319157821001361Water quality indexPrincipal component regressionClassification algorithmBoxplot analysis
spellingShingle Md. Saikat Islam Khan
Nazrul Islam
Jia Uddin
Sifatul Islam
Mostofa Kamal Nasir
Water quality prediction and classification based on principal component regression and gradient boosting classifier approach
Journal of King Saud University: Computer and Information Sciences
Water quality index
Principal component regression
Classification algorithm
Boxplot analysis
title Water quality prediction and classification based on principal component regression and gradient boosting classifier approach
title_full Water quality prediction and classification based on principal component regression and gradient boosting classifier approach
title_fullStr Water quality prediction and classification based on principal component regression and gradient boosting classifier approach
title_full_unstemmed Water quality prediction and classification based on principal component regression and gradient boosting classifier approach
title_short Water quality prediction and classification based on principal component regression and gradient boosting classifier approach
title_sort water quality prediction and classification based on principal component regression and gradient boosting classifier approach
topic Water quality index
Principal component regression
Classification algorithm
Boxplot analysis
url http://www.sciencedirect.com/science/article/pii/S1319157821001361
work_keys_str_mv AT mdsaikatislamkhan waterqualitypredictionandclassificationbasedonprincipalcomponentregressionandgradientboostingclassifierapproach
AT nazrulislam waterqualitypredictionandclassificationbasedonprincipalcomponentregressionandgradientboostingclassifierapproach
AT jiauddin waterqualitypredictionandclassificationbasedonprincipalcomponentregressionandgradientboostingclassifierapproach
AT sifatulislam waterqualitypredictionandclassificationbasedonprincipalcomponentregressionandgradientboostingclassifierapproach
AT mostofakamalnasir waterqualitypredictionandclassificationbasedonprincipalcomponentregressionandgradientboostingclassifierapproach