Air quality modeling for effective environmental management in Uttarakhand, India: A comparison of logistic regression and naive bayes

Introduction: Air pollution increases the load of hospitalization cases, especially for those who have respiratory problems. For effective environmental management, this study aims to compare the performance of two classification algorithms in machine learning (logistic regression and naive bayes)...

Full description

Bibliographic Details
Main Authors: Alka Pant, Sanjay Sharma, Ramesh Chandra Joshi
Format: Article
Language:English
Published: Tehran University of Medical Sciences 2022-09-01
Series:Journal of Air Pollution and Health
Subjects:
Online Access:https://japh.tums.ac.ir/index.php/japh/article/view/407
_version_ 1798031803736391680
author Alka Pant
Sanjay Sharma
Ramesh Chandra Joshi
author_facet Alka Pant
Sanjay Sharma
Ramesh Chandra Joshi
author_sort Alka Pant
collection DOAJ
description Introduction: Air pollution increases the load of hospitalization cases, especially for those who have respiratory problems. For effective environmental management, this study aims to compare the performance of two classification algorithms in machine learning (logistic regression and naive bayes) and to evaluate the selection of the best algorithm for predicting the air quality class. Materials and methods: Pollutants data (PM10, SO2 , NO2) have been collected from the Haldwani, Kashipur and Rudrapur regions in Uttarakhand (India). In part I of the study, the Air Quality Index (AQI) is calculated and assigned a class accordingly. In part II, the performance of algorithms is compared, and the air quality class is predicted through the best algorithm. In part III, accuracy is calculated after comparing the predicted class with the actual class. Then, it is compared with the accuracy of our selected algorithm. Results: The study finds a positive correlation between PM10 and SO2 pollutants. The result shows that the highest accuracy is achieved through logistic regression to predict the air quality class. Further, logistic regression has achieved the same accuracy i.e., 98.70% after comparing predicted values with the actual values. Conclusion: Logistic regression is the best algorithm to predict the air quality class in the regions of Uttarakhand, where pollutants are being measured in the Government’s hospital. The research also indicates that asthma patients in the Kashipur and Rudrapur regions may experience more health effects dueto moderately polluted air quality; however, the situation is improving during the monsoon season.
first_indexed 2024-04-11T20:03:27Z
format Article
id doaj.art-4f7ec5c162df444785f9c7b239dafe22
institution Directory Open Access Journal
issn 2476-3071
language English
last_indexed 2024-04-11T20:03:27Z
publishDate 2022-09-01
publisher Tehran University of Medical Sciences
record_format Article
series Journal of Air Pollution and Health
spelling doaj.art-4f7ec5c162df444785f9c7b239dafe222022-12-22T04:05:27ZengTehran University of Medical SciencesJournal of Air Pollution and Health2476-30712022-09-017310.18502/japh.v7i3.10542Air quality modeling for effective environmental management in Uttarakhand, India: A comparison of logistic regression and naive bayesAlka Pant0Sanjay Sharma1Ramesh Chandra Joshi2Shri Guru Ram Rai University, DehradunSchool of Computer Applications & Information Technology, Shri Guru Ram Rai (S.G.R.R), University, Dehradun, UttarakhandDepartment of Computer Science and Engineering, Graphic Era (Deemed to be University), Dehradun, Uttarakhand Introduction: Air pollution increases the load of hospitalization cases, especially for those who have respiratory problems. For effective environmental management, this study aims to compare the performance of two classification algorithms in machine learning (logistic regression and naive bayes) and to evaluate the selection of the best algorithm for predicting the air quality class. Materials and methods: Pollutants data (PM10, SO2 , NO2) have been collected from the Haldwani, Kashipur and Rudrapur regions in Uttarakhand (India). In part I of the study, the Air Quality Index (AQI) is calculated and assigned a class accordingly. In part II, the performance of algorithms is compared, and the air quality class is predicted through the best algorithm. In part III, accuracy is calculated after comparing the predicted class with the actual class. Then, it is compared with the accuracy of our selected algorithm. Results: The study finds a positive correlation between PM10 and SO2 pollutants. The result shows that the highest accuracy is achieved through logistic regression to predict the air quality class. Further, logistic regression has achieved the same accuracy i.e., 98.70% after comparing predicted values with the actual values. Conclusion: Logistic regression is the best algorithm to predict the air quality class in the regions of Uttarakhand, where pollutants are being measured in the Government’s hospital. The research also indicates that asthma patients in the Kashipur and Rudrapur regions may experience more health effects dueto moderately polluted air quality; however, the situation is improving during the monsoon season. https://japh.tums.ac.ir/index.php/japh/article/view/407Pollutants; Air Quality; Logistic regression; Naive bayes; Environmental management
spellingShingle Alka Pant
Sanjay Sharma
Ramesh Chandra Joshi
Air quality modeling for effective environmental management in Uttarakhand, India: A comparison of logistic regression and naive bayes
Journal of Air Pollution and Health
Pollutants; Air Quality; Logistic regression; Naive bayes; Environmental management
title Air quality modeling for effective environmental management in Uttarakhand, India: A comparison of logistic regression and naive bayes
title_full Air quality modeling for effective environmental management in Uttarakhand, India: A comparison of logistic regression and naive bayes
title_fullStr Air quality modeling for effective environmental management in Uttarakhand, India: A comparison of logistic regression and naive bayes
title_full_unstemmed Air quality modeling for effective environmental management in Uttarakhand, India: A comparison of logistic regression and naive bayes
title_short Air quality modeling for effective environmental management in Uttarakhand, India: A comparison of logistic regression and naive bayes
title_sort air quality modeling for effective environmental management in uttarakhand india a comparison of logistic regression and naive bayes
topic Pollutants; Air Quality; Logistic regression; Naive bayes; Environmental management
url https://japh.tums.ac.ir/index.php/japh/article/view/407
work_keys_str_mv AT alkapant airqualitymodelingforeffectiveenvironmentalmanagementinuttarakhandindiaacomparisonoflogisticregressionandnaivebayes
AT sanjaysharma airqualitymodelingforeffectiveenvironmentalmanagementinuttarakhandindiaacomparisonoflogisticregressionandnaivebayes
AT rameshchandrajoshi airqualitymodelingforeffectiveenvironmentalmanagementinuttarakhandindiaacomparisonoflogisticregressionandnaivebayes