Air quality modeling for effective environmental management in Uttarakhand, India: A comparison of logistic regression and naive bayes
Introduction: Air pollution increases the load of hospitalization cases, especially for those who have respiratory problems. For effective environmental management, this study aims to compare the performance of two classification algorithms in machine learning (logistic regression and naive bayes)...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Tehran University of Medical Sciences
2022-09-01
|
Series: | Journal of Air Pollution and Health |
Subjects: | |
Online Access: | https://japh.tums.ac.ir/index.php/japh/article/view/407 |
_version_ | 1798031803736391680 |
---|---|
author | Alka Pant Sanjay Sharma Ramesh Chandra Joshi |
author_facet | Alka Pant Sanjay Sharma Ramesh Chandra Joshi |
author_sort | Alka Pant |
collection | DOAJ |
description |
Introduction: Air pollution increases the load of hospitalization cases, especially for those who have respiratory problems. For effective environmental management, this study aims to compare the performance of two classification algorithms in machine learning (logistic regression and naive bayes) and to evaluate the selection of the best algorithm for predicting the air quality class.
Materials and methods: Pollutants data (PM10, SO2 , NO2) have been collected from the Haldwani, Kashipur and Rudrapur regions in Uttarakhand (India). In part I of the study, the Air Quality Index (AQI) is calculated and assigned a class accordingly. In part II, the performance of algorithms is compared, and the air quality class is predicted through the best algorithm. In part III, accuracy is calculated after comparing the predicted class with the actual class. Then, it is compared with the accuracy of our selected algorithm.
Results: The study finds a positive correlation between PM10 and SO2 pollutants. The result shows that the highest accuracy is achieved through logistic regression to predict the air quality class. Further, logistic regression has achieved the same accuracy i.e., 98.70% after comparing predicted values with the actual values.
Conclusion: Logistic regression is the best algorithm to predict the air quality class in the regions of Uttarakhand, where pollutants are being measured in the Government’s hospital. The research also indicates that asthma patients in the Kashipur and Rudrapur regions may experience more health effects dueto moderately polluted air quality; however, the situation is improving during the monsoon season.
|
first_indexed | 2024-04-11T20:03:27Z |
format | Article |
id | doaj.art-4f7ec5c162df444785f9c7b239dafe22 |
institution | Directory Open Access Journal |
issn | 2476-3071 |
language | English |
last_indexed | 2024-04-11T20:03:27Z |
publishDate | 2022-09-01 |
publisher | Tehran University of Medical Sciences |
record_format | Article |
series | Journal of Air Pollution and Health |
spelling | doaj.art-4f7ec5c162df444785f9c7b239dafe222022-12-22T04:05:27ZengTehran University of Medical SciencesJournal of Air Pollution and Health2476-30712022-09-017310.18502/japh.v7i3.10542Air quality modeling for effective environmental management in Uttarakhand, India: A comparison of logistic regression and naive bayesAlka Pant0Sanjay Sharma1Ramesh Chandra Joshi2Shri Guru Ram Rai University, DehradunSchool of Computer Applications & Information Technology, Shri Guru Ram Rai (S.G.R.R), University, Dehradun, UttarakhandDepartment of Computer Science and Engineering, Graphic Era (Deemed to be University), Dehradun, Uttarakhand Introduction: Air pollution increases the load of hospitalization cases, especially for those who have respiratory problems. For effective environmental management, this study aims to compare the performance of two classification algorithms in machine learning (logistic regression and naive bayes) and to evaluate the selection of the best algorithm for predicting the air quality class. Materials and methods: Pollutants data (PM10, SO2 , NO2) have been collected from the Haldwani, Kashipur and Rudrapur regions in Uttarakhand (India). In part I of the study, the Air Quality Index (AQI) is calculated and assigned a class accordingly. In part II, the performance of algorithms is compared, and the air quality class is predicted through the best algorithm. In part III, accuracy is calculated after comparing the predicted class with the actual class. Then, it is compared with the accuracy of our selected algorithm. Results: The study finds a positive correlation between PM10 and SO2 pollutants. The result shows that the highest accuracy is achieved through logistic regression to predict the air quality class. Further, logistic regression has achieved the same accuracy i.e., 98.70% after comparing predicted values with the actual values. Conclusion: Logistic regression is the best algorithm to predict the air quality class in the regions of Uttarakhand, where pollutants are being measured in the Government’s hospital. The research also indicates that asthma patients in the Kashipur and Rudrapur regions may experience more health effects dueto moderately polluted air quality; however, the situation is improving during the monsoon season. https://japh.tums.ac.ir/index.php/japh/article/view/407Pollutants; Air Quality; Logistic regression; Naive bayes; Environmental management |
spellingShingle | Alka Pant Sanjay Sharma Ramesh Chandra Joshi Air quality modeling for effective environmental management in Uttarakhand, India: A comparison of logistic regression and naive bayes Journal of Air Pollution and Health Pollutants; Air Quality; Logistic regression; Naive bayes; Environmental management |
title | Air quality modeling for effective environmental management in Uttarakhand, India: A comparison of logistic regression and naive bayes |
title_full | Air quality modeling for effective environmental management in Uttarakhand, India: A comparison of logistic regression and naive bayes |
title_fullStr | Air quality modeling for effective environmental management in Uttarakhand, India: A comparison of logistic regression and naive bayes |
title_full_unstemmed | Air quality modeling for effective environmental management in Uttarakhand, India: A comparison of logistic regression and naive bayes |
title_short | Air quality modeling for effective environmental management in Uttarakhand, India: A comparison of logistic regression and naive bayes |
title_sort | air quality modeling for effective environmental management in uttarakhand india a comparison of logistic regression and naive bayes |
topic | Pollutants; Air Quality; Logistic regression; Naive bayes; Environmental management |
url | https://japh.tums.ac.ir/index.php/japh/article/view/407 |
work_keys_str_mv | AT alkapant airqualitymodelingforeffectiveenvironmentalmanagementinuttarakhandindiaacomparisonoflogisticregressionandnaivebayes AT sanjaysharma airqualitymodelingforeffectiveenvironmentalmanagementinuttarakhandindiaacomparisonoflogisticregressionandnaivebayes AT rameshchandrajoshi airqualitymodelingforeffectiveenvironmentalmanagementinuttarakhandindiaacomparisonoflogisticregressionandnaivebayes |