Research on the chemical oxygen demand spectral inversion model in water based on IPLS-GAN-SVM hybrid algorithm.

Spectral collinearity and limited spectral datasets are the problems influencing Chemical Oxygen Demand (COD) modeling. To address the first problem and obtain optimal modeling range, the spectra are preprocessed using six methods including Standard Normal Variate, Savitzky-Golay Smoothing Filtering...

Full description

Bibliographic Details
Main Authors: Qirong Lu, Jian Zou, Yingya Ye, Zexin Wang
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2024-01-01
Series:PLoS ONE
Online Access:https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0301902&type=printable
_version_ 1797202059878989824
author Qirong Lu
Jian Zou
Yingya Ye
Zexin Wang
author_facet Qirong Lu
Jian Zou
Yingya Ye
Zexin Wang
author_sort Qirong Lu
collection DOAJ
description Spectral collinearity and limited spectral datasets are the problems influencing Chemical Oxygen Demand (COD) modeling. To address the first problem and obtain optimal modeling range, the spectra are preprocessed using six methods including Standard Normal Variate, Savitzky-Golay Smoothing Filtering (SG) etc. Subsequently, the 190-350 nm spectral range is divided into 10 subintervals, and Interval Partial Least Squares (IPLS) is used to perform PLS modeling on each interval. The results indicate that it is best modeled in the 7th range (238~253 nm). The values of Mean Square Error (MSE), Mean Absolute Error (MAE) and R2score of the model without pretreatment are 1.6489, 1.0661, and 0.9942. After pretreatment, the SG is better than others, with MSE and MAE decreasing to 1.4727, 1.0318 and R2score improving to 0.9944. Using the optimal model, the predicted COD for three samples are 10.87 mg/L, 14.88 mg/L, and 19.29 mg/L. To address the problem of the small dataset, using Generative Adversarial Networks for data augmentation, three datasets are obtained for Support Vector Machine (SVM) modeling. The results indicate that, compared to the original dataset, the SVM's MSE and MAE have decreased, while its accuracy has improved by 2.88%, 11.53%, and 11.53%, and the R2score has improved by 18.07%, 17.40%, and 18.74%.
first_indexed 2024-04-24T07:57:25Z
format Article
id doaj.art-a1083daede794281bde49d2591bd143d
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-04-24T07:57:25Z
publishDate 2024-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-a1083daede794281bde49d2591bd143d2024-04-18T05:31:48ZengPublic Library of Science (PLoS)PLoS ONE1932-62032024-01-01194e030190210.1371/journal.pone.0301902Research on the chemical oxygen demand spectral inversion model in water based on IPLS-GAN-SVM hybrid algorithm.Qirong LuJian ZouYingya YeZexin WangSpectral collinearity and limited spectral datasets are the problems influencing Chemical Oxygen Demand (COD) modeling. To address the first problem and obtain optimal modeling range, the spectra are preprocessed using six methods including Standard Normal Variate, Savitzky-Golay Smoothing Filtering (SG) etc. Subsequently, the 190-350 nm spectral range is divided into 10 subintervals, and Interval Partial Least Squares (IPLS) is used to perform PLS modeling on each interval. The results indicate that it is best modeled in the 7th range (238~253 nm). The values of Mean Square Error (MSE), Mean Absolute Error (MAE) and R2score of the model without pretreatment are 1.6489, 1.0661, and 0.9942. After pretreatment, the SG is better than others, with MSE and MAE decreasing to 1.4727, 1.0318 and R2score improving to 0.9944. Using the optimal model, the predicted COD for three samples are 10.87 mg/L, 14.88 mg/L, and 19.29 mg/L. To address the problem of the small dataset, using Generative Adversarial Networks for data augmentation, three datasets are obtained for Support Vector Machine (SVM) modeling. The results indicate that, compared to the original dataset, the SVM's MSE and MAE have decreased, while its accuracy has improved by 2.88%, 11.53%, and 11.53%, and the R2score has improved by 18.07%, 17.40%, and 18.74%.https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0301902&type=printable
spellingShingle Qirong Lu
Jian Zou
Yingya Ye
Zexin Wang
Research on the chemical oxygen demand spectral inversion model in water based on IPLS-GAN-SVM hybrid algorithm.
PLoS ONE
title Research on the chemical oxygen demand spectral inversion model in water based on IPLS-GAN-SVM hybrid algorithm.
title_full Research on the chemical oxygen demand spectral inversion model in water based on IPLS-GAN-SVM hybrid algorithm.
title_fullStr Research on the chemical oxygen demand spectral inversion model in water based on IPLS-GAN-SVM hybrid algorithm.
title_full_unstemmed Research on the chemical oxygen demand spectral inversion model in water based on IPLS-GAN-SVM hybrid algorithm.
title_short Research on the chemical oxygen demand spectral inversion model in water based on IPLS-GAN-SVM hybrid algorithm.
title_sort research on the chemical oxygen demand spectral inversion model in water based on ipls gan svm hybrid algorithm
url https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0301902&type=printable
work_keys_str_mv AT qironglu researchonthechemicaloxygendemandspectralinversionmodelinwaterbasedoniplsgansvmhybridalgorithm
AT jianzou researchonthechemicaloxygendemandspectralinversionmodelinwaterbasedoniplsgansvmhybridalgorithm
AT yingyaye researchonthechemicaloxygendemandspectralinversionmodelinwaterbasedoniplsgansvmhybridalgorithm
AT zexinwang researchonthechemicaloxygendemandspectralinversionmodelinwaterbasedoniplsgansvmhybridalgorithm