Multivariety and multimanufacturer drug identification based on near-infrared spectroscopy and recurrent neural network

Near-infrared (NIR) spectral analysis, which has the advantages of rapidness, nondestruction and high-efficiency, is widely used in the detection of feed, food and mineral. In terms of qualitative identification, it can also be used for the discriminant analysis of medicines. Long short-term memory...

Full description

Bibliographic Details
Main Authors: Wenjie Zeng, Yunqi Qiu, Yanting Huang, Qingping Sun, Zhuoya Luo
Format: Article
Language:English
Published: World Scientific Publishing 2022-07-01
Series:Journal of Innovative Optical Health Sciences
Subjects:
Online Access:https://www.worldscientific.com/doi/10.1142/S1793545822500225
_version_ 1818490193994317824
author Wenjie Zeng
Yunqi Qiu
Yanting Huang
Qingping Sun
Zhuoya Luo
author_facet Wenjie Zeng
Yunqi Qiu
Yanting Huang
Qingping Sun
Zhuoya Luo
author_sort Wenjie Zeng
collection DOAJ
description Near-infrared (NIR) spectral analysis, which has the advantages of rapidness, nondestruction and high-efficiency, is widely used in the detection of feed, food and mineral. In terms of qualitative identification, it can also be used for the discriminant analysis of medicines. Long short-term memory (LSTM) neural network, bidirectional long short-term memory (BiLSTM) neural network and gated recurrent unit (GRU) network are variants of the recurrent neural network (RNN). The potential relationship between nonlinear features learned from the sequence by these variants is used to complete the missions in fields such as natural language processing, signal classification and video analysis. Since the effect of these variants in drug identification is still to be studied, this paper constructs a multiclassifier of these three variants, using compound [Formula: see text]-keto acid tablets produced by four manufacturers and repaglinide tablets produced by five manufacturers as the research object. Then, the paper analyzes the impacts of seven different pre-processed methods on the drug NIR data by constructing different layers of LSTM, BiLSTM and GRU networks and compares different classification model indicators and training time of each model. When the spectrum data are pre-processed by [Formula: see text]-score normalization, the GRU-3 model has the best accuracy in all models. The BiLSTM models are better for analyzing high coincidence data. The method proposed in this paper can be further extended to other NIR spectroscopy data sets.
first_indexed 2024-12-10T17:14:03Z
format Article
id doaj.art-61d67a94c11442a480423a79e9e186d4
institution Directory Open Access Journal
issn 1793-5458
1793-7205
language English
last_indexed 2024-12-10T17:14:03Z
publishDate 2022-07-01
publisher World Scientific Publishing
record_format Article
series Journal of Innovative Optical Health Sciences
spelling doaj.art-61d67a94c11442a480423a79e9e186d42022-12-22T01:40:12ZengWorld Scientific PublishingJournal of Innovative Optical Health Sciences1793-54581793-72052022-07-01150410.1142/S1793545822500225Multivariety and multimanufacturer drug identification based on near-infrared spectroscopy and recurrent neural networkWenjie Zeng0Yunqi Qiu1Yanting Huang2Qingping Sun3Zhuoya Luo4School of Pharmaceutical Sciences, Guangzhou Medical University, 1 Xinzao Road Guangzhou 511436, P. R. ChinaNMPA Key Laboratory of Rapid Drug Inspection Technology, Guangdong Institute for Drug Control, 766 Shenzhou Road, Guangzhou 510663, P. R. ChinaNMPA Key Laboratory of Rapid Drug Inspection Technology, Guangdong Institute for Drug Control, 766 Shenzhou Road, Guangzhou 510663, P. R. ChinaNMPA Key Laboratory of Rapid Drug Inspection Technology, Guangdong Institute for Drug Control, 766 Shenzhou Road, Guangzhou 510663, P. R. ChinaNMPA Key Laboratory of Rapid Drug Inspection Technology, Guangdong Institute for Drug Control, 766 Shenzhou Road, Guangzhou 510663, P. R. ChinaNear-infrared (NIR) spectral analysis, which has the advantages of rapidness, nondestruction and high-efficiency, is widely used in the detection of feed, food and mineral. In terms of qualitative identification, it can also be used for the discriminant analysis of medicines. Long short-term memory (LSTM) neural network, bidirectional long short-term memory (BiLSTM) neural network and gated recurrent unit (GRU) network are variants of the recurrent neural network (RNN). The potential relationship between nonlinear features learned from the sequence by these variants is used to complete the missions in fields such as natural language processing, signal classification and video analysis. Since the effect of these variants in drug identification is still to be studied, this paper constructs a multiclassifier of these three variants, using compound [Formula: see text]-keto acid tablets produced by four manufacturers and repaglinide tablets produced by five manufacturers as the research object. Then, the paper analyzes the impacts of seven different pre-processed methods on the drug NIR data by constructing different layers of LSTM, BiLSTM and GRU networks and compares different classification model indicators and training time of each model. When the spectrum data are pre-processed by [Formula: see text]-score normalization, the GRU-3 model has the best accuracy in all models. The BiLSTM models are better for analyzing high coincidence data. The method proposed in this paper can be further extended to other NIR spectroscopy data sets.https://www.worldscientific.com/doi/10.1142/S1793545822500225Near-infrared spectroscopylong short-term memorybidirectional long short-term memorygated recurrent unitmultiple classifiers
spellingShingle Wenjie Zeng
Yunqi Qiu
Yanting Huang
Qingping Sun
Zhuoya Luo
Multivariety and multimanufacturer drug identification based on near-infrared spectroscopy and recurrent neural network
Journal of Innovative Optical Health Sciences
Near-infrared spectroscopy
long short-term memory
bidirectional long short-term memory
gated recurrent unit
multiple classifiers
title Multivariety and multimanufacturer drug identification based on near-infrared spectroscopy and recurrent neural network
title_full Multivariety and multimanufacturer drug identification based on near-infrared spectroscopy and recurrent neural network
title_fullStr Multivariety and multimanufacturer drug identification based on near-infrared spectroscopy and recurrent neural network
title_full_unstemmed Multivariety and multimanufacturer drug identification based on near-infrared spectroscopy and recurrent neural network
title_short Multivariety and multimanufacturer drug identification based on near-infrared spectroscopy and recurrent neural network
title_sort multivariety and multimanufacturer drug identification based on near infrared spectroscopy and recurrent neural network
topic Near-infrared spectroscopy
long short-term memory
bidirectional long short-term memory
gated recurrent unit
multiple classifiers
url https://www.worldscientific.com/doi/10.1142/S1793545822500225
work_keys_str_mv AT wenjiezeng multivarietyandmultimanufacturerdrugidentificationbasedonnearinfraredspectroscopyandrecurrentneuralnetwork
AT yunqiqiu multivarietyandmultimanufacturerdrugidentificationbasedonnearinfraredspectroscopyandrecurrentneuralnetwork
AT yantinghuang multivarietyandmultimanufacturerdrugidentificationbasedonnearinfraredspectroscopyandrecurrentneuralnetwork
AT qingpingsun multivarietyandmultimanufacturerdrugidentificationbasedonnearinfraredspectroscopyandrecurrentneuralnetwork
AT zhuoyaluo multivarietyandmultimanufacturerdrugidentificationbasedonnearinfraredspectroscopyandrecurrentneuralnetwork