Urine based near-infrared spectroscopy analysis reveals a noninvasive and convenient diagnosis method for cancers: a pilot study

Background The challenges in cancer diagnosis underline the need for continued research and development of new diagnostic tools and methods. This study aims to explore an effective, noninvasive, and convenient diagnostic tool using urine based near-infrared spectroscopy (NIRS) analysis combined with...

Full description

Bibliographic Details
Main Authors: Jing Zhu, Siyu Zhang, Ruting Wang, Ruhua Fang, Lan Lei, Ji Zheng, Zhongjian Chen
Format: Article
Language:English
Published: PeerJ Inc. 2023-08-01
Series:PeerJ
Subjects:
Online Access:https://peerj.com/articles/15895.pdf
_version_ 1797424969234251776
author Jing Zhu
Siyu Zhang
Ruting Wang
Ruhua Fang
Lan Lei
Ji Zheng
Zhongjian Chen
author_facet Jing Zhu
Siyu Zhang
Ruting Wang
Ruhua Fang
Lan Lei
Ji Zheng
Zhongjian Chen
author_sort Jing Zhu
collection DOAJ
description Background The challenges in cancer diagnosis underline the need for continued research and development of new diagnostic tools and methods. This study aims to explore an effective, noninvasive, and convenient diagnostic tool using urine based near-infrared spectroscopy (NIRS) analysis combined with machine learning algorithm. Methods Urine samples were collected from a total of 327 participants, including 181 cancer cases and 146 healthy controls. These participants were randomly spit into train set (n = 218) and test set (n = 109). NIRS analysis (4,000 ∼10,000 cm−1) was performed for each sample in both train and test sets. Five pretreatment methods, including Savitzky-Golay (SG) smoothing, multiplicative scatter correction (MSC), baseline removal (BSL) with fitting polynomials to be used as baselines, the first derivative (DERIV1), and the second derivative (DERIV2), and combination with “scaling” and “center”, were investigated. Then partial least-squares (PLS) and linear support-vector machine (SVM) classification models were established, and prediction performance was evaluated in test set. Results NIRS had greatly overlapping in peaks, and PCA analysis failed in separation between cancers and healthy controls. In modeling with urine based NIRS data, PLS model showed its highest prediction accuracy of 0.780, with DERIV2, “scaling” and “center” pretreatment, while linear SVM displayed its best prediction accuracy of 0.844, with raw NIRS. With optimization in SVM, the prediction accuracy could improve to 0.862, when the top 262 features were involved as variables. Discussion This pilot study combining urine based NIRS analysis and machine learning is effective and convenient that might facilitate in cancer diagnosis, encouraging further evaluation with a large-size multi-center study.
first_indexed 2024-03-09T08:09:15Z
format Article
id doaj.art-25e15d90e952481db88038046524f779
institution Directory Open Access Journal
issn 2167-8359
language English
last_indexed 2024-03-09T08:09:15Z
publishDate 2023-08-01
publisher PeerJ Inc.
record_format Article
series PeerJ
spelling doaj.art-25e15d90e952481db88038046524f7792023-12-02T23:31:01ZengPeerJ Inc.PeerJ2167-83592023-08-0111e1589510.7717/peerj.15895Urine based near-infrared spectroscopy analysis reveals a noninvasive and convenient diagnosis method for cancers: a pilot studyJing Zhu0Siyu Zhang1Ruting Wang2Ruhua Fang3Lan Lei4Ji Zheng5Zhongjian Chen6Department of Clinical Laboratory, Zhejiang Cancer Hospital, Hangzhou, Zhejiang, ChinaDepartment of Clinical Pharmacy, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, ChinaExperimental Research Center, Zhejiang Cancer Hospital, Hangzhou, Zhejiang, ChinaDepartment of Clinical Laboratory, Zhejiang Cancer Hospital, Hangzhou, Zhejiang, ChinaZhejiang Hospital, Hangzhou, Zhejiang, ChinaDepartment of Radiotherapy and Chemotherapy, Ningbo No. 2 Hospital, Ningbo, Zhejiang, ChinaExperimental Research Center, Zhejiang Cancer Hospital, Hangzhou, Zhejiang, ChinaBackground The challenges in cancer diagnosis underline the need for continued research and development of new diagnostic tools and methods. This study aims to explore an effective, noninvasive, and convenient diagnostic tool using urine based near-infrared spectroscopy (NIRS) analysis combined with machine learning algorithm. Methods Urine samples were collected from a total of 327 participants, including 181 cancer cases and 146 healthy controls. These participants were randomly spit into train set (n = 218) and test set (n = 109). NIRS analysis (4,000 ∼10,000 cm−1) was performed for each sample in both train and test sets. Five pretreatment methods, including Savitzky-Golay (SG) smoothing, multiplicative scatter correction (MSC), baseline removal (BSL) with fitting polynomials to be used as baselines, the first derivative (DERIV1), and the second derivative (DERIV2), and combination with “scaling” and “center”, were investigated. Then partial least-squares (PLS) and linear support-vector machine (SVM) classification models were established, and prediction performance was evaluated in test set. Results NIRS had greatly overlapping in peaks, and PCA analysis failed in separation between cancers and healthy controls. In modeling with urine based NIRS data, PLS model showed its highest prediction accuracy of 0.780, with DERIV2, “scaling” and “center” pretreatment, while linear SVM displayed its best prediction accuracy of 0.844, with raw NIRS. With optimization in SVM, the prediction accuracy could improve to 0.862, when the top 262 features were involved as variables. Discussion This pilot study combining urine based NIRS analysis and machine learning is effective and convenient that might facilitate in cancer diagnosis, encouraging further evaluation with a large-size multi-center study.https://peerj.com/articles/15895.pdfMachine learningDiagnosisUrineNear-infrared spectroscopyPartial least-squares (PLS)Support-vector machine (SVM)
spellingShingle Jing Zhu
Siyu Zhang
Ruting Wang
Ruhua Fang
Lan Lei
Ji Zheng
Zhongjian Chen
Urine based near-infrared spectroscopy analysis reveals a noninvasive and convenient diagnosis method for cancers: a pilot study
PeerJ
Machine learning
Diagnosis
Urine
Near-infrared spectroscopy
Partial least-squares (PLS)
Support-vector machine (SVM)
title Urine based near-infrared spectroscopy analysis reveals a noninvasive and convenient diagnosis method for cancers: a pilot study
title_full Urine based near-infrared spectroscopy analysis reveals a noninvasive and convenient diagnosis method for cancers: a pilot study
title_fullStr Urine based near-infrared spectroscopy analysis reveals a noninvasive and convenient diagnosis method for cancers: a pilot study
title_full_unstemmed Urine based near-infrared spectroscopy analysis reveals a noninvasive and convenient diagnosis method for cancers: a pilot study
title_short Urine based near-infrared spectroscopy analysis reveals a noninvasive and convenient diagnosis method for cancers: a pilot study
title_sort urine based near infrared spectroscopy analysis reveals a noninvasive and convenient diagnosis method for cancers a pilot study
topic Machine learning
Diagnosis
Urine
Near-infrared spectroscopy
Partial least-squares (PLS)
Support-vector machine (SVM)
url https://peerj.com/articles/15895.pdf
work_keys_str_mv AT jingzhu urinebasednearinfraredspectroscopyanalysisrevealsanoninvasiveandconvenientdiagnosismethodforcancersapilotstudy
AT siyuzhang urinebasednearinfraredspectroscopyanalysisrevealsanoninvasiveandconvenientdiagnosismethodforcancersapilotstudy
AT rutingwang urinebasednearinfraredspectroscopyanalysisrevealsanoninvasiveandconvenientdiagnosismethodforcancersapilotstudy
AT ruhuafang urinebasednearinfraredspectroscopyanalysisrevealsanoninvasiveandconvenientdiagnosismethodforcancersapilotstudy
AT lanlei urinebasednearinfraredspectroscopyanalysisrevealsanoninvasiveandconvenientdiagnosismethodforcancersapilotstudy
AT jizheng urinebasednearinfraredspectroscopyanalysisrevealsanoninvasiveandconvenientdiagnosismethodforcancersapilotstudy
AT zhongjianchen urinebasednearinfraredspectroscopyanalysisrevealsanoninvasiveandconvenientdiagnosismethodforcancersapilotstudy