Data preprocessing workflow for exhaled breath analysis by GC/MS using open sources

Abstract The noninvasive diagnosis and monitoring of high prevalence diseases such as cardiovascular diseases, cancers and chronic respiratory diseases are currently priority objectives in the area of health. In this regard, the analysis of volatile organic compounds (VOCs) has been identified as a...

Full description

Bibliographic Details
Main Authors: Rosa Alba Sola Martínez, José María Pastor Hernández, Gema Lozano Terol, Julia Gallego-Jara, Luis García-Marcos, Manuel Cánovas Díaz, Teresa de Diego Puente
Format: Article
Language:English
Published: Nature Portfolio 2020-12-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-020-79014-6
_version_ 1818692008332492800
author Rosa Alba Sola Martínez
José María Pastor Hernández
Gema Lozano Terol
Julia Gallego-Jara
Luis García-Marcos
Manuel Cánovas Díaz
Teresa de Diego Puente
author_facet Rosa Alba Sola Martínez
José María Pastor Hernández
Gema Lozano Terol
Julia Gallego-Jara
Luis García-Marcos
Manuel Cánovas Díaz
Teresa de Diego Puente
author_sort Rosa Alba Sola Martínez
collection DOAJ
description Abstract The noninvasive diagnosis and monitoring of high prevalence diseases such as cardiovascular diseases, cancers and chronic respiratory diseases are currently priority objectives in the area of health. In this regard, the analysis of volatile organic compounds (VOCs) has been identified as a potential noninvasive tool for the diagnosis and surveillance of several diseases. Despite the advantages of this strategy, it is not yet a routine clinical tool. The lack of reproducible protocols for each step of the biomarker discovery phase is an obstacle of the current state. Specifically, this issue is present at the data preprocessing step. Thus, an open source workflow for preprocessing the data obtained by the analysis of exhaled breath samples using gas chromatography coupled with single quadrupole mass spectrometry (GC/MS) is presented in this paper. This workflow is based on the connection of two approaches to transform raw data into a useful matrix for statistical analysis. Moreover, this workflow includes matching compounds from breath samples with a spectral library. Three free packages (xcms, cliqueMS and eRah) written in the language R are used for this purpose. Furthermore, this paper presents a suitable protocol for exhaled breath sample collection from infants under 2 years of age for GC/MS.
first_indexed 2024-12-17T12:50:57Z
format Article
id doaj.art-0c1818f732ec47778a753f50b5d846c8
institution Directory Open Access Journal
issn 2045-2322
language English
last_indexed 2024-12-17T12:50:57Z
publishDate 2020-12-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj.art-0c1818f732ec47778a753f50b5d846c82022-12-21T21:47:36ZengNature PortfolioScientific Reports2045-23222020-12-0110111110.1038/s41598-020-79014-6Data preprocessing workflow for exhaled breath analysis by GC/MS using open sourcesRosa Alba Sola Martínez0José María Pastor Hernández1Gema Lozano Terol2Julia Gallego-Jara3Luis García-Marcos4Manuel Cánovas Díaz5Teresa de Diego Puente6Biotechnology Group, Department of Biochemistry and Molecular Biology and Immunology (B), Faculty of Chemistry, University of MurciaBiotechnology Group, Department of Biochemistry and Molecular Biology and Immunology (B), Faculty of Chemistry, University of MurciaBiotechnology Group, Department of Biochemistry and Molecular Biology and Immunology (B), Faculty of Chemistry, University of MurciaBiotechnology Group, Department of Biochemistry and Molecular Biology and Immunology (B), Faculty of Chemistry, University of MurciaBiomedical Research Institute of Murcia (IMIB-Arrixaca)Biotechnology Group, Department of Biochemistry and Molecular Biology and Immunology (B), Faculty of Chemistry, University of MurciaBiotechnology Group, Department of Biochemistry and Molecular Biology and Immunology (B), Faculty of Chemistry, University of MurciaAbstract The noninvasive diagnosis and monitoring of high prevalence diseases such as cardiovascular diseases, cancers and chronic respiratory diseases are currently priority objectives in the area of health. In this regard, the analysis of volatile organic compounds (VOCs) has been identified as a potential noninvasive tool for the diagnosis and surveillance of several diseases. Despite the advantages of this strategy, it is not yet a routine clinical tool. The lack of reproducible protocols for each step of the biomarker discovery phase is an obstacle of the current state. Specifically, this issue is present at the data preprocessing step. Thus, an open source workflow for preprocessing the data obtained by the analysis of exhaled breath samples using gas chromatography coupled with single quadrupole mass spectrometry (GC/MS) is presented in this paper. This workflow is based on the connection of two approaches to transform raw data into a useful matrix for statistical analysis. Moreover, this workflow includes matching compounds from breath samples with a spectral library. Three free packages (xcms, cliqueMS and eRah) written in the language R are used for this purpose. Furthermore, this paper presents a suitable protocol for exhaled breath sample collection from infants under 2 years of age for GC/MS.https://doi.org/10.1038/s41598-020-79014-6
spellingShingle Rosa Alba Sola Martínez
José María Pastor Hernández
Gema Lozano Terol
Julia Gallego-Jara
Luis García-Marcos
Manuel Cánovas Díaz
Teresa de Diego Puente
Data preprocessing workflow for exhaled breath analysis by GC/MS using open sources
Scientific Reports
title Data preprocessing workflow for exhaled breath analysis by GC/MS using open sources
title_full Data preprocessing workflow for exhaled breath analysis by GC/MS using open sources
title_fullStr Data preprocessing workflow for exhaled breath analysis by GC/MS using open sources
title_full_unstemmed Data preprocessing workflow for exhaled breath analysis by GC/MS using open sources
title_short Data preprocessing workflow for exhaled breath analysis by GC/MS using open sources
title_sort data preprocessing workflow for exhaled breath analysis by gc ms using open sources
url https://doi.org/10.1038/s41598-020-79014-6
work_keys_str_mv AT rosaalbasolamartinez datapreprocessingworkflowforexhaledbreathanalysisbygcmsusingopensources
AT josemariapastorhernandez datapreprocessingworkflowforexhaledbreathanalysisbygcmsusingopensources
AT gemalozanoterol datapreprocessingworkflowforexhaledbreathanalysisbygcmsusingopensources
AT juliagallegojara datapreprocessingworkflowforexhaledbreathanalysisbygcmsusingopensources
AT luisgarciamarcos datapreprocessingworkflowforexhaledbreathanalysisbygcmsusingopensources
AT manuelcanovasdiaz datapreprocessingworkflowforexhaledbreathanalysisbygcmsusingopensources
AT teresadediegopuente datapreprocessingworkflowforexhaledbreathanalysisbygcmsusingopensources