Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data.

1H Nuclear Magnetic Resonance (NMR)-based metabolic profiling is very promising for the diagnostic of the stages of chronic kidney disease (CKD). Because of the high dimension of NMR spectra datasets and the complex mixture of metabolites in biological samples, the identification of discriminant bio...

Full description

Bibliographic Details
Main Authors: Margaux Luck, Gildas Bertho, Mathilde Bateson, Alexandre Karras, Anastasia Yartseva, Eric Thervet, Cecilia Damon, Nicolas Pallet
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2016-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC5115883?pdf=render
_version_ 1818480880257073152
author Margaux Luck
Gildas Bertho
Mathilde Bateson
Alexandre Karras
Anastasia Yartseva
Eric Thervet
Cecilia Damon
Nicolas Pallet
author_facet Margaux Luck
Gildas Bertho
Mathilde Bateson
Alexandre Karras
Anastasia Yartseva
Eric Thervet
Cecilia Damon
Nicolas Pallet
author_sort Margaux Luck
collection DOAJ
description 1H Nuclear Magnetic Resonance (NMR)-based metabolic profiling is very promising for the diagnostic of the stages of chronic kidney disease (CKD). Because of the high dimension of NMR spectra datasets and the complex mixture of metabolites in biological samples, the identification of discriminant biomarkers of a disease is challenging. None of the widely used chemometric methods in NMR metabolomics performs a local exhaustive exploration of the data. We developed a descriptive and easily understandable approach that searches for discriminant local phenomena using an original exhaustive rule-mining algorithm in order to predict two groups of patients: 1) patients having low to mild CKD stages with no renal failure and 2) patients having moderate to established CKD stages with renal failure. Our predictive algorithm explores the m-dimensional variable space to capture the local overdensities of the two groups of patients under the form of easily interpretable rules. Afterwards, a L2-penalized logistic regression on the discriminant rules was used to build predictive models of the CKD stages. We explored a complex multi-source dataset that included the clinical, demographic, clinical chemistry, renal pathology and urine metabolomic data of a cohort of 110 patients. Given this multi-source dataset and the complex nature of metabolomic data, we analyzed 1- and 2-dimensional rules in order to integrate the information carried by the interactions between the variables. The results indicated that our local algorithm is a valuable analytical method for the precise characterization of multivariate CKD stage profiles and as efficient as the classical global model using chi2 variable section with an approximately 70% of good classification level. The resulting predictive models predominantly identify urinary metabolites (such as 3-hydroxyisovalerate, carnitine, citrate, dimethylsulfone, creatinine and N-methylnicotinamide) as relevant variables indicating that CKD significantly affects the urinary metabolome. In addition, the simple knowledge of the concentration of urinary metabolites classifies the CKD stage of the patients correctly.
first_indexed 2024-12-10T11:28:04Z
format Article
id doaj.art-ab9e1cafa2b0400c9c1d5f86319c99a4
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-10T11:28:04Z
publishDate 2016-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-ab9e1cafa2b0400c9c1d5f86319c99a42022-12-22T01:50:40ZengPublic Library of Science (PLoS)PLoS ONE1932-62032016-01-011111e016690510.1371/journal.pone.0166905Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data.Margaux LuckGildas BerthoMathilde BatesonAlexandre KarrasAnastasia YartsevaEric ThervetCecilia DamonNicolas Pallet1H Nuclear Magnetic Resonance (NMR)-based metabolic profiling is very promising for the diagnostic of the stages of chronic kidney disease (CKD). Because of the high dimension of NMR spectra datasets and the complex mixture of metabolites in biological samples, the identification of discriminant biomarkers of a disease is challenging. None of the widely used chemometric methods in NMR metabolomics performs a local exhaustive exploration of the data. We developed a descriptive and easily understandable approach that searches for discriminant local phenomena using an original exhaustive rule-mining algorithm in order to predict two groups of patients: 1) patients having low to mild CKD stages with no renal failure and 2) patients having moderate to established CKD stages with renal failure. Our predictive algorithm explores the m-dimensional variable space to capture the local overdensities of the two groups of patients under the form of easily interpretable rules. Afterwards, a L2-penalized logistic regression on the discriminant rules was used to build predictive models of the CKD stages. We explored a complex multi-source dataset that included the clinical, demographic, clinical chemistry, renal pathology and urine metabolomic data of a cohort of 110 patients. Given this multi-source dataset and the complex nature of metabolomic data, we analyzed 1- and 2-dimensional rules in order to integrate the information carried by the interactions between the variables. The results indicated that our local algorithm is a valuable analytical method for the precise characterization of multivariate CKD stage profiles and as efficient as the classical global model using chi2 variable section with an approximately 70% of good classification level. The resulting predictive models predominantly identify urinary metabolites (such as 3-hydroxyisovalerate, carnitine, citrate, dimethylsulfone, creatinine and N-methylnicotinamide) as relevant variables indicating that CKD significantly affects the urinary metabolome. In addition, the simple knowledge of the concentration of urinary metabolites classifies the CKD stage of the patients correctly.http://europepmc.org/articles/PMC5115883?pdf=render
spellingShingle Margaux Luck
Gildas Bertho
Mathilde Bateson
Alexandre Karras
Anastasia Yartseva
Eric Thervet
Cecilia Damon
Nicolas Pallet
Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data.
PLoS ONE
title Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data.
title_full Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data.
title_fullStr Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data.
title_full_unstemmed Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data.
title_short Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data.
title_sort rule mining for the early prediction of chronic kidney disease based on metabolomics and multi source data
url http://europepmc.org/articles/PMC5115883?pdf=render
work_keys_str_mv AT margauxluck ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata
AT gildasbertho ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata
AT mathildebateson ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata
AT alexandrekarras ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata
AT anastasiayartseva ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata
AT ericthervet ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata
AT ceciliadamon ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata
AT nicolaspallet ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata