Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data.
1H Nuclear Magnetic Resonance (NMR)-based metabolic profiling is very promising for the diagnostic of the stages of chronic kidney disease (CKD). Because of the high dimension of NMR spectra datasets and the complex mixture of metabolites in biological samples, the identification of discriminant bio...
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2016-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC5115883?pdf=render |
_version_ | 1818480880257073152 |
---|---|
author | Margaux Luck Gildas Bertho Mathilde Bateson Alexandre Karras Anastasia Yartseva Eric Thervet Cecilia Damon Nicolas Pallet |
author_facet | Margaux Luck Gildas Bertho Mathilde Bateson Alexandre Karras Anastasia Yartseva Eric Thervet Cecilia Damon Nicolas Pallet |
author_sort | Margaux Luck |
collection | DOAJ |
description | 1H Nuclear Magnetic Resonance (NMR)-based metabolic profiling is very promising for the diagnostic of the stages of chronic kidney disease (CKD). Because of the high dimension of NMR spectra datasets and the complex mixture of metabolites in biological samples, the identification of discriminant biomarkers of a disease is challenging. None of the widely used chemometric methods in NMR metabolomics performs a local exhaustive exploration of the data. We developed a descriptive and easily understandable approach that searches for discriminant local phenomena using an original exhaustive rule-mining algorithm in order to predict two groups of patients: 1) patients having low to mild CKD stages with no renal failure and 2) patients having moderate to established CKD stages with renal failure. Our predictive algorithm explores the m-dimensional variable space to capture the local overdensities of the two groups of patients under the form of easily interpretable rules. Afterwards, a L2-penalized logistic regression on the discriminant rules was used to build predictive models of the CKD stages. We explored a complex multi-source dataset that included the clinical, demographic, clinical chemistry, renal pathology and urine metabolomic data of a cohort of 110 patients. Given this multi-source dataset and the complex nature of metabolomic data, we analyzed 1- and 2-dimensional rules in order to integrate the information carried by the interactions between the variables. The results indicated that our local algorithm is a valuable analytical method for the precise characterization of multivariate CKD stage profiles and as efficient as the classical global model using chi2 variable section with an approximately 70% of good classification level. The resulting predictive models predominantly identify urinary metabolites (such as 3-hydroxyisovalerate, carnitine, citrate, dimethylsulfone, creatinine and N-methylnicotinamide) as relevant variables indicating that CKD significantly affects the urinary metabolome. In addition, the simple knowledge of the concentration of urinary metabolites classifies the CKD stage of the patients correctly. |
first_indexed | 2024-12-10T11:28:04Z |
format | Article |
id | doaj.art-ab9e1cafa2b0400c9c1d5f86319c99a4 |
institution | Directory Open Access Journal |
issn | 1932-6203 |
language | English |
last_indexed | 2024-12-10T11:28:04Z |
publishDate | 2016-01-01 |
publisher | Public Library of Science (PLoS) |
record_format | Article |
series | PLoS ONE |
spelling | doaj.art-ab9e1cafa2b0400c9c1d5f86319c99a42022-12-22T01:50:40ZengPublic Library of Science (PLoS)PLoS ONE1932-62032016-01-011111e016690510.1371/journal.pone.0166905Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data.Margaux LuckGildas BerthoMathilde BatesonAlexandre KarrasAnastasia YartsevaEric ThervetCecilia DamonNicolas Pallet1H Nuclear Magnetic Resonance (NMR)-based metabolic profiling is very promising for the diagnostic of the stages of chronic kidney disease (CKD). Because of the high dimension of NMR spectra datasets and the complex mixture of metabolites in biological samples, the identification of discriminant biomarkers of a disease is challenging. None of the widely used chemometric methods in NMR metabolomics performs a local exhaustive exploration of the data. We developed a descriptive and easily understandable approach that searches for discriminant local phenomena using an original exhaustive rule-mining algorithm in order to predict two groups of patients: 1) patients having low to mild CKD stages with no renal failure and 2) patients having moderate to established CKD stages with renal failure. Our predictive algorithm explores the m-dimensional variable space to capture the local overdensities of the two groups of patients under the form of easily interpretable rules. Afterwards, a L2-penalized logistic regression on the discriminant rules was used to build predictive models of the CKD stages. We explored a complex multi-source dataset that included the clinical, demographic, clinical chemistry, renal pathology and urine metabolomic data of a cohort of 110 patients. Given this multi-source dataset and the complex nature of metabolomic data, we analyzed 1- and 2-dimensional rules in order to integrate the information carried by the interactions between the variables. The results indicated that our local algorithm is a valuable analytical method for the precise characterization of multivariate CKD stage profiles and as efficient as the classical global model using chi2 variable section with an approximately 70% of good classification level. The resulting predictive models predominantly identify urinary metabolites (such as 3-hydroxyisovalerate, carnitine, citrate, dimethylsulfone, creatinine and N-methylnicotinamide) as relevant variables indicating that CKD significantly affects the urinary metabolome. In addition, the simple knowledge of the concentration of urinary metabolites classifies the CKD stage of the patients correctly.http://europepmc.org/articles/PMC5115883?pdf=render |
spellingShingle | Margaux Luck Gildas Bertho Mathilde Bateson Alexandre Karras Anastasia Yartseva Eric Thervet Cecilia Damon Nicolas Pallet Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data. PLoS ONE |
title | Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data. |
title_full | Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data. |
title_fullStr | Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data. |
title_full_unstemmed | Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data. |
title_short | Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data. |
title_sort | rule mining for the early prediction of chronic kidney disease based on metabolomics and multi source data |
url | http://europepmc.org/articles/PMC5115883?pdf=render |
work_keys_str_mv | AT margauxluck ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata AT gildasbertho ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata AT mathildebateson ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata AT alexandrekarras ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata AT anastasiayartseva ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata AT ericthervet ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata AT ceciliadamon ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata AT nicolaspallet ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata |