A semiparametric modeling framework for potential biomarker discovery and the development of metabonomic profiles

<p>Abstract</p> <p>Background</p> <p>The discovery of biomarkers is an important step towards the development of criteria for early diagnosis of disease status. Recently electrospray ionization (ESI) and matrix assisted laser desorption (MALDI) time-of-flight (TOF) mass...

Full description

Bibliographic Details
Main Authors: Dey Dipak K, Grant David F, Ghosh Samiran, Hill Dennis W
Format: Article
Language:English
Published: BMC 2008-01-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/9/38
_version_ 1811247084043501568
author Dey Dipak K
Grant David F
Ghosh Samiran
Hill Dennis W
author_facet Dey Dipak K
Grant David F
Ghosh Samiran
Hill Dennis W
author_sort Dey Dipak K
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>The discovery of biomarkers is an important step towards the development of criteria for early diagnosis of disease status. Recently electrospray ionization (ESI) and matrix assisted laser desorption (MALDI) time-of-flight (TOF) mass spectrometry have been used to identify biomarkers both in proteomics and metabonomics studies. Data sets generated from such studies are generally very large in size and thus require the use of sophisticated statistical techniques to glean useful information. Most recent attempts to process these types of data model each compound's intensity either discretely by positional (mass to charge ratio) clustering or through each compounds' own intensity distribution. Traditionally data processing steps such as noise removal, background elimination and m/z alignment, are generally carried out separately resulting in unsatisfactory propagation of signals in the final model.</p> <p>Results</p> <p>In the present study a novel semi-parametric approach has been developed to distinguish urinary metabolic profiles in a group of traumatic patients from those of a control group consisting of normal individuals. Data sets obtained from the replicates of a single subject were used to develop a functional profile through Dirichlet mixture of beta distribution. This functional profile is flexible enough to accommodate variability of the instrument and the inherent variability of each individual, thus simultaneously addressing different sources of systematic error. To address instrument variability, all data sets were analyzed in replicate, an important issue ignored by most studies in the past. Different model comparisons were performed to select the best model for each subject. The m/z values in the window of the irregular pattern are then further recommended for possible biomarker discovery.</p> <p>Conclusion</p> <p>To the best of our knowledge this is the very first attempt to model the physical process behind the time-of flight mass spectrometry. Most of the state of the art techniques does not take these physical principles in consideration while modeling such data. The proposed modeling process will apply as long as the basic physical principle presented in this paper is valid. Notably we have confined our present work mostly within the modeling aspect. Nevertheless clinical validation of our recommended list of potential biomarkers will be required. Hence, we have termed our modeling approach as a "framework" for further work.</p>
first_indexed 2024-04-12T15:04:23Z
format Article
id doaj.art-538ac886d73e4e4bac0a85226e0c8164
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-04-12T15:04:23Z
publishDate 2008-01-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-538ac886d73e4e4bac0a85226e0c81642022-12-22T03:28:00ZengBMCBMC Bioinformatics1471-21052008-01-01913810.1186/1471-2105-9-38A semiparametric modeling framework for potential biomarker discovery and the development of metabonomic profilesDey Dipak KGrant David FGhosh SamiranHill Dennis W<p>Abstract</p> <p>Background</p> <p>The discovery of biomarkers is an important step towards the development of criteria for early diagnosis of disease status. Recently electrospray ionization (ESI) and matrix assisted laser desorption (MALDI) time-of-flight (TOF) mass spectrometry have been used to identify biomarkers both in proteomics and metabonomics studies. Data sets generated from such studies are generally very large in size and thus require the use of sophisticated statistical techniques to glean useful information. Most recent attempts to process these types of data model each compound's intensity either discretely by positional (mass to charge ratio) clustering or through each compounds' own intensity distribution. Traditionally data processing steps such as noise removal, background elimination and m/z alignment, are generally carried out separately resulting in unsatisfactory propagation of signals in the final model.</p> <p>Results</p> <p>In the present study a novel semi-parametric approach has been developed to distinguish urinary metabolic profiles in a group of traumatic patients from those of a control group consisting of normal individuals. Data sets obtained from the replicates of a single subject were used to develop a functional profile through Dirichlet mixture of beta distribution. This functional profile is flexible enough to accommodate variability of the instrument and the inherent variability of each individual, thus simultaneously addressing different sources of systematic error. To address instrument variability, all data sets were analyzed in replicate, an important issue ignored by most studies in the past. Different model comparisons were performed to select the best model for each subject. The m/z values in the window of the irregular pattern are then further recommended for possible biomarker discovery.</p> <p>Conclusion</p> <p>To the best of our knowledge this is the very first attempt to model the physical process behind the time-of flight mass spectrometry. Most of the state of the art techniques does not take these physical principles in consideration while modeling such data. The proposed modeling process will apply as long as the basic physical principle presented in this paper is valid. Notably we have confined our present work mostly within the modeling aspect. Nevertheless clinical validation of our recommended list of potential biomarkers will be required. Hence, we have termed our modeling approach as a "framework" for further work.</p>http://www.biomedcentral.com/1471-2105/9/38
spellingShingle Dey Dipak K
Grant David F
Ghosh Samiran
Hill Dennis W
A semiparametric modeling framework for potential biomarker discovery and the development of metabonomic profiles
BMC Bioinformatics
title A semiparametric modeling framework for potential biomarker discovery and the development of metabonomic profiles
title_full A semiparametric modeling framework for potential biomarker discovery and the development of metabonomic profiles
title_fullStr A semiparametric modeling framework for potential biomarker discovery and the development of metabonomic profiles
title_full_unstemmed A semiparametric modeling framework for potential biomarker discovery and the development of metabonomic profiles
title_short A semiparametric modeling framework for potential biomarker discovery and the development of metabonomic profiles
title_sort semiparametric modeling framework for potential biomarker discovery and the development of metabonomic profiles
url http://www.biomedcentral.com/1471-2105/9/38
work_keys_str_mv AT deydipakk asemiparametricmodelingframeworkforpotentialbiomarkerdiscoveryandthedevelopmentofmetabonomicprofiles
AT grantdavidf asemiparametricmodelingframeworkforpotentialbiomarkerdiscoveryandthedevelopmentofmetabonomicprofiles
AT ghoshsamiran asemiparametricmodelingframeworkforpotentialbiomarkerdiscoveryandthedevelopmentofmetabonomicprofiles
AT hilldennisw asemiparametricmodelingframeworkforpotentialbiomarkerdiscoveryandthedevelopmentofmetabonomicprofiles
AT deydipakk semiparametricmodelingframeworkforpotentialbiomarkerdiscoveryandthedevelopmentofmetabonomicprofiles
AT grantdavidf semiparametricmodelingframeworkforpotentialbiomarkerdiscoveryandthedevelopmentofmetabonomicprofiles
AT ghoshsamiran semiparametricmodelingframeworkforpotentialbiomarkerdiscoveryandthedevelopmentofmetabonomicprofiles
AT hilldennisw semiparametricmodelingframeworkforpotentialbiomarkerdiscoveryandthedevelopmentofmetabonomicprofiles