A database application for pre-processing, storage and comparison of mass spectra derived from patients and controls

<p>Abstract</p> <p>Background</p> <p>Statistical comparison of peptide profiles in biomarker discovery requires fast, user-friendly software for high throughput data analysis. Important features are flexibility in changing input variables and statistical analysis of pep...

Full description

Bibliographic Details
Main Authors: Sillevis Smitt Peter A, Heeren Ron MA, van Rijswijk Angelique LCT, Dekker Lennard J, Siccama Ivar, Titulaer Mark K, Luider Theo M
Format: Article
Language:English
Published: BMC 2006-09-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/7/403
_version_ 1811266762755276800
author Sillevis Smitt Peter A
Heeren Ron MA
van Rijswijk Angelique LCT
Dekker Lennard J
Siccama Ivar
Titulaer Mark K
Luider Theo M
author_facet Sillevis Smitt Peter A
Heeren Ron MA
van Rijswijk Angelique LCT
Dekker Lennard J
Siccama Ivar
Titulaer Mark K
Luider Theo M
author_sort Sillevis Smitt Peter A
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>Statistical comparison of peptide profiles in biomarker discovery requires fast, user-friendly software for high throughput data analysis. Important features are flexibility in changing input variables and statistical analysis of peptides that are differentially expressed between patient and control groups. In addition, integration the mass spectrometry data with the results of other experiments, such as microarray analysis, and information from other databases requires a central storage of the profile matrix, where protein id's can be added to peptide masses of interest.</p> <p>Results</p> <p>A new database application is presented, to detect and identify significantly differentially expressed peptides in peptide profiles obtained from body fluids of patient and control groups. The presented modular software is capable of central storage of mass spectra and results in fast analysis. The software architecture consists of 4 pillars, 1) a Graphical User Interface written in Java, 2) a MySQL database, which contains all metadata, such as experiment numbers and sample codes, 3) a FTP (File Transport Protocol) server to store all raw mass spectrometry files and processed data, and 4) the software package R, which is used for modular statistical calculations, such as the Wilcoxon-Mann-Whitney rank sum test. Statistic analysis by the Wilcoxon-Mann-Whitney test in R demonstrates that peptide-profiles of two patient groups 1) breast cancer patients with leptomeningeal metastases and 2) prostate cancer patients in end stage disease can be distinguished from those of control groups.</p> <p>Conclusion</p> <p>The database application is capable to distinguish patient Matrix Assisted Laser Desorption Ionization (MALDI-TOF) peptide profiles from control groups using large size datasets. The modular architecture of the application makes it possible to adapt the application to handle also large sized data from MS/MS- and Fourier Transform Ion Cyclotron Resonance (FT-ICR) mass spectrometry experiments. It is expected that the higher resolution and mass accuracy of the FT-ICR mass spectrometry prevents the clustering of peaks of different peptides and allows the identification of differentially expressed proteins from the peptide profiles.</p>
first_indexed 2024-04-12T20:49:09Z
format Article
id doaj.art-657bf35245a8454d90a380bbf510677f
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-04-12T20:49:09Z
publishDate 2006-09-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-657bf35245a8454d90a380bbf510677f2022-12-22T03:17:10ZengBMCBMC Bioinformatics1471-21052006-09-017140310.1186/1471-2105-7-403A database application for pre-processing, storage and comparison of mass spectra derived from patients and controlsSillevis Smitt Peter AHeeren Ron MAvan Rijswijk Angelique LCTDekker Lennard JSiccama IvarTitulaer Mark KLuider Theo M<p>Abstract</p> <p>Background</p> <p>Statistical comparison of peptide profiles in biomarker discovery requires fast, user-friendly software for high throughput data analysis. Important features are flexibility in changing input variables and statistical analysis of peptides that are differentially expressed between patient and control groups. In addition, integration the mass spectrometry data with the results of other experiments, such as microarray analysis, and information from other databases requires a central storage of the profile matrix, where protein id's can be added to peptide masses of interest.</p> <p>Results</p> <p>A new database application is presented, to detect and identify significantly differentially expressed peptides in peptide profiles obtained from body fluids of patient and control groups. The presented modular software is capable of central storage of mass spectra and results in fast analysis. The software architecture consists of 4 pillars, 1) a Graphical User Interface written in Java, 2) a MySQL database, which contains all metadata, such as experiment numbers and sample codes, 3) a FTP (File Transport Protocol) server to store all raw mass spectrometry files and processed data, and 4) the software package R, which is used for modular statistical calculations, such as the Wilcoxon-Mann-Whitney rank sum test. Statistic analysis by the Wilcoxon-Mann-Whitney test in R demonstrates that peptide-profiles of two patient groups 1) breast cancer patients with leptomeningeal metastases and 2) prostate cancer patients in end stage disease can be distinguished from those of control groups.</p> <p>Conclusion</p> <p>The database application is capable to distinguish patient Matrix Assisted Laser Desorption Ionization (MALDI-TOF) peptide profiles from control groups using large size datasets. The modular architecture of the application makes it possible to adapt the application to handle also large sized data from MS/MS- and Fourier Transform Ion Cyclotron Resonance (FT-ICR) mass spectrometry experiments. It is expected that the higher resolution and mass accuracy of the FT-ICR mass spectrometry prevents the clustering of peaks of different peptides and allows the identification of differentially expressed proteins from the peptide profiles.</p>http://www.biomedcentral.com/1471-2105/7/403
spellingShingle Sillevis Smitt Peter A
Heeren Ron MA
van Rijswijk Angelique LCT
Dekker Lennard J
Siccama Ivar
Titulaer Mark K
Luider Theo M
A database application for pre-processing, storage and comparison of mass spectra derived from patients and controls
BMC Bioinformatics
title A database application for pre-processing, storage and comparison of mass spectra derived from patients and controls
title_full A database application for pre-processing, storage and comparison of mass spectra derived from patients and controls
title_fullStr A database application for pre-processing, storage and comparison of mass spectra derived from patients and controls
title_full_unstemmed A database application for pre-processing, storage and comparison of mass spectra derived from patients and controls
title_short A database application for pre-processing, storage and comparison of mass spectra derived from patients and controls
title_sort database application for pre processing storage and comparison of mass spectra derived from patients and controls
url http://www.biomedcentral.com/1471-2105/7/403
work_keys_str_mv AT sillevissmittpetera adatabaseapplicationforpreprocessingstorageandcomparisonofmassspectraderivedfrompatientsandcontrols
AT heerenronma adatabaseapplicationforpreprocessingstorageandcomparisonofmassspectraderivedfrompatientsandcontrols
AT vanrijswijkangeliquelct adatabaseapplicationforpreprocessingstorageandcomparisonofmassspectraderivedfrompatientsandcontrols
AT dekkerlennardj adatabaseapplicationforpreprocessingstorageandcomparisonofmassspectraderivedfrompatientsandcontrols
AT siccamaivar adatabaseapplicationforpreprocessingstorageandcomparisonofmassspectraderivedfrompatientsandcontrols
AT titulaermarkk adatabaseapplicationforpreprocessingstorageandcomparisonofmassspectraderivedfrompatientsandcontrols
AT luidertheom adatabaseapplicationforpreprocessingstorageandcomparisonofmassspectraderivedfrompatientsandcontrols
AT sillevissmittpetera databaseapplicationforpreprocessingstorageandcomparisonofmassspectraderivedfrompatientsandcontrols
AT heerenronma databaseapplicationforpreprocessingstorageandcomparisonofmassspectraderivedfrompatientsandcontrols
AT vanrijswijkangeliquelct databaseapplicationforpreprocessingstorageandcomparisonofmassspectraderivedfrompatientsandcontrols
AT dekkerlennardj databaseapplicationforpreprocessingstorageandcomparisonofmassspectraderivedfrompatientsandcontrols
AT siccamaivar databaseapplicationforpreprocessingstorageandcomparisonofmassspectraderivedfrompatientsandcontrols
AT titulaermarkk databaseapplicationforpreprocessingstorageandcomparisonofmassspectraderivedfrompatientsandcontrols
AT luidertheom databaseapplicationforpreprocessingstorageandcomparisonofmassspectraderivedfrompatientsandcontrols