Significance analysis of microarray for relative quantitation of LC/MS data in proteomics

<p>Abstract</p> <p>Background</p> <p>Although fold change is a commonly used criterion in quantitative proteomics for differentiating regulated proteins, it does not provide an estimation of false positive and false negative rates that is often desirable in a large-scal...

Full description

Bibliographic Details
Main Authors: Li Qingbo, Roxas Bryan AP
Format: Article
Language:English
Published: BMC 2008-04-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/9/187
_version_ 1828220146702876672
author Li Qingbo
Roxas Bryan AP
author_facet Li Qingbo
Roxas Bryan AP
author_sort Li Qingbo
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>Although fold change is a commonly used criterion in quantitative proteomics for differentiating regulated proteins, it does not provide an estimation of false positive and false negative rates that is often desirable in a large-scale quantitative proteomic analysis. We explore the possibility of applying the Significance Analysis of Microarray (SAM) method (PNAS 98:5116-5121) to a differential proteomics problem of two samples with replicates. The quantitative proteomic analysis was carried out with nanoliquid chromatography/linear iron trap-Fourier transform mass spectrometry. The biological sample model included two <it>Mycobacterium smegmatis </it>unlabeled cell cultures grown at pH 5 and pH 7. The objective was to compare the protein relative abundance between the two unlabeled cell cultures, with an emphasis on significance analysis of protein differential expression using the SAM method. Results using the SAM method are compared with those obtained by fold change and the conventional <it>t</it>-test.</p> <p>Results</p> <p>We have applied the SAM method to solve the two-sample significance analysis problem in liquid chromatography/mass spectrometry (LC/MS) based quantitative proteomics. We grew the pH5 and pH7 unlabelled cell cultures in triplicate resulting in 6 biological replicates. Each biological replicate was mixed with a common <sup>15</sup>N-labeled reference culture cells for normalization prior to SDS/PAGE fractionation and LC/MS analysis. For each biological replicate, one center SDS/PAGE gel fraction was selected for triplicate LC/MS analysis. There were 121 proteins quantified in at least 5 of the 6 biological replicates. Of these 121 proteins, 106 were significant in differential expression by the <it>t</it>-test (<it>p </it>< 0.05) based on peptide-level replicates, 54 were significant in differential expression by SAM with Δ = 0.68 cutoff and false positive rate at 5%, and 29 were significant in differential expression by the <it>t</it>-test (<it>p </it>< 0.05) based on protein-level replicates. The results indicate that SAM appears to overcome the false positives one encounters using the peptide-based <it>t</it>-test while allowing for identification of a greater number of differentially expressed proteins than the protein-based <it>t</it>-test.</p> <p>Conclusion</p> <p>We demonstrate that the SAM method can be adapted for effective significance analysis of proteomic data. It provides much richer information about the protein differential expression profiles and is particularly useful in the estimation of false discovery rates and miss rates.</p>
first_indexed 2024-04-12T16:23:58Z
format Article
id doaj.art-8618d0dda681464d9ee08821dca63136
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-04-12T16:23:58Z
publishDate 2008-04-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-8618d0dda681464d9ee08821dca631362022-12-22T03:25:27ZengBMCBMC Bioinformatics1471-21052008-04-019118710.1186/1471-2105-9-187Significance analysis of microarray for relative quantitation of LC/MS data in proteomicsLi QingboRoxas Bryan AP<p>Abstract</p> <p>Background</p> <p>Although fold change is a commonly used criterion in quantitative proteomics for differentiating regulated proteins, it does not provide an estimation of false positive and false negative rates that is often desirable in a large-scale quantitative proteomic analysis. We explore the possibility of applying the Significance Analysis of Microarray (SAM) method (PNAS 98:5116-5121) to a differential proteomics problem of two samples with replicates. The quantitative proteomic analysis was carried out with nanoliquid chromatography/linear iron trap-Fourier transform mass spectrometry. The biological sample model included two <it>Mycobacterium smegmatis </it>unlabeled cell cultures grown at pH 5 and pH 7. The objective was to compare the protein relative abundance between the two unlabeled cell cultures, with an emphasis on significance analysis of protein differential expression using the SAM method. Results using the SAM method are compared with those obtained by fold change and the conventional <it>t</it>-test.</p> <p>Results</p> <p>We have applied the SAM method to solve the two-sample significance analysis problem in liquid chromatography/mass spectrometry (LC/MS) based quantitative proteomics. We grew the pH5 and pH7 unlabelled cell cultures in triplicate resulting in 6 biological replicates. Each biological replicate was mixed with a common <sup>15</sup>N-labeled reference culture cells for normalization prior to SDS/PAGE fractionation and LC/MS analysis. For each biological replicate, one center SDS/PAGE gel fraction was selected for triplicate LC/MS analysis. There were 121 proteins quantified in at least 5 of the 6 biological replicates. Of these 121 proteins, 106 were significant in differential expression by the <it>t</it>-test (<it>p </it>< 0.05) based on peptide-level replicates, 54 were significant in differential expression by SAM with Δ = 0.68 cutoff and false positive rate at 5%, and 29 were significant in differential expression by the <it>t</it>-test (<it>p </it>< 0.05) based on protein-level replicates. The results indicate that SAM appears to overcome the false positives one encounters using the peptide-based <it>t</it>-test while allowing for identification of a greater number of differentially expressed proteins than the protein-based <it>t</it>-test.</p> <p>Conclusion</p> <p>We demonstrate that the SAM method can be adapted for effective significance analysis of proteomic data. It provides much richer information about the protein differential expression profiles and is particularly useful in the estimation of false discovery rates and miss rates.</p>http://www.biomedcentral.com/1471-2105/9/187
spellingShingle Li Qingbo
Roxas Bryan AP
Significance analysis of microarray for relative quantitation of LC/MS data in proteomics
BMC Bioinformatics
title Significance analysis of microarray for relative quantitation of LC/MS data in proteomics
title_full Significance analysis of microarray for relative quantitation of LC/MS data in proteomics
title_fullStr Significance analysis of microarray for relative quantitation of LC/MS data in proteomics
title_full_unstemmed Significance analysis of microarray for relative quantitation of LC/MS data in proteomics
title_short Significance analysis of microarray for relative quantitation of LC/MS data in proteomics
title_sort significance analysis of microarray for relative quantitation of lc ms data in proteomics
url http://www.biomedcentral.com/1471-2105/9/187
work_keys_str_mv AT liqingbo significanceanalysisofmicroarrayforrelativequantitationoflcmsdatainproteomics
AT roxasbryanap significanceanalysisofmicroarrayforrelativequantitationoflcmsdatainproteomics