Achieving quantitative reproducibility in label-free multisite DIA experiments through multirun alignment

Abstract DIA is a mainstream method for quantitative proteomics, but consistent quantification across multiple LC-MS/MS instruments remains a bottleneck in parallelizing data acquisition. One reason for this inconsistency and missing quantification is the retention time shift which current software...

Full description

Bibliographic Details
Main Authors: Shubham Gupta, Justin C. Sing, Hannes L. Röst
Format: Article
Language:English
Published: Nature Portfolio 2023-10-01
Series:Communications Biology
Online Access:https://doi.org/10.1038/s42003-023-05437-2
_version_ 1797636770431500288
author Shubham Gupta
Justin C. Sing
Hannes L. Röst
author_facet Shubham Gupta
Justin C. Sing
Hannes L. Röst
author_sort Shubham Gupta
collection DOAJ
description Abstract DIA is a mainstream method for quantitative proteomics, but consistent quantification across multiple LC-MS/MS instruments remains a bottleneck in parallelizing data acquisition. One reason for this inconsistency and missing quantification is the retention time shift which current software does not adequately address for runs from multiple sites. We present multirun chromatogram alignment strategies to map peaks across columns, including the traditional reference-based Star method, and two novel approaches: MST and Progressive alignment. These reference-free strategies produce a quantitatively accurate data-matrix, even from heterogeneous multi-column studies. Progressive alignment also generates merged chromatograms from all runs which has not been previously achieved for LC-MS/MS data. First, we demonstrate the effectiveness of multirun alignment strategies on a gold-standard annotated dataset, resulting in a threefold reduction in quantitation error-rate compared to non-aligned DIA results. Subsequently, on a multi-species dataset that DIAlignR effectively controls the quantitative error rate, improves precision in protein measurements, and exhibits conservative peak alignment. We next show that the MST alignment reduces cross-site CV by 50% for highly abundant proteins when applied to a dataset from 11 different LC-MS/MS setups. Finally, the reanalysis of 949 plasma runs with multirun alignment revealed a more than 50% increase in insulin resistance (IR) and respiratory viral infection (RVI) proteins, identifying 11 and 13 proteins respectively, compared to prior analysis without it. The three strategies are implemented in our DIAlignR workflow (>2.3) and can be combined with linear, non-linear, or hybrid pairwise alignment.
first_indexed 2024-03-11T12:38:54Z
format Article
id doaj.art-a98ebd438962461d9d19ed06dd776dc6
institution Directory Open Access Journal
issn 2399-3642
language English
last_indexed 2024-03-11T12:38:54Z
publishDate 2023-10-01
publisher Nature Portfolio
record_format Article
series Communications Biology
spelling doaj.art-a98ebd438962461d9d19ed06dd776dc62023-11-05T12:26:54ZengNature PortfolioCommunications Biology2399-36422023-10-016111210.1038/s42003-023-05437-2Achieving quantitative reproducibility in label-free multisite DIA experiments through multirun alignmentShubham Gupta0Justin C. Sing1Hannes L. Röst2Terrence Donnelly Centre for Cellular & Biomolecular Research, University of TorontoTerrence Donnelly Centre for Cellular & Biomolecular Research, University of TorontoTerrence Donnelly Centre for Cellular & Biomolecular Research, University of TorontoAbstract DIA is a mainstream method for quantitative proteomics, but consistent quantification across multiple LC-MS/MS instruments remains a bottleneck in parallelizing data acquisition. One reason for this inconsistency and missing quantification is the retention time shift which current software does not adequately address for runs from multiple sites. We present multirun chromatogram alignment strategies to map peaks across columns, including the traditional reference-based Star method, and two novel approaches: MST and Progressive alignment. These reference-free strategies produce a quantitatively accurate data-matrix, even from heterogeneous multi-column studies. Progressive alignment also generates merged chromatograms from all runs which has not been previously achieved for LC-MS/MS data. First, we demonstrate the effectiveness of multirun alignment strategies on a gold-standard annotated dataset, resulting in a threefold reduction in quantitation error-rate compared to non-aligned DIA results. Subsequently, on a multi-species dataset that DIAlignR effectively controls the quantitative error rate, improves precision in protein measurements, and exhibits conservative peak alignment. We next show that the MST alignment reduces cross-site CV by 50% for highly abundant proteins when applied to a dataset from 11 different LC-MS/MS setups. Finally, the reanalysis of 949 plasma runs with multirun alignment revealed a more than 50% increase in insulin resistance (IR) and respiratory viral infection (RVI) proteins, identifying 11 and 13 proteins respectively, compared to prior analysis without it. The three strategies are implemented in our DIAlignR workflow (>2.3) and can be combined with linear, non-linear, or hybrid pairwise alignment.https://doi.org/10.1038/s42003-023-05437-2
spellingShingle Shubham Gupta
Justin C. Sing
Hannes L. Röst
Achieving quantitative reproducibility in label-free multisite DIA experiments through multirun alignment
Communications Biology
title Achieving quantitative reproducibility in label-free multisite DIA experiments through multirun alignment
title_full Achieving quantitative reproducibility in label-free multisite DIA experiments through multirun alignment
title_fullStr Achieving quantitative reproducibility in label-free multisite DIA experiments through multirun alignment
title_full_unstemmed Achieving quantitative reproducibility in label-free multisite DIA experiments through multirun alignment
title_short Achieving quantitative reproducibility in label-free multisite DIA experiments through multirun alignment
title_sort achieving quantitative reproducibility in label free multisite dia experiments through multirun alignment
url https://doi.org/10.1038/s42003-023-05437-2
work_keys_str_mv AT shubhamgupta achievingquantitativereproducibilityinlabelfreemultisitediaexperimentsthroughmultirunalignment
AT justincsing achievingquantitativereproducibilityinlabelfreemultisitediaexperimentsthroughmultirunalignment
AT hanneslrost achievingquantitativereproducibilityinlabelfreemultisitediaexperimentsthroughmultirunalignment