The Current State-of-the-Art Identification of Unknown Proteins Using Mass Spectrometry Exemplified on De Novo Sequencing of a Venom Protease from <i>Bothrops moojeni</i>

(1) Background: The amino acid sequence elucidation of peptides from the gas phase fragmentation mass spectra, de novo sequencing, is a valuable method for the identification of unknown proteins complementary to Edman sequencing. It is increasingly used in shot-gun mass spectrometry (MS)-based prote...

Full description

Bibliographic Details
Main Authors: Simone König, Wolfgang M. J. Obermann, Johannes A. Eble
Format: Article
Language:English
Published: MDPI AG 2022-08-01
Series:Molecules
Subjects:
Online Access:https://www.mdpi.com/1420-3049/27/15/4976
_version_ 1797412895490834432
author Simone König
Wolfgang M. J. Obermann
Johannes A. Eble
author_facet Simone König
Wolfgang M. J. Obermann
Johannes A. Eble
author_sort Simone König
collection DOAJ
description (1) Background: The amino acid sequence elucidation of peptides from the gas phase fragmentation mass spectra, de novo sequencing, is a valuable method for the identification of unknown proteins complementary to Edman sequencing. It is increasingly used in shot-gun mass spectrometry (MS)-based proteomics experiments. We review the current state-of-the-art and use the identification of an unknown snake venom protein targeting the human tissue factor (TF) as an example to describe the analysis process based on manual spectrum interrogation. (2) Methods: The immobilized TF was incubated with a crude <i>B. moojeni</i> venom solution. The potential binding partners were eluted and further purified by gel electrophoresis. Edman degradation was performed to elucidate the N-terminus of the 31 kDa protein of interest. High-resolution MS with collision-induced dissociation was employed to generate peptide fragmentation spectra. Sequence tags were deduced and used for searches in the NCBI and Uniprot databases. Protein matches from the snake species were further validated by target MS/MS. (3) Results: Sequence tag D [K/Q] D [I/L] VDD [K/Q] led to a snake venom serine protease (SVSP) from lancehead <i>B. jararaca</i> (P81824). With target MS/MS, 24% of the SVSP sequence were confirmed; an additional 41% were tentatively assigned by data-independent MS. Edman sequencing provided information for 10 N-terminal amino acid residues, also confirming the match to SVSP. (4) Conclusions: The identification of unknown proteins continues to be a challenge despite major advances in MS instrumentation and bioinformatic tools. The main requirement is the generation of meaningful, high-quality MS peptide fragmentation spectra. These are used to elucidate sufficiently long sequence tags, which can subsequently be submitted to searches in protein databases. This basic method does not require extensive bioinformatics because peptide MS/MS spectra, especially of doubly-charged ions, can be analysed manually. We demonstrated the procedure with the elucidation of SVSP. While de novo sequencing quickly indicates the correct protein group, the validation of the entire protein sequence of amino acid-by-amino acid will take time. Reasons are the need to properly assign isobaric amino acid residues and modifications. With the ongoing efforts in genomics and transcriptomics and the availability of ever more data in public databases, the need for de novo MS sequencing will decrease. Still, not every animal and plant species will be sequenced, so the combination of MS and Edman sequencing will continue to be of importance for the identification of unknown proteins.
first_indexed 2024-03-09T05:09:53Z
format Article
id doaj.art-3acf8b963f064e1f912bd51279ffa91c
institution Directory Open Access Journal
issn 1420-3049
language English
last_indexed 2024-03-09T05:09:53Z
publishDate 2022-08-01
publisher MDPI AG
record_format Article
series Molecules
spelling doaj.art-3acf8b963f064e1f912bd51279ffa91c2023-12-03T12:50:51ZengMDPI AGMolecules1420-30492022-08-012715497610.3390/molecules27154976The Current State-of-the-Art Identification of Unknown Proteins Using Mass Spectrometry Exemplified on De Novo Sequencing of a Venom Protease from <i>Bothrops moojeni</i>Simone König0Wolfgang M. J. Obermann1Johannes A. Eble2IZKF Core Unit Proteomics, Interdisciplinary Center for Clinical Research, University of Münster, Röntgenstr. 21, 48149 Münster, GermanyInstitute of Physiological Chemistry and Pathobiochemistry, University of Münster, Waldeyer-Str. 15, 48149 Münster, GermanyInstitute of Physiological Chemistry and Pathobiochemistry, University of Münster, Waldeyer-Str. 15, 48149 Münster, Germany(1) Background: The amino acid sequence elucidation of peptides from the gas phase fragmentation mass spectra, de novo sequencing, is a valuable method for the identification of unknown proteins complementary to Edman sequencing. It is increasingly used in shot-gun mass spectrometry (MS)-based proteomics experiments. We review the current state-of-the-art and use the identification of an unknown snake venom protein targeting the human tissue factor (TF) as an example to describe the analysis process based on manual spectrum interrogation. (2) Methods: The immobilized TF was incubated with a crude <i>B. moojeni</i> venom solution. The potential binding partners were eluted and further purified by gel electrophoresis. Edman degradation was performed to elucidate the N-terminus of the 31 kDa protein of interest. High-resolution MS with collision-induced dissociation was employed to generate peptide fragmentation spectra. Sequence tags were deduced and used for searches in the NCBI and Uniprot databases. Protein matches from the snake species were further validated by target MS/MS. (3) Results: Sequence tag D [K/Q] D [I/L] VDD [K/Q] led to a snake venom serine protease (SVSP) from lancehead <i>B. jararaca</i> (P81824). With target MS/MS, 24% of the SVSP sequence were confirmed; an additional 41% were tentatively assigned by data-independent MS. Edman sequencing provided information for 10 N-terminal amino acid residues, also confirming the match to SVSP. (4) Conclusions: The identification of unknown proteins continues to be a challenge despite major advances in MS instrumentation and bioinformatic tools. The main requirement is the generation of meaningful, high-quality MS peptide fragmentation spectra. These are used to elucidate sufficiently long sequence tags, which can subsequently be submitted to searches in protein databases. This basic method does not require extensive bioinformatics because peptide MS/MS spectra, especially of doubly-charged ions, can be analysed manually. We demonstrated the procedure with the elucidation of SVSP. While de novo sequencing quickly indicates the correct protein group, the validation of the entire protein sequence of amino acid-by-amino acid will take time. Reasons are the need to properly assign isobaric amino acid residues and modifications. With the ongoing efforts in genomics and transcriptomics and the availability of ever more data in public databases, the need for de novo MS sequencing will decrease. Still, not every animal and plant species will be sequenced, so the combination of MS and Edman sequencing will continue to be of importance for the identification of unknown proteins.https://www.mdpi.com/1420-3049/27/15/4976snake venomgas phase peptide ion fragmentationmass spectrometryspectrum qualityMS/MSCID
spellingShingle Simone König
Wolfgang M. J. Obermann
Johannes A. Eble
The Current State-of-the-Art Identification of Unknown Proteins Using Mass Spectrometry Exemplified on De Novo Sequencing of a Venom Protease from <i>Bothrops moojeni</i>
Molecules
snake venom
gas phase peptide ion fragmentation
mass spectrometry
spectrum quality
MS/MS
CID
title The Current State-of-the-Art Identification of Unknown Proteins Using Mass Spectrometry Exemplified on De Novo Sequencing of a Venom Protease from <i>Bothrops moojeni</i>
title_full The Current State-of-the-Art Identification of Unknown Proteins Using Mass Spectrometry Exemplified on De Novo Sequencing of a Venom Protease from <i>Bothrops moojeni</i>
title_fullStr The Current State-of-the-Art Identification of Unknown Proteins Using Mass Spectrometry Exemplified on De Novo Sequencing of a Venom Protease from <i>Bothrops moojeni</i>
title_full_unstemmed The Current State-of-the-Art Identification of Unknown Proteins Using Mass Spectrometry Exemplified on De Novo Sequencing of a Venom Protease from <i>Bothrops moojeni</i>
title_short The Current State-of-the-Art Identification of Unknown Proteins Using Mass Spectrometry Exemplified on De Novo Sequencing of a Venom Protease from <i>Bothrops moojeni</i>
title_sort current state of the art identification of unknown proteins using mass spectrometry exemplified on de novo sequencing of a venom protease from i bothrops moojeni i
topic snake venom
gas phase peptide ion fragmentation
mass spectrometry
spectrum quality
MS/MS
CID
url https://www.mdpi.com/1420-3049/27/15/4976
work_keys_str_mv AT simonekonig thecurrentstateoftheartidentificationofunknownproteinsusingmassspectrometryexemplifiedondenovosequencingofavenomproteasefromibothropsmoojenii
AT wolfgangmjobermann thecurrentstateoftheartidentificationofunknownproteinsusingmassspectrometryexemplifiedondenovosequencingofavenomproteasefromibothropsmoojenii
AT johannesaeble thecurrentstateoftheartidentificationofunknownproteinsusingmassspectrometryexemplifiedondenovosequencingofavenomproteasefromibothropsmoojenii
AT simonekonig currentstateoftheartidentificationofunknownproteinsusingmassspectrometryexemplifiedondenovosequencingofavenomproteasefromibothropsmoojenii
AT wolfgangmjobermann currentstateoftheartidentificationofunknownproteinsusingmassspectrometryexemplifiedondenovosequencingofavenomproteasefromibothropsmoojenii
AT johannesaeble currentstateoftheartidentificationofunknownproteinsusingmassspectrometryexemplifiedondenovosequencingofavenomproteasefromibothropsmoojenii