Metagenome Proteins and Database Contamination

ABSTRACT Continued influx of metagenome-derived proteins with misannotated taxonomy into conventional databases, including RefSeq, threatens to eliminate the value of taxonomy identifiers. To prevent this, urgent efforts should be undertaken by submitters of metagenomic data sets as well as by database managers.

Bibliographic Details
Main Author: Irina R. Arkhipova
Format: Article
Language: English
Published: American Society for Microbiology 2020-12-01
Series: mSphere
Subjects: MAG, RefSeq, binning, classification, metagenomics, taxonomy
Online Access: https://journals.asm.org/doi/10.1128/mSphere.00854-20
_version_ 1818935983051112448
author Irina R. Arkhipova
collection DOAJ
description ABSTRACT Continued influx of metagenome-derived proteins with misannotated taxonomy into conventional databases, including RefSeq, threatens to eliminate the value of taxonomy identifiers. To prevent this, urgent efforts should be undertaken by submitters of metagenomic data sets as well as by database managers.
first_indexed 2024-12-20T05:28:50Z
format Article
id doaj.art-508685c8200644ed90da2b8a279b33e7
institution Directory Open Access Journal
issn 2379-5042
language English
last_indexed 2024-12-20T05:28:50Z
publishDate 2020-12-01
publisher American Society for Microbiology
record_format Article
series mSphere
spelling doaj.art-508685c8200644ed90da2b8a279b33e7 | 2022-12-21T19:51:49Z | eng | American Society for Microbiology | mSphere | 2379-5042 | 2020-12-01 | 5 | 6 | 10.1128/mSphere.00854-20 | Metagenome Proteins and Database Contamination | Irina R. Arkhipova | 0 | Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, Massachusetts, USA | ABSTRACT Continued influx of metagenome-derived proteins with misannotated taxonomy into conventional databases, including RefSeq, threatens to eliminate the value of taxonomy identifiers. To prevent this, urgent efforts should be undertaken by submitters of metagenomic data sets as well as by database managers. | https://journals.asm.org/doi/10.1128/mSphere.00854-20 | MAG | RefSeq | binning | classification | metagenomics | taxonomy
title Metagenome Proteins and Database Contamination
topic MAG
RefSeq
binning
classification
metagenomics
taxonomy
url https://journals.asm.org/doi/10.1128/mSphere.00854-20