STAT: a fast, scalable, MinHash-based k-mer tool to assess Sequence Read Archive next-generation sequence submissions

Abstract Sequence Read Archive submissions to the National Center for Biotechnology Information often lack useful metadata, which limits the utility of these submissions. We describe the Sequence Taxonomic Analysis Tool (STAT), a scalable k-mer-based tool for fast assessment of taxonomic diversity i...

Full description

Bibliographic Details
Main Authors: Kenneth S. Katz, Oleg Shutov, Richard Lapoint, Michael Kimelman, J. Rodney Brister, Christopher O’Sullivan
Format: Article
Language:English
Published: BMC 2021-09-01
Series:Genome Biology
Subjects:
Online Access:https://doi.org/10.1186/s13059-021-02490-0