Matchtigs: minimum plain text representation of k-mer sets

Abstract We propose a polynomial algorithm computing a minimum plain-text representation of k-mer sets, as well as an efficient near-minimum greedy heuristic. When compressing read sets of large model organisms or bacterial pangenomes, with only a minor runtime increase, we shrink the representation...

Full description

Bibliographic Details
Main Authors: Sebastian Schmidt, Shahbaz Khan, Jarno N. Alanko, Giulio E. Pibiri, Alexandru I. Tomescu
Format: Article
Language:English
Published: BMC 2023-06-01
Series:Genome Biology
Subjects:
Online Access:https://doi.org/10.1186/s13059-023-02968-z