Comparing fixed sampling with minimizer sampling when using k-mer indexes to find maximal exact matches.

Bioinformatics applications and pipelines increasingly use k-mer indexes to search for similar sequences. The major problem with k-mer indexes is that they require lots of memory. Sampling is often used to reduce index size and query time. Most applications use one of two major types of sampling: fi...
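
The abstract above is cut off, but the title names the two schemes being compared: fixed sampling and minimizer sampling. The following is a minimal sketch of the general idea behind each, not the authors' implementation. It assumes fixed sampling keeps the k-mer at every w-th position, and that minimizer sampling keeps the lexicographically smallest k-mer in each window of w consecutive k-mers. The function names (`kmers`, `fixed_sample`, `minimizer_sample`) and the parameters `k` and `w` are hypothetical, chosen for illustration.

```python
def kmers(seq, k):
    """Return all (position, k-mer) pairs of seq, in order."""
    return [(i, seq[i:i + k]) for i in range(len(seq) - k + 1)]


def fixed_sample(seq, k, w):
    """Keep the k-mer starting at every w-th position (0, w, 2w, ...)."""
    return [(i, km) for i, km in kmers(seq, k) if i % w == 0]


def minimizer_sample(seq, k, w):
    """Keep, for each window of w consecutive k-mers, the smallest one.

    Assumption: k-mers are ordered lexicographically and ties are
    broken by the leftmost position. Real tools often order by a hash.
    """
    all_kmers = kmers(seq, k)
    chosen = set()
    for start in range(len(all_kmers) - w + 1):
        window = all_kmers[start:start + w]
        # Compare by k-mer text first, then by position to break ties.
        pos, km = min(window, key=lambda p: (p[1], p[0]))
        chosen.add((pos, km))
    return sorted(chosen)


if __name__ == "__main__":
    seq = "ACGTACGTAC"  # toy sequence, not data from the article
    print("all      :", kmers(seq, 3))
    print("fixed    :", fixed_sample(seq, 3, 4))
    print("minimizer:", minimizer_sample(seq, 3, 4))
```

In this sketch both schemes store fewer k-mers than the full list, which is where the index-size savings come from. The number of k-mers kept by fixed sampling depends only on sequence length and `w`, whereas the number kept by minimizer sampling depends on the sequence content.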

Bibliographic Details
Main Authors: Meznah Almutairy, Eric Torng
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2018-01-01
Series: PLoS ONE
Online Access: http://europepmc.org/articles/PMC5794061?pdf=render