NeuralBeds: Neural embeddings for efficient DNA data compression and optimized similarity search

The availability of high throughput sequencing tools coupled with the declining costs in the production of DNA sequences has led to the generation of enormous amounts of omics data curated in several databases such as NCBI and EMBL. Identification of similar DNA sequences from these databases is one...

Full description

Bibliographic Details
Main Authors: Oluwafemi A. Sarumi, Maximilian Hahn, Dominik Heider
Format: Article
Language:English
Published: Elsevier 2024-12-01
Series:Computational and Structural Biotechnology Journal
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2001037023005214