Improvements in viral gene annotation using large language models and soft alignments

Abstract Background The annotation of protein sequences in public databases has long posed a challenge in molecular biology. This issue is particularly acute for viral proteins, which demonstrate limited homology to known proteins when using alignment, k-mer, or profile-based homology search approac...

Descripció completa

Dades bibliogràfiques
Autors principals: William L. Harrigan, Barbra D. Ferrell, K. Eric Wommack, Shawn W. Polson, Zachary D. Schreiber, Mahdi Belcaid
Format: Article
Idioma:English
Publicat: BMC 2024-04-01
Col·lecció:BMC Bioinformatics
Matèries:
Accés en línia:https://doi.org/10.1186/s12859-024-05779-6