Improvements in viral gene annotation using large language models and soft alignments
Abstract. Background: The annotation of protein sequences in public databases has long posed a challenge in molecular biology. This issue is particularly acute for viral proteins, which demonstrate limited homology to known proteins when using alignment, k-mer, or profile-based homology search approac...
Main authors: | |
---|---|
Format: | Article |
Language: | English |
Published: | BMC, 2024-04-01 |
Series: | BMC Bioinformatics |
Subjects: | |
Online access: | https://doi.org/10.1186/s12859-024-05779-6 |