Nearest neighbor search on embeddings rapidly identifies distant protein relations
Since 1992, all state-of-the-art methods for fast and sensitive identification of evolutionary, structural, and functional relations between proteins (also referred to as “homology detection”) use sequences and sequence-profiles (PSSMs). Protein Language Models (pLMs) generalize sequences, possibly...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2022-11-01
|
Series: | Frontiers in Bioinformatics |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/fbinf.2022.1033775/full |