Nearest neighbor search on embeddings rapidly identifies distant protein relations

Since 1992, all state-of-the-art methods for fast and sensitive identification of evolutionary, structural, and functional relations between proteins (also referred to as “homology detection”) use sequences and sequence-profiles (PSSMs). Protein Language Models (pLMs) generalize sequences, possibly...

Full description

Bibliographic Details
Main Authors: Konstantin Schütze, Michael Heinzinger, Martin Steinegger, Burkhard Rost
Format: Article
Language:English
Published: Frontiers Media S.A. 2022-11-01
Series:Frontiers in Bioinformatics
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fbinf.2022.1033775/full

Similar Items