Nearest neighbor search on embeddings rapidly identifies distant protein relations
Since 1992, all state-of-the-art methods for fast and sensitive identification of evolutionary, structural, and functional relations between proteins (also referred to as “homology detection”) use sequences and sequence-profiles (PSSMs). Protein Language Models (pLMs) generalize sequences, possibly...
Main Authors: | Konstantin Schütze, Michael Heinzinger, Martin Steinegger, Burkhard Rost |
---|---|
Format: | Article |
Language: | English |
Published: | Frontiers Media S.A., 2022-11-01 |
Series: | Frontiers in Bioinformatics |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/fbinf.2022.1033775/full |
Similar Items
- Non-zero probability of nearest neighbor searching
  by: A. Mesrikhani, et al.
  Published: (2017-03-01)
- Approximate Nearest Neighbor Search by Residual Vector Quantization
  by: Cheng Wang, et al.
  Published: (2010-12-01)
- Approximate Nearest Neighbor Search Using Enhanced Accumulative Quantization
  by: Liefu Ai, et al.
  Published: (2022-07-01)
- Improving Natural Language Person Description Search from Videos with Language Model Fine-Tuning and Approximate Nearest Neighbor
  by: Sumeth Yuenyong, et al.
  Published: (2022-11-01)
- Secure Cloud-Aided Approximate Nearest Neighbor Search on High-Dimensional Data
  by: Jia Liu, et al.
  Published: (2023-01-01)