Survey of Protein Sequence Embedding Models

Derived from the natural language processing (NLP) algorithms, protein language models enable the encoding of protein sequences, which are widely diverse in length and amino acid composition, in fixed-size numerical vectors (embeddings). We surveyed representative embedding models such as Esm, Esm1b...

Full description

Bibliographic Details
Main Authors: Chau Tran, Siddharth Khadkikar, Aleksey Porollo
Format: Article
Language:English
Published: MDPI AG 2023-02-01
Series:International Journal of Molecular Sciences
Subjects:
Online Access:https://www.mdpi.com/1422-0067/24/4/3775