Latent generative landscapes as maps of functional diversity in protein sequence space

Abstract Variational autoencoders are unsupervised learning models with generative capabilities, when applied to protein data, they classify sequences by phylogeny and generate de novo sequences which preserve statistical properties of protein composition. While previous studies focus on clustering...

Full description

Bibliographic Details
Main Authors: Cheyenne Ziegler, Jonathan Martin, Claude Sinner, Faruck Morcos
Format: Article
Language:English
Published: Nature Portfolio 2023-04-01
Series:Nature Communications
Online Access:https://doi.org/10.1038/s41467-023-37958-z