Complet+: a computationally scalable method to improve completeness of large-scale protein sequence clustering

A major challenge for clustering algorithms is to balance the trade-off between homogeneity, i.e., the degree to which an individual cluster includes only related sequences, and completeness, the degree to which related sequences are broken up into multiple clusters. Most algorithms are conservative...

Full description

Bibliographic Details
Main Authors: Rachel Nguyen, Bahrad A. Sokhansanj, Robi Polikar, Gail L. Rosen
Format: Article
Language:English
Published: PeerJ Inc. 2023-02-01
Series:PeerJ
Subjects:
Online Access:https://peerj.com/articles/14779.pdf