Protein Condensate Atlas from predictive models of heteromolecular condensate composition

Biomolecular condensates help cells organise their content in space and time. Cells harbour a variety of condensate types with diverse composition and many are likely yet to be discovered. Here, we develop a methodology to predict the composition of biomolecular condensates. We first analyse availab...

Full description

Bibliographic Details
Main Authors: Saar, KL, Scrutton, RM, Bloznelyte, K, Morgunov, AS, Good, LL, Lee, AA, Teichmann, SA, Knowles, TPJ
Format: Journal article
Language:English
Published: Nature Research 2024
_version_ 1811140061290299392
author Saar, KL
Scrutton, RM
Bloznelyte, K
Morgunov, AS
Good, LL
Lee, AA
Teichmann, SA
Knowles, TPJ
author_facet Saar, KL
Scrutton, RM
Bloznelyte, K
Morgunov, AS
Good, LL
Lee, AA
Teichmann, SA
Knowles, TPJ
author_sort Saar, KL
collection OXFORD
description Biomolecular condensates help cells organise their content in space and time. Cells harbour a variety of condensate types with diverse composition and many are likely yet to be discovered. Here, we develop a methodology to predict the composition of biomolecular condensates. We first analyse available proteomics data of cellular condensates and find that the biophysical features that determine protein localisation into condensates differ from known drivers of homotypic phase separation processes, with charge mediated protein-RNA and hydrophobicity mediated protein-protein interactions playing a key role in the former process. We then develop a machine learning model that links protein sequence to its propensity to localise into heteromolecular condensates. We apply the model across the proteome and find many of the top-ranked targets outside the original training data to localise into condensates as confirmed by orthogonal immunohistochemical staining imaging. Finally, we segment the condensation-prone proteome into condensate types based on an overlap with biomolecular interaction profiles to generate a Protein Condensate Atlas. Several condensate clusters within the Atlas closely match the composition of experimentally characterised condensates or regions within them, suggesting that the Atlas can be valuable for identifying additional components within known condensate systems and discovering previously uncharacterised condensates.
first_indexed 2024-09-25T04:15:59Z
format Journal article
id oxford-uuid:bcd6ef15-a649-4423-bf64-d5a56fb22ca3
institution University of Oxford
language English
last_indexed 2024-09-25T04:15:59Z
publishDate 2024
publisher Nature Research
record_format dspace
spelling oxford-uuid:bcd6ef15-a649-4423-bf64-d5a56fb22ca32024-07-19T20:03:41ZProtein Condensate Atlas from predictive models of heteromolecular condensate compositionJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:bcd6ef15-a649-4423-bf64-d5a56fb22ca3EnglishJisc Publications RouterNature Research2024Saar, KLScrutton, RMBloznelyte, KMorgunov, ASGood, LLLee, AATeichmann, SAKnowles, TPJBiomolecular condensates help cells organise their content in space and time. Cells harbour a variety of condensate types with diverse composition and many are likely yet to be discovered. Here, we develop a methodology to predict the composition of biomolecular condensates. We first analyse available proteomics data of cellular condensates and find that the biophysical features that determine protein localisation into condensates differ from known drivers of homotypic phase separation processes, with charge mediated protein-RNA and hydrophobicity mediated protein-protein interactions playing a key role in the former process. We then develop a machine learning model that links protein sequence to its propensity to localise into heteromolecular condensates. We apply the model across the proteome and find many of the top-ranked targets outside the original training data to localise into condensates as confirmed by orthogonal immunohistochemical staining imaging. Finally, we segment the condensation-prone proteome into condensate types based on an overlap with biomolecular interaction profiles to generate a Protein Condensate Atlas. Several condensate clusters within the Atlas closely match the composition of experimentally characterised condensates or regions within them, suggesting that the Atlas can be valuable for identifying additional components within known condensate systems and discovering previously uncharacterised condensates.
spellingShingle Saar, KL
Scrutton, RM
Bloznelyte, K
Morgunov, AS
Good, LL
Lee, AA
Teichmann, SA
Knowles, TPJ
Protein Condensate Atlas from predictive models of heteromolecular condensate composition
title Protein Condensate Atlas from predictive models of heteromolecular condensate composition
title_full Protein Condensate Atlas from predictive models of heteromolecular condensate composition
title_fullStr Protein Condensate Atlas from predictive models of heteromolecular condensate composition
title_full_unstemmed Protein Condensate Atlas from predictive models of heteromolecular condensate composition
title_short Protein Condensate Atlas from predictive models of heteromolecular condensate composition
title_sort protein condensate atlas from predictive models of heteromolecular condensate composition
work_keys_str_mv AT saarkl proteincondensateatlasfrompredictivemodelsofheteromolecularcondensatecomposition
AT scruttonrm proteincondensateatlasfrompredictivemodelsofheteromolecularcondensatecomposition
AT bloznelytek proteincondensateatlasfrompredictivemodelsofheteromolecularcondensatecomposition
AT morgunovas proteincondensateatlasfrompredictivemodelsofheteromolecularcondensatecomposition
AT goodll proteincondensateatlasfrompredictivemodelsofheteromolecularcondensatecomposition
AT leeaa proteincondensateatlasfrompredictivemodelsofheteromolecularcondensatecomposition
AT teichmannsa proteincondensateatlasfrompredictivemodelsofheteromolecularcondensatecomposition
AT knowlestpj proteincondensateatlasfrompredictivemodelsofheteromolecularcondensatecomposition