Functional and informatics analysis enables glycosyltransferase activity prediction
The elucidation and prediction of how changes in a protein result in altered activities and selectivities remain a major challenge in chemistry. Two hurdles have prevented accurate family-wide models: obtaining (i) diverse datasets and (ii) suitable parameter frameworks that encapsulate activities i...
Главные авторы: | , , , , , , , , , |
---|---|
Формат: | Journal article |
Язык: | English |
Опубликовано: |
Nature Publishing Group
2018
|
_version_ | 1826285091449470976 |
---|---|
author | Yang, M Fehl, C Lees, KV Lim, E-K Offen, WA Davies, GJ Bowles, DJ Davidson, MG Roberts, SJ Davis, BG |
author_facet | Yang, M Fehl, C Lees, KV Lim, E-K Offen, WA Davies, GJ Bowles, DJ Davidson, MG Roberts, SJ Davis, BG |
author_sort | Yang, M |
collection | OXFORD |
description | The elucidation and prediction of how changes in a protein result in altered activities and selectivities remain a major challenge in chemistry. Two hurdles have prevented accurate family-wide models: obtaining (i) diverse datasets and (ii) suitable parameter frameworks that encapsulate activities in large sets. Here, we show that a relatively small but broad activity dataset is sufficient to train algorithms for functional prediction over the entire glycosyltransferase superfamily 1 (GT1) of the plant Arabidopsis thaliana. Whereas sequence analysis alone failed for GT1 substrate utilization patterns, our chemical–bioinformatic model, GT-Predict, succeeded by coupling physicochemical features with isozyme-recognition patterns over the family. GT-Predict identified GT1 biocatalysts for novel substrates and enabled functional annotation of uncharacterized GT1s. Finally, analyses of GT-Predict decision pathways revealed structural modulators of substrate recognition, thus providing information on mechanisms. This multifaceted approach to enzyme prediction may guide the streamlined utilization (and design) of biocatalysts and the discovery of other family-wide protein functions. |
first_indexed | 2024-03-07T01:23:40Z |
format | Journal article |
id | oxford-uuid:913410fd-9cc3-4ebd-b6d3-0a67d181a122 |
institution | University of Oxford |
language | English |
last_indexed | 2024-03-07T01:23:40Z |
publishDate | 2018 |
publisher | Nature Publishing Group |
record_format | dspace |
spelling | oxford-uuid:913410fd-9cc3-4ebd-b6d3-0a67d181a1222022-03-26T23:17:13ZFunctional and informatics analysis enables glycosyltransferase activity predictionJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:913410fd-9cc3-4ebd-b6d3-0a67d181a122EnglishSymplectic Elements at OxfordNature Publishing Group2018Yang, MFehl, CLees, KVLim, E-KOffen, WADavies, GJBowles, DJDavidson, MGRoberts, SJDavis, BGThe elucidation and prediction of how changes in a protein result in altered activities and selectivities remain a major challenge in chemistry. Two hurdles have prevented accurate family-wide models: obtaining (i) diverse datasets and (ii) suitable parameter frameworks that encapsulate activities in large sets. Here, we show that a relatively small but broad activity dataset is sufficient to train algorithms for functional prediction over the entire glycosyltransferase superfamily 1 (GT1) of the plant Arabidopsis thaliana. Whereas sequence analysis alone failed for GT1 substrate utilization patterns, our chemical–bioinformatic model, GT-Predict, succeeded by coupling physicochemical features with isozyme-recognition patterns over the family. GT-Predict identified GT1 biocatalysts for novel substrates and enabled functional annotation of uncharacterized GT1s. Finally, analyses of GT-Predict decision pathways revealed structural modulators of substrate recognition, thus providing information on mechanisms. This multifaceted approach to enzyme prediction may guide the streamlined utilization (and design) of biocatalysts and the discovery of other family-wide protein functions. |
spellingShingle | Yang, M Fehl, C Lees, KV Lim, E-K Offen, WA Davies, GJ Bowles, DJ Davidson, MG Roberts, SJ Davis, BG Functional and informatics analysis enables glycosyltransferase activity prediction |
title | Functional and informatics analysis enables glycosyltransferase activity prediction |
title_full | Functional and informatics analysis enables glycosyltransferase activity prediction |
title_fullStr | Functional and informatics analysis enables glycosyltransferase activity prediction |
title_full_unstemmed | Functional and informatics analysis enables glycosyltransferase activity prediction |
title_short | Functional and informatics analysis enables glycosyltransferase activity prediction |
title_sort | functional and informatics analysis enables glycosyltransferase activity prediction |
work_keys_str_mv | AT yangm functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction AT fehlc functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction AT leeskv functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction AT limek functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction AT offenwa functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction AT daviesgj functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction AT bowlesdj functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction AT davidsonmg functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction AT robertssj functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction AT davisbg functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction |