Functional and informatics analysis enables glycosyltransferase activity prediction

The elucidation and prediction of how changes in a protein result in altered activities and selectivities remain a major challenge in chemistry. Two hurdles have prevented accurate family-wide models: obtaining (i) diverse datasets and (ii) suitable parameter frameworks that encapsulate activities i...

Полное описание

Библиографические подробности
Главные авторы: Yang, M, Fehl, C, Lees, KV, Lim, E-K, Offen, WA, Davies, GJ, Bowles, DJ, Davidson, MG, Roberts, SJ, Davis, BG
Формат: Journal article
Язык:English
Опубликовано: Nature Publishing Group 2018
_version_ 1826285091449470976
author Yang, M
Fehl, C
Lees, KV
Lim, E-K
Offen, WA
Davies, GJ
Bowles, DJ
Davidson, MG
Roberts, SJ
Davis, BG
author_facet Yang, M
Fehl, C
Lees, KV
Lim, E-K
Offen, WA
Davies, GJ
Bowles, DJ
Davidson, MG
Roberts, SJ
Davis, BG
author_sort Yang, M
collection OXFORD
description The elucidation and prediction of how changes in a protein result in altered activities and selectivities remain a major challenge in chemistry. Two hurdles have prevented accurate family-wide models: obtaining (i) diverse datasets and (ii) suitable parameter frameworks that encapsulate activities in large sets. Here, we show that a relatively small but broad activity dataset is sufficient to train algorithms for functional prediction over the entire glycosyltransferase superfamily 1 (GT1) of the plant Arabidopsis thaliana. Whereas sequence analysis alone failed for GT1 substrate utilization patterns, our chemical–bioinformatic model, GT-Predict, succeeded by coupling physicochemical features with isozyme-recognition patterns over the family. GT-Predict identified GT1 biocatalysts for novel substrates and enabled functional annotation of uncharacterized GT1s. Finally, analyses of GT-Predict decision pathways revealed structural modulators of substrate recognition, thus providing information on mechanisms. This multifaceted approach to enzyme prediction may guide the streamlined utilization (and design) of biocatalysts and the discovery of other family-wide protein functions.
first_indexed 2024-03-07T01:23:40Z
format Journal article
id oxford-uuid:913410fd-9cc3-4ebd-b6d3-0a67d181a122
institution University of Oxford
language English
last_indexed 2024-03-07T01:23:40Z
publishDate 2018
publisher Nature Publishing Group
record_format dspace
spelling oxford-uuid:913410fd-9cc3-4ebd-b6d3-0a67d181a1222022-03-26T23:17:13ZFunctional and informatics analysis enables glycosyltransferase activity predictionJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:913410fd-9cc3-4ebd-b6d3-0a67d181a122EnglishSymplectic Elements at OxfordNature Publishing Group2018Yang, MFehl, CLees, KVLim, E-KOffen, WADavies, GJBowles, DJDavidson, MGRoberts, SJDavis, BGThe elucidation and prediction of how changes in a protein result in altered activities and selectivities remain a major challenge in chemistry. Two hurdles have prevented accurate family-wide models: obtaining (i) diverse datasets and (ii) suitable parameter frameworks that encapsulate activities in large sets. Here, we show that a relatively small but broad activity dataset is sufficient to train algorithms for functional prediction over the entire glycosyltransferase superfamily 1 (GT1) of the plant Arabidopsis thaliana. Whereas sequence analysis alone failed for GT1 substrate utilization patterns, our chemical–bioinformatic model, GT-Predict, succeeded by coupling physicochemical features with isozyme-recognition patterns over the family. GT-Predict identified GT1 biocatalysts for novel substrates and enabled functional annotation of uncharacterized GT1s. Finally, analyses of GT-Predict decision pathways revealed structural modulators of substrate recognition, thus providing information on mechanisms. This multifaceted approach to enzyme prediction may guide the streamlined utilization (and design) of biocatalysts and the discovery of other family-wide protein functions.
spellingShingle Yang, M
Fehl, C
Lees, KV
Lim, E-K
Offen, WA
Davies, GJ
Bowles, DJ
Davidson, MG
Roberts, SJ
Davis, BG
Functional and informatics analysis enables glycosyltransferase activity prediction
title Functional and informatics analysis enables glycosyltransferase activity prediction
title_full Functional and informatics analysis enables glycosyltransferase activity prediction
title_fullStr Functional and informatics analysis enables glycosyltransferase activity prediction
title_full_unstemmed Functional and informatics analysis enables glycosyltransferase activity prediction
title_short Functional and informatics analysis enables glycosyltransferase activity prediction
title_sort functional and informatics analysis enables glycosyltransferase activity prediction
work_keys_str_mv AT yangm functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction
AT fehlc functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction
AT leeskv functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction
AT limek functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction
AT offenwa functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction
AT daviesgj functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction
AT bowlesdj functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction
AT davidsonmg functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction
AT robertssj functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction
AT davisbg functionalandinformaticsanalysisenablesglycosyltransferaseactivityprediction