A critical analysis of computational protein design with sparse residue interaction graphs.

Protein design algorithms enumerate a combinatorial number of candidate structures to compute the Global Minimum Energy Conformation (GMEC). To efficiently find the GMEC, protein design algorithms must methodically reduce the conformational search space. By applying distance and energy cutoffs, the...

Full description

Bibliographic Details
Main Authors: Swati Jain, Jonathan D Jou, Ivelin S Georgiev, Bruce R Donald
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2017-03-01
Series:PLoS Computational Biology
Online Access:http://europepmc.org/articles/PMC5391103?pdf=render
_version_ 1818193734351716352
author Swati Jain
Jonathan D Jou
Ivelin S Georgiev
Bruce R Donald
author_facet Swati Jain
Jonathan D Jou
Ivelin S Georgiev
Bruce R Donald
author_sort Swati Jain
collection DOAJ
description Protein design algorithms enumerate a combinatorial number of candidate structures to compute the Global Minimum Energy Conformation (GMEC). To efficiently find the GMEC, protein design algorithms must methodically reduce the conformational search space. By applying distance and energy cutoffs, the protein system to be designed can thus be represented using a sparse residue interaction graph, where the number of interacting residue pairs is less than all pairs of mutable residues, and the corresponding GMEC is called the sparse GMEC. However, ignoring some pairwise residue interactions can lead to a change in the energy, conformation, or sequence of the sparse GMEC vs. the original or the full GMEC. Despite the widespread use of sparse residue interaction graphs in protein design, the above mentioned effects of their use have not been previously analyzed. To analyze the costs and benefits of designing with sparse residue interaction graphs, we computed the GMECs for 136 different protein design problems both with and without distance and energy cutoffs, and compared their energies, conformations, and sequences. Our analysis shows that the differences between the GMECs depend critically on whether or not the design includes core, boundary, or surface residues. Moreover, neglecting long-range interactions can alter local interactions and introduce large sequence differences, both of which can result in significant structural and functional changes. Designs on proteins with experimentally measured thermostability show it is beneficial to compute both the full and the sparse GMEC accurately and efficiently. To this end, we show that a provable, ensemble-based algorithm can efficiently compute both GMECs by enumerating a small number of conformations, usually fewer than 1000. This provides a novel way to combine sparse residue interaction graphs with provable, ensemble-based algorithms to reap the benefits of sparse residue interaction graphs while avoiding their potential inaccuracies.
first_indexed 2024-12-12T00:51:06Z
format Article
id doaj.art-3908321b543544069fe12083525a2d74
institution Directory Open Access Journal
issn 1553-734X
1553-7358
language English
last_indexed 2024-12-12T00:51:06Z
publishDate 2017-03-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Computational Biology
spelling doaj.art-3908321b543544069fe12083525a2d742022-12-22T00:44:01ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582017-03-01133e100534610.1371/journal.pcbi.1005346A critical analysis of computational protein design with sparse residue interaction graphs.Swati JainJonathan D JouIvelin S GeorgievBruce R DonaldProtein design algorithms enumerate a combinatorial number of candidate structures to compute the Global Minimum Energy Conformation (GMEC). To efficiently find the GMEC, protein design algorithms must methodically reduce the conformational search space. By applying distance and energy cutoffs, the protein system to be designed can thus be represented using a sparse residue interaction graph, where the number of interacting residue pairs is less than all pairs of mutable residues, and the corresponding GMEC is called the sparse GMEC. However, ignoring some pairwise residue interactions can lead to a change in the energy, conformation, or sequence of the sparse GMEC vs. the original or the full GMEC. Despite the widespread use of sparse residue interaction graphs in protein design, the above mentioned effects of their use have not been previously analyzed. To analyze the costs and benefits of designing with sparse residue interaction graphs, we computed the GMECs for 136 different protein design problems both with and without distance and energy cutoffs, and compared their energies, conformations, and sequences. Our analysis shows that the differences between the GMECs depend critically on whether or not the design includes core, boundary, or surface residues. Moreover, neglecting long-range interactions can alter local interactions and introduce large sequence differences, both of which can result in significant structural and functional changes. Designs on proteins with experimentally measured thermostability show it is beneficial to compute both the full and the sparse GMEC accurately and efficiently. To this end, we show that a provable, ensemble-based algorithm can efficiently compute both GMECs by enumerating a small number of conformations, usually fewer than 1000. This provides a novel way to combine sparse residue interaction graphs with provable, ensemble-based algorithms to reap the benefits of sparse residue interaction graphs while avoiding their potential inaccuracies.http://europepmc.org/articles/PMC5391103?pdf=render
spellingShingle Swati Jain
Jonathan D Jou
Ivelin S Georgiev
Bruce R Donald
A critical analysis of computational protein design with sparse residue interaction graphs.
PLoS Computational Biology
title A critical analysis of computational protein design with sparse residue interaction graphs.
title_full A critical analysis of computational protein design with sparse residue interaction graphs.
title_fullStr A critical analysis of computational protein design with sparse residue interaction graphs.
title_full_unstemmed A critical analysis of computational protein design with sparse residue interaction graphs.
title_short A critical analysis of computational protein design with sparse residue interaction graphs.
title_sort critical analysis of computational protein design with sparse residue interaction graphs
url http://europepmc.org/articles/PMC5391103?pdf=render
work_keys_str_mv AT swatijain acriticalanalysisofcomputationalproteindesignwithsparseresidueinteractiongraphs
AT jonathandjou acriticalanalysisofcomputationalproteindesignwithsparseresidueinteractiongraphs
AT ivelinsgeorgiev acriticalanalysisofcomputationalproteindesignwithsparseresidueinteractiongraphs
AT brucerdonald acriticalanalysisofcomputationalproteindesignwithsparseresidueinteractiongraphs
AT swatijain criticalanalysisofcomputationalproteindesignwithsparseresidueinteractiongraphs
AT jonathandjou criticalanalysisofcomputationalproteindesignwithsparseresidueinteractiongraphs
AT ivelinsgeorgiev criticalanalysisofcomputationalproteindesignwithsparseresidueinteractiongraphs
AT brucerdonald criticalanalysisofcomputationalproteindesignwithsparseresidueinteractiongraphs