Exploring the composition of protein-ligand binding sites on a large scale.

The residue composition of a ligand binding site determines the interactions available for diffusion-mediated ligand binding, and understanding general composition of these sites is of great importance if we are to gain insight into the functional diversity of the proteome. Many structure-based drug...

Full description

Bibliographic Details
Main Authors: Nickolay A Khazanov, Heather A Carlson
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS Computational Biology
Online Access:http://europepmc.org/articles/PMC3836696?pdf=render
_version_ 1818324899768303616
author Nickolay A Khazanov
Heather A Carlson
author_facet Nickolay A Khazanov
Heather A Carlson
author_sort Nickolay A Khazanov
collection DOAJ
description The residue composition of a ligand binding site determines the interactions available for diffusion-mediated ligand binding, and understanding general composition of these sites is of great importance if we are to gain insight into the functional diversity of the proteome. Many structure-based drug design methods utilize such heuristic information for improving prediction or characterization of ligand-binding sites in proteins of unknown function. The Binding MOAD database if one of the largest curated sets of protein-ligand complexes, and provides a source of diverse, high-quality data for establishing general trends of residue composition from currently available protein structures. We present an analysis of 3,295 non-redundant proteins with 9,114 non-redundant binding sites to identify residues over-represented in binding regions versus the rest of the protein surface. The Binding MOAD database delineates biologically-relevant "valid" ligands from "invalid" small-molecule ligands bound to the protein. Invalids are present in the crystallization medium and serve no known biological function. Contacts are found to differ between these classes of ligands, indicating that residue composition of biologically relevant binding sites is distinct not only from the rest of the protein surface, but also from surface regions capable of opportunistic binding of non-functional small molecules. To confirm these trends, we perform a rigorous analysis of the variation of residue propensity with respect to the size of the dataset and the content bias inherent in structure sets obtained from a large protein structure database. The optimal size of the dataset for establishing general trends of residue propensities, as well as strategies for assessing the significance of such trends, are suggested for future studies of binding-site composition.
first_indexed 2024-12-13T11:35:55Z
format Article
id doaj.art-07b0e3e13af44172a99f00282a9c0c63
institution Directory Open Access Journal
issn 1553-734X
1553-7358
language English
last_indexed 2024-12-13T11:35:55Z
publishDate 2013-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Computational Biology
spelling doaj.art-07b0e3e13af44172a99f00282a9c0c632022-12-21T23:47:47ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582013-01-01911e100332110.1371/journal.pcbi.1003321Exploring the composition of protein-ligand binding sites on a large scale.Nickolay A KhazanovHeather A CarlsonThe residue composition of a ligand binding site determines the interactions available for diffusion-mediated ligand binding, and understanding general composition of these sites is of great importance if we are to gain insight into the functional diversity of the proteome. Many structure-based drug design methods utilize such heuristic information for improving prediction or characterization of ligand-binding sites in proteins of unknown function. The Binding MOAD database if one of the largest curated sets of protein-ligand complexes, and provides a source of diverse, high-quality data for establishing general trends of residue composition from currently available protein structures. We present an analysis of 3,295 non-redundant proteins with 9,114 non-redundant binding sites to identify residues over-represented in binding regions versus the rest of the protein surface. The Binding MOAD database delineates biologically-relevant "valid" ligands from "invalid" small-molecule ligands bound to the protein. Invalids are present in the crystallization medium and serve no known biological function. Contacts are found to differ between these classes of ligands, indicating that residue composition of biologically relevant binding sites is distinct not only from the rest of the protein surface, but also from surface regions capable of opportunistic binding of non-functional small molecules. To confirm these trends, we perform a rigorous analysis of the variation of residue propensity with respect to the size of the dataset and the content bias inherent in structure sets obtained from a large protein structure database. The optimal size of the dataset for establishing general trends of residue propensities, as well as strategies for assessing the significance of such trends, are suggested for future studies of binding-site composition.http://europepmc.org/articles/PMC3836696?pdf=render
spellingShingle Nickolay A Khazanov
Heather A Carlson
Exploring the composition of protein-ligand binding sites on a large scale.
PLoS Computational Biology
title Exploring the composition of protein-ligand binding sites on a large scale.
title_full Exploring the composition of protein-ligand binding sites on a large scale.
title_fullStr Exploring the composition of protein-ligand binding sites on a large scale.
title_full_unstemmed Exploring the composition of protein-ligand binding sites on a large scale.
title_short Exploring the composition of protein-ligand binding sites on a large scale.
title_sort exploring the composition of protein ligand binding sites on a large scale
url http://europepmc.org/articles/PMC3836696?pdf=render
work_keys_str_mv AT nickolayakhazanov exploringthecompositionofproteinligandbindingsitesonalargescale
AT heatheracarlson exploringthecompositionofproteinligandbindingsitesonalargescale