A database of domain definitions for proteins with complex interdomain geometry.

Protein structural domains are necessary for understanding evolution and protein folding, and may vary widely from functional and sequence based domains. Although, various structural domain databases exist, defining domains for some proteins is non-trivial, and definitions of their domain boundaries...

Full description

Bibliographic Details
Main Authors: Indraneel Majumdar, Lisa N Kinch, Nick V Grishin
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2009-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC2662426?pdf=render
_version_ 1818155838061150208
author Indraneel Majumdar
Lisa N Kinch
Nick V Grishin
author_facet Indraneel Majumdar
Lisa N Kinch
Nick V Grishin
author_sort Indraneel Majumdar
collection DOAJ
description Protein structural domains are necessary for understanding evolution and protein folding, and may vary widely from functional and sequence based domains. Although, various structural domain databases exist, defining domains for some proteins is non-trivial, and definitions of their domain boundaries are not available. Here, we present a novel database of manually defined structural domains for a representative set of proteins from the SCOP "multi-domain proteins" class. (http://prodata.swmed.edu/multidom/). We consider our domains as mobile evolutionary units, which may rearrange during protein evolution. Additionally, they may be visualized as structurally compact and possibly independently folding units. We also found that representing domains as evolutionary and folding units do not always lead to a unique domain definition. However, unlike existing databases, we retain and refine these "alternate" domain definitions after careful inspection of structural similarity, functional sites and automated domain definition methods. We provide domain definitions, including actual residue boundaries, for proteins that well known databases like SCOP and CATH do not attempt to split. Our alternate domain definitions are suitable for sequence and structure searches by automated methods. Additionally, the database can be used for training and testing domain delineation algorithms. Since our domains represent structurally compact evolutionary units, the database may be useful for studying domain properties and evolution.
first_indexed 2024-12-11T14:48:45Z
format Article
id doaj.art-948bfdc394a84f6cae595230eae4411c
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-11T14:48:45Z
publishDate 2009-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-948bfdc394a84f6cae595230eae4411c2022-12-22T01:01:35ZengPublic Library of Science (PLoS)PLoS ONE1932-62032009-01-0144e508410.1371/journal.pone.0005084A database of domain definitions for proteins with complex interdomain geometry.Indraneel MajumdarLisa N KinchNick V GrishinProtein structural domains are necessary for understanding evolution and protein folding, and may vary widely from functional and sequence based domains. Although, various structural domain databases exist, defining domains for some proteins is non-trivial, and definitions of their domain boundaries are not available. Here, we present a novel database of manually defined structural domains for a representative set of proteins from the SCOP "multi-domain proteins" class. (http://prodata.swmed.edu/multidom/). We consider our domains as mobile evolutionary units, which may rearrange during protein evolution. Additionally, they may be visualized as structurally compact and possibly independently folding units. We also found that representing domains as evolutionary and folding units do not always lead to a unique domain definition. However, unlike existing databases, we retain and refine these "alternate" domain definitions after careful inspection of structural similarity, functional sites and automated domain definition methods. We provide domain definitions, including actual residue boundaries, for proteins that well known databases like SCOP and CATH do not attempt to split. Our alternate domain definitions are suitable for sequence and structure searches by automated methods. Additionally, the database can be used for training and testing domain delineation algorithms. Since our domains represent structurally compact evolutionary units, the database may be useful for studying domain properties and evolution.http://europepmc.org/articles/PMC2662426?pdf=render
spellingShingle Indraneel Majumdar
Lisa N Kinch
Nick V Grishin
A database of domain definitions for proteins with complex interdomain geometry.
PLoS ONE
title A database of domain definitions for proteins with complex interdomain geometry.
title_full A database of domain definitions for proteins with complex interdomain geometry.
title_fullStr A database of domain definitions for proteins with complex interdomain geometry.
title_full_unstemmed A database of domain definitions for proteins with complex interdomain geometry.
title_short A database of domain definitions for proteins with complex interdomain geometry.
title_sort database of domain definitions for proteins with complex interdomain geometry
url http://europepmc.org/articles/PMC2662426?pdf=render
work_keys_str_mv AT indraneelmajumdar adatabaseofdomaindefinitionsforproteinswithcomplexinterdomaingeometry
AT lisankinch adatabaseofdomaindefinitionsforproteinswithcomplexinterdomaingeometry
AT nickvgrishin adatabaseofdomaindefinitionsforproteinswithcomplexinterdomaingeometry
AT indraneelmajumdar databaseofdomaindefinitionsforproteinswithcomplexinterdomaingeometry
AT lisankinch databaseofdomaindefinitionsforproteinswithcomplexinterdomaingeometry
AT nickvgrishin databaseofdomaindefinitionsforproteinswithcomplexinterdomaingeometry