Mining the Protein Data Bank to Differentiate Error from Structural Variation in Clustered Static Structures: An Examination of HIV Protease

The Protein Data Bank (PDB) contains over 71,000 structures. Extensively studied proteins have hundreds of submissions available, including mutations, different complexes, and space groups, allowing for application of data-mining algorithms to analyze an array of static structures and gain insight a...

Full description

Bibliographic Details
Main Authors: Robert McKenna, Mavis Agbandje-McKenna, Miorel-Lucian Palii, Balasubramanian Venkatakrishnan
Format: Article
Language:English
Published: MDPI AG 2012-03-01
Series:Viruses
Subjects:
Online Access:http://www.mdpi.com/1999-4915/4/3/348/
_version_ 1811201906675023872
author Robert McKenna
Mavis Agbandje-McKenna
Miorel-Lucian Palii
Balasubramanian Venkatakrishnan
author_facet Robert McKenna
Mavis Agbandje-McKenna
Miorel-Lucian Palii
Balasubramanian Venkatakrishnan
author_sort Robert McKenna
collection DOAJ
description The Protein Data Bank (PDB) contains over 71,000 structures. Extensively studied proteins have hundreds of submissions available, including mutations, different complexes, and space groups, allowing for application of data-mining algorithms to analyze an array of static structures and gain insight about a protein’s structural variation and possibly its dynamics. This investigation is a case study of HIV protease (PR) using in-house algorithms for data mining and structure superposition through generalized formulæ that account for multiple conformations and fractional occupancies. Temperature factors (B-factors) are compared with spatial displacement from the mean structure over the entire study set and separately over bound and ligand-free structures, to assess the significance of structural deviation in a statistical context. Space group differences are also examined.
first_indexed 2024-04-12T02:29:48Z
format Article
id doaj.art-4863db47620246a9acda1cf45aa4fbba
institution Directory Open Access Journal
issn 1999-4915
language English
last_indexed 2024-04-12T02:29:48Z
publishDate 2012-03-01
publisher MDPI AG
record_format Article
series Viruses
spelling doaj.art-4863db47620246a9acda1cf45aa4fbba2022-12-22T03:51:50ZengMDPI AGViruses1999-49152012-03-014334836210.3390/v4030348Mining the Protein Data Bank to Differentiate Error from Structural Variation in Clustered Static Structures: An Examination of HIV ProteaseRobert McKennaMavis Agbandje-McKennaMiorel-Lucian PaliiBalasubramanian VenkatakrishnanThe Protein Data Bank (PDB) contains over 71,000 structures. Extensively studied proteins have hundreds of submissions available, including mutations, different complexes, and space groups, allowing for application of data-mining algorithms to analyze an array of static structures and gain insight about a protein’s structural variation and possibly its dynamics. This investigation is a case study of HIV protease (PR) using in-house algorithms for data mining and structure superposition through generalized formulæ that account for multiple conformations and fractional occupancies. Temperature factors (B-factors) are compared with spatial displacement from the mean structure over the entire study set and separately over bound and ligand-free structures, to assess the significance of structural deviation in a statistical context. Space group differences are also examined.http://www.mdpi.com/1999-4915/4/3/348/B-factor and spatial variationdata miningHIV proteasestructure superposition
spellingShingle Robert McKenna
Mavis Agbandje-McKenna
Miorel-Lucian Palii
Balasubramanian Venkatakrishnan
Mining the Protein Data Bank to Differentiate Error from Structural Variation in Clustered Static Structures: An Examination of HIV Protease
Viruses
B-factor and spatial variation
data mining
HIV protease
structure superposition
title Mining the Protein Data Bank to Differentiate Error from Structural Variation in Clustered Static Structures: An Examination of HIV Protease
title_full Mining the Protein Data Bank to Differentiate Error from Structural Variation in Clustered Static Structures: An Examination of HIV Protease
title_fullStr Mining the Protein Data Bank to Differentiate Error from Structural Variation in Clustered Static Structures: An Examination of HIV Protease
title_full_unstemmed Mining the Protein Data Bank to Differentiate Error from Structural Variation in Clustered Static Structures: An Examination of HIV Protease
title_short Mining the Protein Data Bank to Differentiate Error from Structural Variation in Clustered Static Structures: An Examination of HIV Protease
title_sort mining the protein data bank to differentiate error from structural variation in clustered static structures an examination of hiv protease
topic B-factor and spatial variation
data mining
HIV protease
structure superposition
url http://www.mdpi.com/1999-4915/4/3/348/
work_keys_str_mv AT robertmckenna miningtheproteindatabanktodifferentiateerrorfromstructuralvariationinclusteredstaticstructuresanexaminationofhivprotease
AT mavisagbandjemckenna miningtheproteindatabanktodifferentiateerrorfromstructuralvariationinclusteredstaticstructuresanexaminationofhivprotease
AT miorellucianpalii miningtheproteindatabanktodifferentiateerrorfromstructuralvariationinclusteredstaticstructuresanexaminationofhivprotease
AT balasubramanianvenkatakrishnan miningtheproteindatabanktodifferentiateerrorfromstructuralvariationinclusteredstaticstructuresanexaminationofhivprotease