SAFlex: A structural alphabet extension to integrate protein structural flexibility and missing data information.

In this paper, we describe SAFlex (Structural Alphabet Flexibility), an extension of an existing structural alphabet (HMM-SA) designed to make better use of the growing body of protein three-dimensional structure data by encoding protein conformations even when residues are missing or uncertain. An SA aims t...


Bibliographic Details
Main Authors: Ikram Allam, Delphine Flatters, Géraldine Caumes, Leslie Regad, Vincent Delos, Gregory Nuel, Anne-Claude Camproux
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2018-01-01
Series: PLoS ONE
Online Access: http://europepmc.org/articles/PMC6033379?pdf=render
author Ikram Allam
Delphine Flatters
Géraldine Caumes
Leslie Regad
Vincent Delos
Gregory Nuel
Anne-Claude Camproux
collection DOAJ
description In this paper, we describe SAFlex (Structural Alphabet Flexibility), an extension of an existing structural alphabet (HMM-SA) designed to make better use of the growing body of protein three-dimensional structure data by encoding protein conformations even when residues are missing or uncertain. A structural alphabet (SA) reduces the complexity of analyzing and comparing three-dimensional protein conformations by simplifying any conformation into a series of structural letters. Our methodology presents several novelties. First, it accounts for encoding uncertainty by providing a range of encoding options: the maximum a posteriori encoding, the marginal posterior distribution, and the effective number of letters at each position. Second, our new algorithm handles missing data in protein structure files (which affect more than 75% of the proteins in the Protein Data Bank) within a rigorous probabilistic framework. Third, SAFlex can encode, and build a consensus encoding from, different replicates of a single protein, such as several homomer chains. This makes it possible to localize structural differences between chains and to detect structural variability, which is essential for identifying protein flexibility. These improvements are illustrated on several proteins, including the crystal structure of a eukaryotic small heat shock protein. They are promising for exploring the growing volume of redundant protein structure data and for obtaining a useful quantification of protein flexibility.
first_indexed 2024-12-23T19:46:47Z
format Article
id doaj.art-fa4eb1feb90e462489338045fa85ef4d
institution Directory of Open Access Journals
issn 1932-6203
language English
last_indexed 2024-12-23T19:46:47Z
publishDate 2018-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-fa4eb1feb90e462489338045fa85ef4d; 2022-12-21T17:33:31Z; eng; Public Library of Science (PLoS); PLoS ONE; 1932-6203; 2018-01-01; 13(7): e0198854; 10.1371/journal.pone.0198854; SAFlex: A structural alphabet extension to integrate protein structural flexibility and missing data information.; Ikram Allam, Delphine Flatters, Géraldine Caumes, Leslie Regad, Vincent Delos, Gregory Nuel, Anne-Claude Camproux; http://europepmc.org/articles/PMC6033379?pdf=render
title SAFlex: A structural alphabet extension to integrate protein structural flexibility and missing data information.
url http://europepmc.org/articles/PMC6033379?pdf=render
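The description names three encoding outputs (a maximum a posteriori encoding, a marginal posterior distribution, and an effective number of letters per position) and a probabilistic treatment of missing data. The sketch below illustrates these ideas on a toy hidden Markov model and is not the SAFlex implementation. Everything in it is an assumption made for illustration: the three-letter alphabet, the one-dimensional Gaussian emissions, the parameter values, the function names, the choice to treat a missing observation as uninformative (likelihood 1 for every letter), and the definition of the effective number of letters as the exponential of the Shannon entropy of the posterior. The per-position argmax shown here is a simple stand-in and is not the same thing as a joint maximum a posteriori path.

```python
import math

def gaussian_pdf(x, mean, sd):
    """Density of a 1-D Gaussian (toy emission model, an assumption)."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))

def posterior_marginals(obs, init, trans, means, sds):
    """Scaled forward-backward; returns P(letter at t | all observations)."""
    n = len(init)
    # A missing observation (None) is marginalized out, which amounts to
    # an emission likelihood of 1 for every hidden letter.
    emis = [[1.0 if x is None else gaussian_pdf(x, means[k], sds[k])
             for k in range(n)] for x in obs]
    # Forward pass, normalized at each step to avoid underflow.
    fwd, prev = [], None
    for t, e in enumerate(emis):
        if t == 0:
            a = [init[k] * e[k] for k in range(n)]
        else:
            a = [e[k] * sum(prev[j] * trans[j][k] for j in range(n))
                 for k in range(n)]
        total = sum(a)
        prev = [v / total for v in a]
        fwd.append(prev)
    # Backward pass, also normalized at each step.
    bwd = [[1.0] * n for _ in obs]
    for t in range(len(obs) - 2, -1, -1):
        b = [sum(trans[k][j] * emis[t + 1][j] * bwd[t + 1][j]
                 for j in range(n)) for k in range(n)]
        total = sum(b)
        bwd[t] = [v / total for v in b]
    # Combine and renormalize to get the marginal posterior per position.
    post = []
    for f, b in zip(fwd, bwd):
        g = [f[k] * b[k] for k in range(n)]
        total = sum(g)
        post.append([v / total for v in g])
    return post

def effective_number(dist):
    """exp(Shannon entropy): 1 for a certain letter, n for a uniform one."""
    entropy = -sum(p * math.log(p) for p in dist if p > 0.0)
    return math.exp(entropy)

# Hypothetical toy model: three letters with well-separated emissions.
LETTERS = ["A", "B", "C"]
INIT = [1.0 / 3.0] * 3
TRANS = [[0.8, 0.1, 0.1], [0.1, 0.8, 0.1], [0.1, 0.1, 0.8]]
MEANS = [0.0, 5.0, 10.0]
SDS = [1.0, 1.0, 1.0]
OBS = [0.1, -0.2, None, 5.1, 4.9]  # the third position is missing

post = posterior_marginals(OBS, INIT, TRANS, MEANS, SDS)
best = [LETTERS[max(range(3), key=lambda k: p[k])] for p in post]
neq = [effective_number(p) for p in post]

for t, (letter, p, n_eff) in enumerate(zip(best, post, neq)):
    print(t, letter, [round(v, 3) for v in p], round(n_eff, 2))
```

In this toy run the missing position is informed only by its neighbors through the transition matrix, so its posterior is spread over several letters and its effective number of letters is higher than at the observed positions.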