Multiple protein-DNA interfaces unravelled by evolutionary information, physico-chemical and geometrical properties.

Interactions between proteins and nucleic acids are at the heart of many essential biological processes. Despite increasing structural information about how these interactions may take place, our understanding of the usage made of protein surfaces by nucleic acids is still very limited. This is in p...

Full description

Bibliographic Details
Main Authors: Flavia Corsi, Richard Lavery, Elodie Laine, Alessandra Carbone
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-02-01
Series:PLoS Computational Biology
Online Access:https://doi.org/10.1371/journal.pcbi.1007624
_version_ 1818716778024402944
author Flavia Corsi
Richard Lavery
Elodie Laine
Alessandra Carbone
author_facet Flavia Corsi
Richard Lavery
Elodie Laine
Alessandra Carbone
author_sort Flavia Corsi
collection DOAJ
description Interactions between proteins and nucleic acids are at the heart of many essential biological processes. Despite increasing structural information about how these interactions may take place, our understanding of the usage made of protein surfaces by nucleic acids is still very limited. This is in part due to the inherent complexity associated to protein surface deformability and evolution. In this work, we present a method that contributes to decipher such complexity by predicting protein-DNA interfaces and characterizing their properties. It relies on three biologically and physically meaningful descriptors, namely evolutionary conservation, physico-chemical properties and surface geometry. We carefully assessed its performance on several hundreds of protein structures and compared it to several machine-learning state-of-the-art methods. Our approach achieves a higher sensitivity compared to the other methods, with a similar precision. Importantly, we show that it is able to unravel 'hidden' binding sites by applying it to unbound protein structures and to proteins binding to DNA via multiple sites and in different conformations. It is also applicable to the detection of RNA-binding sites, without significant loss of performance. This confirms that DNA and RNA-binding sites share similar properties. Our method is implemented as a fully automated tool, [Formula: see text], freely accessible at: http://www.lcqb.upmc.fr/JET2DNA. We also provide a new dataset of 187 protein-DNA complex structures, along with a subset of 82 associated unbound structures. The set represents the largest body of high-resolution crystallographic structures of protein-DNA complexes, use biological protein assemblies as DNA-binding units, and covers all major types of protein-DNA interactions. It is available at: http://www.lcqb.upmc.fr/PDNAbenchmarks.
first_indexed 2024-12-17T19:24:40Z
format Article
id doaj.art-8b5993df380d447e96230e53eead5306
institution Directory Open Access Journal
issn 1553-734X
1553-7358
language English
last_indexed 2024-12-17T19:24:40Z
publishDate 2020-02-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Computational Biology
spelling doaj.art-8b5993df380d447e96230e53eead53062022-12-21T21:35:25ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582020-02-01162e100762410.1371/journal.pcbi.1007624Multiple protein-DNA interfaces unravelled by evolutionary information, physico-chemical and geometrical properties.Flavia CorsiRichard LaveryElodie LaineAlessandra CarboneInteractions between proteins and nucleic acids are at the heart of many essential biological processes. Despite increasing structural information about how these interactions may take place, our understanding of the usage made of protein surfaces by nucleic acids is still very limited. This is in part due to the inherent complexity associated to protein surface deformability and evolution. In this work, we present a method that contributes to decipher such complexity by predicting protein-DNA interfaces and characterizing their properties. It relies on three biologically and physically meaningful descriptors, namely evolutionary conservation, physico-chemical properties and surface geometry. We carefully assessed its performance on several hundreds of protein structures and compared it to several machine-learning state-of-the-art methods. Our approach achieves a higher sensitivity compared to the other methods, with a similar precision. Importantly, we show that it is able to unravel 'hidden' binding sites by applying it to unbound protein structures and to proteins binding to DNA via multiple sites and in different conformations. It is also applicable to the detection of RNA-binding sites, without significant loss of performance. This confirms that DNA and RNA-binding sites share similar properties. Our method is implemented as a fully automated tool, [Formula: see text], freely accessible at: http://www.lcqb.upmc.fr/JET2DNA. We also provide a new dataset of 187 protein-DNA complex structures, along with a subset of 82 associated unbound structures. The set represents the largest body of high-resolution crystallographic structures of protein-DNA complexes, use biological protein assemblies as DNA-binding units, and covers all major types of protein-DNA interactions. It is available at: http://www.lcqb.upmc.fr/PDNAbenchmarks.https://doi.org/10.1371/journal.pcbi.1007624
spellingShingle Flavia Corsi
Richard Lavery
Elodie Laine
Alessandra Carbone
Multiple protein-DNA interfaces unravelled by evolutionary information, physico-chemical and geometrical properties.
PLoS Computational Biology
title Multiple protein-DNA interfaces unravelled by evolutionary information, physico-chemical and geometrical properties.
title_full Multiple protein-DNA interfaces unravelled by evolutionary information, physico-chemical and geometrical properties.
title_fullStr Multiple protein-DNA interfaces unravelled by evolutionary information, physico-chemical and geometrical properties.
title_full_unstemmed Multiple protein-DNA interfaces unravelled by evolutionary information, physico-chemical and geometrical properties.
title_short Multiple protein-DNA interfaces unravelled by evolutionary information, physico-chemical and geometrical properties.
title_sort multiple protein dna interfaces unravelled by evolutionary information physico chemical and geometrical properties
url https://doi.org/10.1371/journal.pcbi.1007624
work_keys_str_mv AT flaviacorsi multipleproteindnainterfacesunravelledbyevolutionaryinformationphysicochemicalandgeometricalproperties
AT richardlavery multipleproteindnainterfacesunravelledbyevolutionaryinformationphysicochemicalandgeometricalproperties
AT elodielaine multipleproteindnainterfacesunravelledbyevolutionaryinformationphysicochemicalandgeometricalproperties
AT alessandracarbone multipleproteindnainterfacesunravelledbyevolutionaryinformationphysicochemicalandgeometricalproperties