The evolution of contact prediction: Evidence that contact selection in statistical contact prediction is changing

Bibliographic Details
Main Authors: Chonofsky, M, de Oliveira, S, Krawczyk, K, Deane, C
Format: Journal article
Language: English
Published: Oxford University Press 2019
Description:
Motivation: Over the last few years, the field of protein structure prediction has been transformed by increasingly accurate contact prediction software. These methods are based on the detection of coevolutionary relationships between residues from multiple sequence alignments (MSAs). However, despite speculation, there is little evidence of a link between contact prediction and the physico-chemical interactions which drive amino-acid coevolution. Furthermore, existing protocols predict only a fraction of all protein contacts, and it is not clear why some contacts are favoured over others. Using a dataset of 863 protein domains, we assessed the physico-chemical interactions of contacts predicted by CCMpred, MetaPSICOV and DNCON2, as examples of direct coupling analysis, meta-prediction and deep learning.

Results: We considered correctly predicted contacts and compared their properties against the protein contacts that were not predicted. Predicted contacts tend to form more bonds than non-predicted contacts, which suggests that they may be more important than the contacts that were not predicted. Comparing the contacts predicted by each method, we found that MetaPSICOV and DNCON2 favour accuracy, whereas CCMpred detects contacts with more bonds. This suggests that the push for higher accuracy may lead to a loss of physico-chemically important contacts. These results underscore the connection between protein physico-chemistry and the coevolutionary couplings that can be derived from MSAs. This relationship is likely to be relevant to protein structure prediction and functional analysis of protein structure and may be key to understanding their utility for different problems in structural biology.

Availability and implementation: We use publicly available databases. Our code is available for download at https://opig.stats.ox.ac.uk/.
Identifier: oxford-uuid:68cc4ec7-7a2f-48ae-9f6c-1f22fb9a31ad
Institution: University of Oxford