Enhancing Evolutionary Couplings with Deep Convolutional Neural Networks

While genes are defined by sequence, in biological systems a protein's function is largely determined by its three-dimensional structure. Evolutionary information embedded within multiple sequence alignments provides a rich source of data for inferring structural constraints on macromolecules....

Full description

Bibliographic Details
Main Authors: Liu, Yang, Ye, Qing, Peng, Jian, Palmedo, Peter Franklin, Berger Leighton, Bonnie
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format: Article
Published: Elsevier 2018
Online Access:http://hdl.handle.net/1721.1/115385
https://orcid.org/0000-0002-2724-7228
_version_ 1826197946126827520
author Liu, Yang
Ye, Qing
Peng, Jian
Palmedo, Peter Franklin
Berger Leighton, Bonnie
author2 Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
author_facet Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Liu, Yang
Ye, Qing
Peng, Jian
Palmedo, Peter Franklin
Berger Leighton, Bonnie
author_sort Liu, Yang
collection MIT
description While genes are defined by sequence, in biological systems a protein's function is largely determined by its three-dimensional structure. Evolutionary information embedded within multiple sequence alignments provides a rich source of data for inferring structural constraints on macromolecules. Still, many proteins of interest lack sufficient numbers of related sequences, leading to noisy, error-prone residue-residue contact predictions. Here we introduce DeepContact, a convolutional neural network (CNN)-based approach that discovers co-evolutionary motifs and leverages these patterns to enable accurate inference of contact probabilities, particularly when few related sequences are available. DeepContact significantly improves performance over previous methods, including in the CASP12 blind contact prediction task where we achieved top performance with another CNN-based approach. Moreover, our tool converts hard-to-interpret coupling scores into probabilities, moving the field toward a consistent metric to assess contact prediction across diverse proteins. Through substantially improving the precision-recall behavior of contact prediction, DeepContact suggests we are near a paradigm shift in template-free modeling for protein structure prediction. Many protein structures of interest remain out of reach for both computational prediction and experimental determination. DeepContact learns patterns of co-evolution across thousands of experimentally determined structures, identifying conserved local motifs and leveraging this information to improve protein residue-residue contact predictions. DeepContact extracts additional information from the evolutionary couplings using its knowledge of co-evolution and structural space, while also converting coupling scores into probabilities that are comparable across protein sequences and alignments. Keywords: contact prediction; convolutional neural networks; deep learning; protein structure prediction; structure prediction; co-evolution; evolutionary couplings
first_indexed 2024-09-23T10:56:11Z
format Article
id mit-1721.1/115385
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T10:56:11Z
publishDate 2018
publisher Elsevier
record_format dspace
spelling mit-1721.1/1153852022-09-27T16:02:07Z Enhancing Evolutionary Couplings with Deep Convolutional Neural Networks Liu, Yang Ye, Qing Peng, Jian Palmedo, Peter Franklin Berger Leighton, Bonnie Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Massachusetts Institute of Technology. Department of Mathematics Palmedo, Peter Franklin Berger Leighton, Bonnie While genes are defined by sequence, in biological systems a protein's function is largely determined by its three-dimensional structure. Evolutionary information embedded within multiple sequence alignments provides a rich source of data for inferring structural constraints on macromolecules. Still, many proteins of interest lack sufficient numbers of related sequences, leading to noisy, error-prone residue-residue contact predictions. Here we introduce DeepContact, a convolutional neural network (CNN)-based approach that discovers co-evolutionary motifs and leverages these patterns to enable accurate inference of contact probabilities, particularly when few related sequences are available. DeepContact significantly improves performance over previous methods, including in the CASP12 blind contact prediction task where we achieved top performance with another CNN-based approach. Moreover, our tool converts hard-to-interpret coupling scores into probabilities, moving the field toward a consistent metric to assess contact prediction across diverse proteins. Through substantially improving the precision-recall behavior of contact prediction, DeepContact suggests we are near a paradigm shift in template-free modeling for protein structure prediction. Many protein structures of interest remain out of reach for both computational prediction and experimental determination. DeepContact learns patterns of co-evolution across thousands of experimentally determined structures, identifying conserved local motifs and leveraging this information to improve protein residue-residue contact predictions. DeepContact extracts additional information from the evolutionary couplings using its knowledge of co-evolution and structural space, while also converting coupling scores into probabilities that are comparable across protein sequences and alignments. Keywords: contact prediction; convolutional neural networks; deep learning; protein structure prediction; structure prediction; co-evolution; evolutionary couplings National Institutes of Health (U.S.) (Grant R01GM081871) 2018-05-16T13:26:52Z 2018-05-16T13:26:52Z 2017-12 2017-10 2018-05-15T18:41:59Z Article http://purl.org/eprint/type/JournalArticle 2405-4712 http://hdl.handle.net/1721.1/115385 Liu, Yang et al. “Enhancing Evolutionary Couplings with Deep Convolutional Neural Networks.” Cell Systems 6, 1 (January 2018): 65–74 © 2017 The Authors https://orcid.org/0000-0002-2724-7228 http://dx.doi.org/10.1016/J.CELS.2017.11.014 Cell Systems Creative Commons Attribution-NonCommercial-NoDerivs License http://creativecommons.org/licenses/by-nc-nd/4.0/ application/pdf Elsevier Elsevier
spellingShingle Liu, Yang
Ye, Qing
Peng, Jian
Palmedo, Peter Franklin
Berger Leighton, Bonnie
Enhancing Evolutionary Couplings with Deep Convolutional Neural Networks
title Enhancing Evolutionary Couplings with Deep Convolutional Neural Networks
title_full Enhancing Evolutionary Couplings with Deep Convolutional Neural Networks
title_fullStr Enhancing Evolutionary Couplings with Deep Convolutional Neural Networks
title_full_unstemmed Enhancing Evolutionary Couplings with Deep Convolutional Neural Networks
title_short Enhancing Evolutionary Couplings with Deep Convolutional Neural Networks
title_sort enhancing evolutionary couplings with deep convolutional neural networks
url http://hdl.handle.net/1721.1/115385
https://orcid.org/0000-0002-2724-7228
work_keys_str_mv AT liuyang enhancingevolutionarycouplingswithdeepconvolutionalneuralnetworks
AT yeqing enhancingevolutionarycouplingswithdeepconvolutionalneuralnetworks
AT pengjian enhancingevolutionarycouplingswithdeepconvolutionalneuralnetworks
AT palmedopeterfranklin enhancingevolutionarycouplingswithdeepconvolutionalneuralnetworks
AT bergerleightonbonnie enhancingevolutionarycouplingswithdeepconvolutionalneuralnetworks