Multicoil2: predicting coiled coils and their oligomerization States from sequence in the twilight zone

The alpha-helical coiled coil can adopt a variety of topologies, among the most common of which are parallel and antiparallel dimers and trimers. We present Multicoil2, an algorithm that predicts both the location and oligomerization state (two versus three helices) of coiled coils in protein sequen...

Full description

Bibliographic Details
Main Authors: Gutwin, Karl, Keating, Amy E., Trigg, Jason, Berger Leighton, Bonnie
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format: Article
Language:en_US
Published: Public Library of Science 2011
Online Access:http://hdl.handle.net/1721.1/66190
https://orcid.org/0000-0003-4074-8980
https://orcid.org/0000-0002-2724-7228
_version_ 1811087052865798144
author Gutwin, Karl
Keating, Amy E.
Trigg, Jason
Berger Leighton, Bonnie
author2 Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
author_facet Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Gutwin, Karl
Keating, Amy E.
Trigg, Jason
Berger Leighton, Bonnie
author_sort Gutwin, Karl
collection MIT
description The alpha-helical coiled coil can adopt a variety of topologies, among the most common of which are parallel and antiparallel dimers and trimers. We present Multicoil2, an algorithm that predicts both the location and oligomerization state (two versus three helices) of coiled coils in protein sequences. Multicoil2 combines the pairwise correlations of the previous Multicoil method with the flexibility of Hidden Markov Models (HMMs) in a Markov Random Field (MRF). The resulting algorithm integrates sequence features, including pairwise interactions, through multinomial logistic regression to devise an optimized scoring function for distinguishing dimer, trimer and non-coiled-coil oligomerization states; this scoring function is used to produce Markov Random Field potentials that incorporate pairwise correlations localized in sequence. Multicoil2 significantly improves both coiled-coil detection and dimer versus trimer state prediction over the original Multicoil algorithm retrained on a newly-constructed database of coiled-coil sequences. The new database, comprised of 2,105 sequences containing 124,088 residues, includes reliable structural annotations based on experimental data in the literature. Notably, the enhanced performance of Multicoil2 is evident when tested in stringent leave-family-out cross-validation on the new database, reflecting expected performance on challenging new prediction targets that have minimal sequence similarity to known coiled-coil families. The Multicoil2 program and training database are available for download from http://multicoil2.csail.mit.edu.
first_indexed 2024-09-23T13:39:02Z
format Article
id mit-1721.1/66190
institution Massachusetts Institute of Technology
language en_US
last_indexed 2024-09-23T13:39:02Z
publishDate 2011
publisher Public Library of Science
record_format dspace
spelling mit-1721.1/661902022-09-28T15:17:32Z Multicoil2: predicting coiled coils and their oligomerization States from sequence in the twilight zone Gutwin, Karl Keating, Amy E. Trigg, Jason Berger Leighton, Bonnie Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology. Department of Biology Massachusetts Institute of Technology. Department of Mathematics Berger, Bonnie Trigg, Jason A. Gutwin, Karl Keating, Amy E. Berger, Bonnie The alpha-helical coiled coil can adopt a variety of topologies, among the most common of which are parallel and antiparallel dimers and trimers. We present Multicoil2, an algorithm that predicts both the location and oligomerization state (two versus three helices) of coiled coils in protein sequences. Multicoil2 combines the pairwise correlations of the previous Multicoil method with the flexibility of Hidden Markov Models (HMMs) in a Markov Random Field (MRF). The resulting algorithm integrates sequence features, including pairwise interactions, through multinomial logistic regression to devise an optimized scoring function for distinguishing dimer, trimer and non-coiled-coil oligomerization states; this scoring function is used to produce Markov Random Field potentials that incorporate pairwise correlations localized in sequence. Multicoil2 significantly improves both coiled-coil detection and dimer versus trimer state prediction over the original Multicoil algorithm retrained on a newly-constructed database of coiled-coil sequences. The new database, comprised of 2,105 sequences containing 124,088 residues, includes reliable structural annotations based on experimental data in the literature. Notably, the enhanced performance of Multicoil2 is evident when tested in stringent leave-family-out cross-validation on the new database, reflecting expected performance on challenging new prediction targets that have minimal sequence similarity to known coiled-coil families. The Multicoil2 program and training database are available for download from http://multicoil2.csail.mit.edu. National Institutes of Health (U.S.) (Grant 1R01GM081871) National Science Foundation (U.S.) (Grant MCB-0347203) National Science Foundation (U.S.) (Grant 0821391) 2011-10-05T17:34:39Z 2011-10-05T17:34:39Z 2011-08 2011-04 Article http://purl.org/eprint/type/JournalArticle 1932-6203 http://hdl.handle.net/1721.1/66190 Trigg, Jason et al. “Multicoil2: Predicting Coiled Coils and Their Oligomerization States from Sequence in the Twilight Zone.” Ed. Ozlem Keskin. PLoS ONE 6 (8) (2011): e23519. © 2011 Trigg et al. https://orcid.org/0000-0003-4074-8980 https://orcid.org/0000-0002-2724-7228 en_US http://dx.doi.org/10.1371/journal.pone.0023519 PLoS one Creative Commons Attribution http://creativecommons.org/licenses/by/2.5/ application/pdf Public Library of Science PLoS
spellingShingle Gutwin, Karl
Keating, Amy E.
Trigg, Jason
Berger Leighton, Bonnie
Multicoil2: predicting coiled coils and their oligomerization States from sequence in the twilight zone
title Multicoil2: predicting coiled coils and their oligomerization States from sequence in the twilight zone
title_full Multicoil2: predicting coiled coils and their oligomerization States from sequence in the twilight zone
title_fullStr Multicoil2: predicting coiled coils and their oligomerization States from sequence in the twilight zone
title_full_unstemmed Multicoil2: predicting coiled coils and their oligomerization States from sequence in the twilight zone
title_short Multicoil2: predicting coiled coils and their oligomerization States from sequence in the twilight zone
title_sort multicoil2 predicting coiled coils and their oligomerization states from sequence in the twilight zone
url http://hdl.handle.net/1721.1/66190
https://orcid.org/0000-0003-4074-8980
https://orcid.org/0000-0002-2724-7228
work_keys_str_mv AT gutwinkarl multicoil2predictingcoiledcoilsandtheiroligomerizationstatesfromsequenceinthetwilightzone
AT keatingamye multicoil2predictingcoiledcoilsandtheiroligomerizationstatesfromsequenceinthetwilightzone
AT triggjason multicoil2predictingcoiledcoilsandtheiroligomerizationstatesfromsequenceinthetwilightzone
AT bergerleightonbonnie multicoil2predictingcoiledcoilsandtheiroligomerizationstatesfromsequenceinthetwilightzone