Discovery of protein complexes with core-attachment structures from tandem affinity purification (TAP) data
Many cellular functions involve protein complexes that are formed by multiple interacting proteins. Tandem Affinity Purification (TAP) is a popular experimental method for detecting such multi-protein interactions. However, current computational methods that predict protein complexes from TAP data r...
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Journal Article |
Language: | English |
Published: |
2013
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/107218 http://hdl.handle.net/10220/17736 |
_version_ | 1811696274713346048 |
---|---|
author | Wu, Min Li, Xiaoli Kwoh, Chee Keong Ng, See-Kiong Wong, Limsoon |
author2 | School of Computer Engineering |
author_facet | School of Computer Engineering Wu, Min Li, Xiaoli Kwoh, Chee Keong Ng, See-Kiong Wong, Limsoon |
author_sort | Wu, Min |
collection | NTU |
description | Many cellular functions involve protein complexes that are formed by multiple interacting proteins. Tandem Affinity Purification (TAP) is a popular experimental method for detecting such multi-protein interactions. However, current computational methods that predict protein complexes from TAP data require converting the co-complex relationships in TAP data into binary interactions. The resulting pairwise protein-protein interaction (PPI) network is then mined for densely connected regions that are identified as putative protein complexes. Converting the TAP data into PPI data not only introduces errors but also loses useful information about the underlying multi-protein relationships that can be exploited to detect the internal organization (i.e., core-attachment structures) of protein complexes. In this article, we propose a method called CACHET that detects protein complexes with Core-AttaCHment structures directly from bipartitETAP data. CACHET models the TAP data as a bipartite graph in which the two vertex sets are the baits and the preys, respectively. The edges between the two vertex sets represent bait-prey relationships. CACHET first focuses on detecting high-quality protein-complex cores from the bipartite graph. To minimize the effects of false positive interactions, the bait-prey relationships are indexed with reliability scores. Only non-redundant, reliable bicliques computed from the TAP bipartite graph are regarded as protein-complex cores. CACHET constructs protein complexes by including attachment proteins into the cores. We applied CACHET on large-scale TAP datasets and found that CACHET outperformed existing methods in terms of prediction accuracy (i.e., F-measure and functional homogeneity of predicted complexes). In addition, the protein complexes predicted by CACHET are equipped with core-attachment structures that provide useful biological insights into the inherent functional organization of protein complexes. Our supplementary material can be found at http://www1.i2r.a-star.edu.sg/xlli/CACHET/CACHET.htm; binary executables can also be found there. Supplementary Material is also available at www.liebertonline.com/cmb. |
first_indexed | 2024-10-01T07:36:46Z |
format | Journal Article |
id | ntu-10356/107218 |
institution | Nanyang Technological University |
language | English |
last_indexed | 2024-10-01T07:36:46Z |
publishDate | 2013 |
record_format | dspace |
spelling | ntu-10356/1072182022-02-16T16:31:10Z Discovery of protein complexes with core-attachment structures from tandem affinity purification (TAP) data Wu, Min Li, Xiaoli Kwoh, Chee Keong Ng, See-Kiong Wong, Limsoon School of Computer Engineering DRNTU::Engineering::Computer science and engineering::Computer applications Many cellular functions involve protein complexes that are formed by multiple interacting proteins. Tandem Affinity Purification (TAP) is a popular experimental method for detecting such multi-protein interactions. However, current computational methods that predict protein complexes from TAP data require converting the co-complex relationships in TAP data into binary interactions. The resulting pairwise protein-protein interaction (PPI) network is then mined for densely connected regions that are identified as putative protein complexes. Converting the TAP data into PPI data not only introduces errors but also loses useful information about the underlying multi-protein relationships that can be exploited to detect the internal organization (i.e., core-attachment structures) of protein complexes. In this article, we propose a method called CACHET that detects protein complexes with Core-AttaCHment structures directly from bipartitETAP data. CACHET models the TAP data as a bipartite graph in which the two vertex sets are the baits and the preys, respectively. The edges between the two vertex sets represent bait-prey relationships. CACHET first focuses on detecting high-quality protein-complex cores from the bipartite graph. To minimize the effects of false positive interactions, the bait-prey relationships are indexed with reliability scores. Only non-redundant, reliable bicliques computed from the TAP bipartite graph are regarded as protein-complex cores. CACHET constructs protein complexes by including attachment proteins into the cores. We applied CACHET on large-scale TAP datasets and found that CACHET outperformed existing methods in terms of prediction accuracy (i.e., F-measure and functional homogeneity of predicted complexes). In addition, the protein complexes predicted by CACHET are equipped with core-attachment structures that provide useful biological insights into the inherent functional organization of protein complexes. Our supplementary material can be found at http://www1.i2r.a-star.edu.sg/xlli/CACHET/CACHET.htm; binary executables can also be found there. Supplementary Material is also available at www.liebertonline.com/cmb. Published Version 2013-11-15T07:47:19Z 2019-12-06T22:26:57Z 2013-11-15T07:47:19Z 2019-12-06T22:26:57Z 2012 2012 Journal Article Wu, M., Li, X., Kwoh, C. K., Ng, S. K., & Wong, L. (2012). Discovery of protein complexes with core-attachment structures from tandem affinity purification (TAP) data. Journal of computational biology, 19(9), 1027-1042. https://hdl.handle.net/10356/107218 http://hdl.handle.net/10220/17736 10.1089/cmb.2010.0293 21777084 en Journal of computational biology © 2012 Mary Ann Liebert. This paper was published in Journal of Computational Biology and is made available as an electronic reprint (preprint) with permission of Mary Ann Liebert. The paper can be found at the following official DOI: [http://dx.doi.org/10.1089/cmb.2010.0293]. One print or electronic copy may be made for personal use only. Systematic or multiple reproduction, distribution to multiple locations via electronic or other means, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper is prohibited and is subject to penalties under law. application/pdf |
spellingShingle | DRNTU::Engineering::Computer science and engineering::Computer applications Wu, Min Li, Xiaoli Kwoh, Chee Keong Ng, See-Kiong Wong, Limsoon Discovery of protein complexes with core-attachment structures from tandem affinity purification (TAP) data |
title | Discovery of protein complexes with core-attachment structures from tandem affinity purification (TAP) data |
title_full | Discovery of protein complexes with core-attachment structures from tandem affinity purification (TAP) data |
title_fullStr | Discovery of protein complexes with core-attachment structures from tandem affinity purification (TAP) data |
title_full_unstemmed | Discovery of protein complexes with core-attachment structures from tandem affinity purification (TAP) data |
title_short | Discovery of protein complexes with core-attachment structures from tandem affinity purification (TAP) data |
title_sort | discovery of protein complexes with core attachment structures from tandem affinity purification tap data |
topic | DRNTU::Engineering::Computer science and engineering::Computer applications |
url | https://hdl.handle.net/10356/107218 http://hdl.handle.net/10220/17736 |
work_keys_str_mv | AT wumin discoveryofproteincomplexeswithcoreattachmentstructuresfromtandemaffinitypurificationtapdata AT lixiaoli discoveryofproteincomplexeswithcoreattachmentstructuresfromtandemaffinitypurificationtapdata AT kwohcheekeong discoveryofproteincomplexeswithcoreattachmentstructuresfromtandemaffinitypurificationtapdata AT ngseekiong discoveryofproteincomplexeswithcoreattachmentstructuresfromtandemaffinitypurificationtapdata AT wonglimsoon discoveryofproteincomplexeswithcoreattachmentstructuresfromtandemaffinitypurificationtapdata |