Discovery of protein complexes with core-attachment structures from tandem affinity purification (TAP) data

Many cellular functions involve protein complexes that are formed by multiple interacting proteins. Tandem Affinity Purification (TAP) is a popular experimental method for detecting such multi-protein interactions. However, current computational methods that predict protein complexes from TAP data r...

Full description

Bibliographic Details
Main Authors: Wu, Min, Li, Xiaoli, Kwoh, Chee Keong, Ng, See-Kiong, Wong, Limsoon
Other Authors: School of Computer Engineering
Format: Journal Article
Language:English
Published: 2013
Subjects:
Online Access:https://hdl.handle.net/10356/107218
http://hdl.handle.net/10220/17736
_version_ 1811696274713346048
author Wu, Min
Li, Xiaoli
Kwoh, Chee Keong
Ng, See-Kiong
Wong, Limsoon
author2 School of Computer Engineering
author_facet School of Computer Engineering
Wu, Min
Li, Xiaoli
Kwoh, Chee Keong
Ng, See-Kiong
Wong, Limsoon
author_sort Wu, Min
collection NTU
description Many cellular functions involve protein complexes that are formed by multiple interacting proteins. Tandem Affinity Purification (TAP) is a popular experimental method for detecting such multi-protein interactions. However, current computational methods that predict protein complexes from TAP data require converting the co-complex relationships in TAP data into binary interactions. The resulting pairwise protein-protein interaction (PPI) network is then mined for densely connected regions that are identified as putative protein complexes. Converting the TAP data into PPI data not only introduces errors but also loses useful information about the underlying multi-protein relationships that can be exploited to detect the internal organization (i.e., core-attachment structures) of protein complexes. In this article, we propose a method called CACHET that detects protein complexes with Core-AttaCHment structures directly from bipartitETAP data. CACHET models the TAP data as a bipartite graph in which the two vertex sets are the baits and the preys, respectively. The edges between the two vertex sets represent bait-prey relationships. CACHET first focuses on detecting high-quality protein-complex cores from the bipartite graph. To minimize the effects of false positive interactions, the bait-prey relationships are indexed with reliability scores. Only non-redundant, reliable bicliques computed from the TAP bipartite graph are regarded as protein-complex cores. CACHET constructs protein complexes by including attachment proteins into the cores. We applied CACHET on large-scale TAP datasets and found that CACHET outperformed existing methods in terms of prediction accuracy (i.e., F-measure and functional homogeneity of predicted complexes). In addition, the protein complexes predicted by CACHET are equipped with core-attachment structures that provide useful biological insights into the inherent functional organization of protein complexes. Our supplementary material can be found at http://www1.i2r.a-star.edu.sg/xlli/CACHET/CACHET.htm; binary executables can also be found there. Supplementary Material is also available at www.liebertonline.com/cmb.
first_indexed 2024-10-01T07:36:46Z
format Journal Article
id ntu-10356/107218
institution Nanyang Technological University
language English
last_indexed 2024-10-01T07:36:46Z
publishDate 2013
record_format dspace
spelling ntu-10356/1072182022-02-16T16:31:10Z Discovery of protein complexes with core-attachment structures from tandem affinity purification (TAP) data Wu, Min Li, Xiaoli Kwoh, Chee Keong Ng, See-Kiong Wong, Limsoon School of Computer Engineering DRNTU::Engineering::Computer science and engineering::Computer applications Many cellular functions involve protein complexes that are formed by multiple interacting proteins. Tandem Affinity Purification (TAP) is a popular experimental method for detecting such multi-protein interactions. However, current computational methods that predict protein complexes from TAP data require converting the co-complex relationships in TAP data into binary interactions. The resulting pairwise protein-protein interaction (PPI) network is then mined for densely connected regions that are identified as putative protein complexes. Converting the TAP data into PPI data not only introduces errors but also loses useful information about the underlying multi-protein relationships that can be exploited to detect the internal organization (i.e., core-attachment structures) of protein complexes. In this article, we propose a method called CACHET that detects protein complexes with Core-AttaCHment structures directly from bipartitETAP data. CACHET models the TAP data as a bipartite graph in which the two vertex sets are the baits and the preys, respectively. The edges between the two vertex sets represent bait-prey relationships. CACHET first focuses on detecting high-quality protein-complex cores from the bipartite graph. To minimize the effects of false positive interactions, the bait-prey relationships are indexed with reliability scores. Only non-redundant, reliable bicliques computed from the TAP bipartite graph are regarded as protein-complex cores. CACHET constructs protein complexes by including attachment proteins into the cores. We applied CACHET on large-scale TAP datasets and found that CACHET outperformed existing methods in terms of prediction accuracy (i.e., F-measure and functional homogeneity of predicted complexes). In addition, the protein complexes predicted by CACHET are equipped with core-attachment structures that provide useful biological insights into the inherent functional organization of protein complexes. Our supplementary material can be found at http://www1.i2r.a-star.edu.sg/xlli/CACHET/CACHET.htm; binary executables can also be found there. Supplementary Material is also available at www.liebertonline.com/cmb. Published Version 2013-11-15T07:47:19Z 2019-12-06T22:26:57Z 2013-11-15T07:47:19Z 2019-12-06T22:26:57Z 2012 2012 Journal Article Wu, M., Li, X., Kwoh, C. K., Ng, S. K., & Wong, L. (2012). Discovery of protein complexes with core-attachment structures from tandem affinity purification (TAP) data. Journal of computational biology, 19(9), 1027-1042. https://hdl.handle.net/10356/107218 http://hdl.handle.net/10220/17736 10.1089/cmb.2010.0293 21777084 en Journal of computational biology © 2012 Mary Ann Liebert. This paper was published in Journal of Computational Biology and is made available as an electronic reprint (preprint) with permission of Mary Ann Liebert. The paper can be found at the following official DOI: [http://dx.doi.org/10.1089/cmb.2010.0293]. One print or electronic copy may be made for personal use only. Systematic or multiple reproduction, distribution to multiple locations via electronic or other means, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper is prohibited and is subject to penalties under law. application/pdf
spellingShingle DRNTU::Engineering::Computer science and engineering::Computer applications
Wu, Min
Li, Xiaoli
Kwoh, Chee Keong
Ng, See-Kiong
Wong, Limsoon
Discovery of protein complexes with core-attachment structures from tandem affinity purification (TAP) data
title Discovery of protein complexes with core-attachment structures from tandem affinity purification (TAP) data
title_full Discovery of protein complexes with core-attachment structures from tandem affinity purification (TAP) data
title_fullStr Discovery of protein complexes with core-attachment structures from tandem affinity purification (TAP) data
title_full_unstemmed Discovery of protein complexes with core-attachment structures from tandem affinity purification (TAP) data
title_short Discovery of protein complexes with core-attachment structures from tandem affinity purification (TAP) data
title_sort discovery of protein complexes with core attachment structures from tandem affinity purification tap data
topic DRNTU::Engineering::Computer science and engineering::Computer applications
url https://hdl.handle.net/10356/107218
http://hdl.handle.net/10220/17736
work_keys_str_mv AT wumin discoveryofproteincomplexeswithcoreattachmentstructuresfromtandemaffinitypurificationtapdata
AT lixiaoli discoveryofproteincomplexeswithcoreattachmentstructuresfromtandemaffinitypurificationtapdata
AT kwohcheekeong discoveryofproteincomplexeswithcoreattachmentstructuresfromtandemaffinitypurificationtapdata
AT ngseekiong discoveryofproteincomplexeswithcoreattachmentstructuresfromtandemaffinitypurificationtapdata
AT wonglimsoon discoveryofproteincomplexeswithcoreattachmentstructuresfromtandemaffinitypurificationtapdata