Detecting overlapping protein complexes by rough-fuzzy clustering in protein-protein interaction networks.

In this paper, we present a novel rough-fuzzy clustering (RFC) method to detect overlapping protein complexes in protein-protein interaction (PPI) networks. RFC focuses on fuzzy relation model rather than graph model by integrating fuzzy sets and rough sets, employs the upper and lower approximation...

Full description

Bibliographic Details
Main Authors: Hao Wu, Lin Gao, Jihua Dong, Xiaofei Yang
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3958373?pdf=render
_version_ 1819132738295300096
author Hao Wu
Lin Gao
Jihua Dong
Xiaofei Yang
author_facet Hao Wu
Lin Gao
Jihua Dong
Xiaofei Yang
author_sort Hao Wu
collection DOAJ
description In this paper, we present a novel rough-fuzzy clustering (RFC) method to detect overlapping protein complexes in protein-protein interaction (PPI) networks. RFC focuses on fuzzy relation model rather than graph model by integrating fuzzy sets and rough sets, employs the upper and lower approximations of rough sets to deal with overlapping complexes, and calculates the number of complexes automatically. Fuzzy relation between proteins is established and then transformed into fuzzy equivalence relation. Non-overlapping complexes correspond to equivalence classes satisfying certain equivalence relation. To obtain overlapping complexes, we calculate the similarity between one protein and each complex, and then determine whether the protein belongs to one or multiple complexes by computing the ratio of each similarity to maximum similarity. To validate RFC quantitatively, we test it in Gavin, Collins, Krogan and BioGRID datasets. Experiment results show that there is a good correspondence to reference complexes in MIPS and SGD databases. Then we compare RFC with several previous methods, including ClusterONE, CMC, MCL, GCE, OSLOM and CFinder. Results show the precision, sensitivity and separation are 32.4%, 42.9% and 81.9% higher than mean of the five methods in four weighted networks, and are 0.5%, 11.2% and 66.1% higher than mean of the six methods in five unweighted networks. Our method RFC works well for protein complexes detection and provides a new insight of network division, and it can also be applied to identify overlapping community structure in social networks and LFR benchmark networks.
first_indexed 2024-12-22T09:36:10Z
format Article
id doaj.art-eaced5c26b7645148c8e8e000408f4d7
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-22T09:36:10Z
publishDate 2014-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-eaced5c26b7645148c8e8e000408f4d72022-12-21T18:30:51ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0193e9185610.1371/journal.pone.0091856Detecting overlapping protein complexes by rough-fuzzy clustering in protein-protein interaction networks.Hao WuLin GaoJihua DongXiaofei YangIn this paper, we present a novel rough-fuzzy clustering (RFC) method to detect overlapping protein complexes in protein-protein interaction (PPI) networks. RFC focuses on fuzzy relation model rather than graph model by integrating fuzzy sets and rough sets, employs the upper and lower approximations of rough sets to deal with overlapping complexes, and calculates the number of complexes automatically. Fuzzy relation between proteins is established and then transformed into fuzzy equivalence relation. Non-overlapping complexes correspond to equivalence classes satisfying certain equivalence relation. To obtain overlapping complexes, we calculate the similarity between one protein and each complex, and then determine whether the protein belongs to one or multiple complexes by computing the ratio of each similarity to maximum similarity. To validate RFC quantitatively, we test it in Gavin, Collins, Krogan and BioGRID datasets. Experiment results show that there is a good correspondence to reference complexes in MIPS and SGD databases. Then we compare RFC with several previous methods, including ClusterONE, CMC, MCL, GCE, OSLOM and CFinder. Results show the precision, sensitivity and separation are 32.4%, 42.9% and 81.9% higher than mean of the five methods in four weighted networks, and are 0.5%, 11.2% and 66.1% higher than mean of the six methods in five unweighted networks. Our method RFC works well for protein complexes detection and provides a new insight of network division, and it can also be applied to identify overlapping community structure in social networks and LFR benchmark networks.http://europepmc.org/articles/PMC3958373?pdf=render
spellingShingle Hao Wu
Lin Gao
Jihua Dong
Xiaofei Yang
Detecting overlapping protein complexes by rough-fuzzy clustering in protein-protein interaction networks.
PLoS ONE
title Detecting overlapping protein complexes by rough-fuzzy clustering in protein-protein interaction networks.
title_full Detecting overlapping protein complexes by rough-fuzzy clustering in protein-protein interaction networks.
title_fullStr Detecting overlapping protein complexes by rough-fuzzy clustering in protein-protein interaction networks.
title_full_unstemmed Detecting overlapping protein complexes by rough-fuzzy clustering in protein-protein interaction networks.
title_short Detecting overlapping protein complexes by rough-fuzzy clustering in protein-protein interaction networks.
title_sort detecting overlapping protein complexes by rough fuzzy clustering in protein protein interaction networks
url http://europepmc.org/articles/PMC3958373?pdf=render
work_keys_str_mv AT haowu detectingoverlappingproteincomplexesbyroughfuzzyclusteringinproteinproteininteractionnetworks
AT lingao detectingoverlappingproteincomplexesbyroughfuzzyclusteringinproteinproteininteractionnetworks
AT jihuadong detectingoverlappingproteincomplexesbyroughfuzzyclusteringinproteinproteininteractionnetworks
AT xiaofeiyang detectingoverlappingproteincomplexesbyroughfuzzyclusteringinproteinproteininteractionnetworks