DiffSLC: A graph centrality method to detect essential proteins of a protein-protein interaction network.

Identification of central genes and proteins in biomolecular networks provides credible candidates for pathway analysis, functional analysis, and essentiality prediction. The DiffSLC centrality measure predicts central and essential genes and proteins using a protein-protein interaction network. Net...

Full description

Bibliographic Details
Main Authors: Divya Mistry, Roger P Wise, Julie A Dickerson
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2017-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC5679606?pdf=render
_version_ 1811327707086061568
author Divya Mistry
Roger P Wise
Julie A Dickerson
author_facet Divya Mistry
Roger P Wise
Julie A Dickerson
author_sort Divya Mistry
collection DOAJ
description Identification of central genes and proteins in biomolecular networks provides credible candidates for pathway analysis, functional analysis, and essentiality prediction. The DiffSLC centrality measure predicts central and essential genes and proteins using a protein-protein interaction network. Network centrality measures prioritize nodes and edges based on their importance to the network topology. These measures helped identify critical genes and proteins in biomolecular networks. The proposed centrality measure, DiffSLC, combines the number of interactions of a protein and the gene coexpression values of genes from which those proteins were translated, as a weighting factor to bias the identification of essential proteins in a protein interaction network. Potentially essential proteins with low node degree are promoted through eigenvector centrality. Thus, the gene coexpression values are used in conjunction with the eigenvector of the network's adjacency matrix and edge clustering coefficient to improve essentiality prediction. The outcome of this prediction is shown using three variations: (1) inclusion or exclusion of gene co-expression data, (2) impact of different coexpression measures, and (3) impact of different gene expression data sets. For a total of seven networks, DiffSLC is compared to other centrality measures using Saccharomyces cerevisiae protein interaction networks and gene expression data. Comparisons are also performed for the top ranked proteins against the known essential genes from the Saccharomyces Gene Deletion Project, which show that DiffSLC detects more essential proteins and has a higher area under the ROC curve than other compared methods. This makes DiffSLC a stronger alternative to other centrality methods for detecting essential genes using a protein-protein interaction network that obeys centrality-lethality principle. DiffSLC is implemented using the igraph package in R, and networkx package in Python. The python package can be obtained from git.io/diffslcpy. The R implementation and code to reproduce the analysis is available via git.io/diffslc.
first_indexed 2024-04-13T15:12:47Z
format Article
id doaj.art-eac45901d09c474a957134f4ff85fd1c
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-04-13T15:12:47Z
publishDate 2017-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-eac45901d09c474a957134f4ff85fd1c2022-12-22T02:41:58ZengPublic Library of Science (PLoS)PLoS ONE1932-62032017-01-011211e018709110.1371/journal.pone.0187091DiffSLC: A graph centrality method to detect essential proteins of a protein-protein interaction network.Divya MistryRoger P WiseJulie A DickersonIdentification of central genes and proteins in biomolecular networks provides credible candidates for pathway analysis, functional analysis, and essentiality prediction. The DiffSLC centrality measure predicts central and essential genes and proteins using a protein-protein interaction network. Network centrality measures prioritize nodes and edges based on their importance to the network topology. These measures helped identify critical genes and proteins in biomolecular networks. The proposed centrality measure, DiffSLC, combines the number of interactions of a protein and the gene coexpression values of genes from which those proteins were translated, as a weighting factor to bias the identification of essential proteins in a protein interaction network. Potentially essential proteins with low node degree are promoted through eigenvector centrality. Thus, the gene coexpression values are used in conjunction with the eigenvector of the network's adjacency matrix and edge clustering coefficient to improve essentiality prediction. The outcome of this prediction is shown using three variations: (1) inclusion or exclusion of gene co-expression data, (2) impact of different coexpression measures, and (3) impact of different gene expression data sets. For a total of seven networks, DiffSLC is compared to other centrality measures using Saccharomyces cerevisiae protein interaction networks and gene expression data. Comparisons are also performed for the top ranked proteins against the known essential genes from the Saccharomyces Gene Deletion Project, which show that DiffSLC detects more essential proteins and has a higher area under the ROC curve than other compared methods. This makes DiffSLC a stronger alternative to other centrality methods for detecting essential genes using a protein-protein interaction network that obeys centrality-lethality principle. DiffSLC is implemented using the igraph package in R, and networkx package in Python. The python package can be obtained from git.io/diffslcpy. The R implementation and code to reproduce the analysis is available via git.io/diffslc.http://europepmc.org/articles/PMC5679606?pdf=render
spellingShingle Divya Mistry
Roger P Wise
Julie A Dickerson
DiffSLC: A graph centrality method to detect essential proteins of a protein-protein interaction network.
PLoS ONE
title DiffSLC: A graph centrality method to detect essential proteins of a protein-protein interaction network.
title_full DiffSLC: A graph centrality method to detect essential proteins of a protein-protein interaction network.
title_fullStr DiffSLC: A graph centrality method to detect essential proteins of a protein-protein interaction network.
title_full_unstemmed DiffSLC: A graph centrality method to detect essential proteins of a protein-protein interaction network.
title_short DiffSLC: A graph centrality method to detect essential proteins of a protein-protein interaction network.
title_sort diffslc a graph centrality method to detect essential proteins of a protein protein interaction network
url http://europepmc.org/articles/PMC5679606?pdf=render
work_keys_str_mv AT divyamistry diffslcagraphcentralitymethodtodetectessentialproteinsofaproteinproteininteractionnetwork
AT rogerpwise diffslcagraphcentralitymethodtodetectessentialproteinsofaproteinproteininteractionnetwork
AT julieadickerson diffslcagraphcentralitymethodtodetectessentialproteinsofaproteinproteininteractionnetwork