Metrics to estimate differential co-expression networks

Abstract Background Detecting the differences in gene expression data is important for understanding the underlying molecular mechanisms. Although the differentially expressed genes are a large component, differences in correlation are becoming an interesting approach to achieving deeper insights. H...

Full description

Bibliographic Details
Main Authors: Elpidio-Emmanuel Gonzalez-Valbuena, Víctor Treviño
Format: Article
Language:English
Published: BMC 2017-11-01
Series:BioData Mining
Subjects:
Online Access:http://link.springer.com/article/10.1186/s13040-017-0152-6
_version_ 1798045136371843072
author Elpidio-Emmanuel Gonzalez-Valbuena
Víctor Treviño
author_facet Elpidio-Emmanuel Gonzalez-Valbuena
Víctor Treviño
author_sort Elpidio-Emmanuel Gonzalez-Valbuena
collection DOAJ
description Abstract Background Detecting the differences in gene expression data is important for understanding the underlying molecular mechanisms. Although the differentially expressed genes are a large component, differences in correlation are becoming an interesting approach to achieving deeper insights. However, diverse metrics have been used to detect differential correlation, making selection and use of a single metric difficult. In addition, available implementations are metric-specific, complicating their use in different contexts. Moreover, because the analyses in the literature have been performed on real data, there are uncertainties regarding the performance of metrics and procedures. Results In this work, we compare four novel and two previously proposed metrics to detect differential correlations. We generated well-controlled datasets into which differences in correlations were carefully introduced by controlled multivariate normal correlation networks and addition of noise. The comparisons were performed on three datasets derived from real tumor data. Our results show that metrics differ in their detection performance and computational time. No single metric was the best in all datasets, but trends show that three metrics are highly correlated and are very good candidates for real data analysis. In contrast, other metrics proposed in the literature seem to show low performance and different detections. Overall, our results suggest that metrics that do not filter correlations perform better. We also show an additional analysis of TCGA breast cancer subtypes. Conclusions We show a methodology to generate controlled datasets for the objective evaluation of differential correlation pipelines, and compare the performance of several metrics. We implemented in R a package called DifCoNet that can provide easy-to-use functions for differential correlation analyses.
first_indexed 2024-04-11T23:15:36Z
format Article
id doaj.art-aa1dd2ea12eb45828415e578edc16170
institution Directory Open Access Journal
issn 1756-0381
language English
last_indexed 2024-04-11T23:15:36Z
publishDate 2017-11-01
publisher BMC
record_format Article
series BioData Mining
spelling doaj.art-aa1dd2ea12eb45828415e578edc161702022-12-22T03:57:37ZengBMCBioData Mining1756-03812017-11-0110111510.1186/s13040-017-0152-6Metrics to estimate differential co-expression networksElpidio-Emmanuel Gonzalez-Valbuena0Víctor Treviño1Cátedra de Bioinformática, Escuela de Medicina, Tecnológico de MonterreyCátedra de Bioinformática, Escuela de Medicina, Tecnológico de MonterreyAbstract Background Detecting the differences in gene expression data is important for understanding the underlying molecular mechanisms. Although the differentially expressed genes are a large component, differences in correlation are becoming an interesting approach to achieving deeper insights. However, diverse metrics have been used to detect differential correlation, making selection and use of a single metric difficult. In addition, available implementations are metric-specific, complicating their use in different contexts. Moreover, because the analyses in the literature have been performed on real data, there are uncertainties regarding the performance of metrics and procedures. Results In this work, we compare four novel and two previously proposed metrics to detect differential correlations. We generated well-controlled datasets into which differences in correlations were carefully introduced by controlled multivariate normal correlation networks and addition of noise. The comparisons were performed on three datasets derived from real tumor data. Our results show that metrics differ in their detection performance and computational time. No single metric was the best in all datasets, but trends show that three metrics are highly correlated and are very good candidates for real data analysis. In contrast, other metrics proposed in the literature seem to show low performance and different detections. Overall, our results suggest that metrics that do not filter correlations perform better. We also show an additional analysis of TCGA breast cancer subtypes. Conclusions We show a methodology to generate controlled datasets for the objective evaluation of differential correlation pipelines, and compare the performance of several metrics. We implemented in R a package called DifCoNet that can provide easy-to-use functions for differential correlation analyses.http://link.springer.com/article/10.1186/s13040-017-0152-6Differential correlationNetworksData simulation
spellingShingle Elpidio-Emmanuel Gonzalez-Valbuena
Víctor Treviño
Metrics to estimate differential co-expression networks
BioData Mining
Differential correlation
Networks
Data simulation
title Metrics to estimate differential co-expression networks
title_full Metrics to estimate differential co-expression networks
title_fullStr Metrics to estimate differential co-expression networks
title_full_unstemmed Metrics to estimate differential co-expression networks
title_short Metrics to estimate differential co-expression networks
title_sort metrics to estimate differential co expression networks
topic Differential correlation
Networks
Data simulation
url http://link.springer.com/article/10.1186/s13040-017-0152-6
work_keys_str_mv AT elpidioemmanuelgonzalezvalbuena metricstoestimatedifferentialcoexpressionnetworks
AT victortrevino metricstoestimatedifferentialcoexpressionnetworks