Metrics to estimate differential co-expression networks
Abstract Background Detecting the differences in gene expression data is important for understanding the underlying molecular mechanisms. Although the differentially expressed genes are a large component, differences in correlation are becoming an interesting approach to achieving deeper insights. H...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2017-11-01
|
Series: | BioData Mining |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s13040-017-0152-6 |
_version_ | 1798045136371843072 |
---|---|
author | Elpidio-Emmanuel Gonzalez-Valbuena Víctor Treviño |
author_facet | Elpidio-Emmanuel Gonzalez-Valbuena Víctor Treviño |
author_sort | Elpidio-Emmanuel Gonzalez-Valbuena |
collection | DOAJ |
description | Abstract Background Detecting the differences in gene expression data is important for understanding the underlying molecular mechanisms. Although the differentially expressed genes are a large component, differences in correlation are becoming an interesting approach to achieving deeper insights. However, diverse metrics have been used to detect differential correlation, making selection and use of a single metric difficult. In addition, available implementations are metric-specific, complicating their use in different contexts. Moreover, because the analyses in the literature have been performed on real data, there are uncertainties regarding the performance of metrics and procedures. Results In this work, we compare four novel and two previously proposed metrics to detect differential correlations. We generated well-controlled datasets into which differences in correlations were carefully introduced by controlled multivariate normal correlation networks and addition of noise. The comparisons were performed on three datasets derived from real tumor data. Our results show that metrics differ in their detection performance and computational time. No single metric was the best in all datasets, but trends show that three metrics are highly correlated and are very good candidates for real data analysis. In contrast, other metrics proposed in the literature seem to show low performance and different detections. Overall, our results suggest that metrics that do not filter correlations perform better. We also show an additional analysis of TCGA breast cancer subtypes. Conclusions We show a methodology to generate controlled datasets for the objective evaluation of differential correlation pipelines, and compare the performance of several metrics. We implemented in R a package called DifCoNet that can provide easy-to-use functions for differential correlation analyses. |
first_indexed | 2024-04-11T23:15:36Z |
format | Article |
id | doaj.art-aa1dd2ea12eb45828415e578edc16170 |
institution | Directory Open Access Journal |
issn | 1756-0381 |
language | English |
last_indexed | 2024-04-11T23:15:36Z |
publishDate | 2017-11-01 |
publisher | BMC |
record_format | Article |
series | BioData Mining |
spelling | doaj.art-aa1dd2ea12eb45828415e578edc161702022-12-22T03:57:37ZengBMCBioData Mining1756-03812017-11-0110111510.1186/s13040-017-0152-6Metrics to estimate differential co-expression networksElpidio-Emmanuel Gonzalez-Valbuena0Víctor Treviño1Cátedra de Bioinformática, Escuela de Medicina, Tecnológico de MonterreyCátedra de Bioinformática, Escuela de Medicina, Tecnológico de MonterreyAbstract Background Detecting the differences in gene expression data is important for understanding the underlying molecular mechanisms. Although the differentially expressed genes are a large component, differences in correlation are becoming an interesting approach to achieving deeper insights. However, diverse metrics have been used to detect differential correlation, making selection and use of a single metric difficult. In addition, available implementations are metric-specific, complicating their use in different contexts. Moreover, because the analyses in the literature have been performed on real data, there are uncertainties regarding the performance of metrics and procedures. Results In this work, we compare four novel and two previously proposed metrics to detect differential correlations. We generated well-controlled datasets into which differences in correlations were carefully introduced by controlled multivariate normal correlation networks and addition of noise. The comparisons were performed on three datasets derived from real tumor data. Our results show that metrics differ in their detection performance and computational time. No single metric was the best in all datasets, but trends show that three metrics are highly correlated and are very good candidates for real data analysis. In contrast, other metrics proposed in the literature seem to show low performance and different detections. Overall, our results suggest that metrics that do not filter correlations perform better. We also show an additional analysis of TCGA breast cancer subtypes. Conclusions We show a methodology to generate controlled datasets for the objective evaluation of differential correlation pipelines, and compare the performance of several metrics. We implemented in R a package called DifCoNet that can provide easy-to-use functions for differential correlation analyses.http://link.springer.com/article/10.1186/s13040-017-0152-6Differential correlationNetworksData simulation |
spellingShingle | Elpidio-Emmanuel Gonzalez-Valbuena Víctor Treviño Metrics to estimate differential co-expression networks BioData Mining Differential correlation Networks Data simulation |
title | Metrics to estimate differential co-expression networks |
title_full | Metrics to estimate differential co-expression networks |
title_fullStr | Metrics to estimate differential co-expression networks |
title_full_unstemmed | Metrics to estimate differential co-expression networks |
title_short | Metrics to estimate differential co-expression networks |
title_sort | metrics to estimate differential co expression networks |
topic | Differential correlation Networks Data simulation |
url | http://link.springer.com/article/10.1186/s13040-017-0152-6 |
work_keys_str_mv | AT elpidioemmanuelgonzalezvalbuena metricstoestimatedifferentialcoexpressionnetworks AT victortrevino metricstoestimatedifferentialcoexpressionnetworks |