Integrated simultaneous analysis of different biomedical data types with exact weighted bi-cluster editing

The explosion of biological data has strongly influenced the focus of today’s biology research. Integrating and analysing large quantities of data to provide meaningful insights has become the main challenge for biologists and bioinformaticians. One major problem is the combined data analysis of data fr...


Bibliographic Details
Main Authors: Sun Peng, Guo Jiong, Baumbach Jan
Format: Article
Language: English
Published: De Gruyter 2012-06-01
Series: Journal of Integrative Bioinformatics
Online Access: https://doi.org/10.1515/jib-2012-197
author Sun Peng
Guo Jiong
Baumbach Jan
collection DOAJ
description The explosion of biological data has strongly influenced the focus of today’s biology research. Integrating and analysing large quantities of data to provide meaningful insights has become the main challenge for biologists and bioinformaticians. One major problem is the combined analysis of data of different types, such as phenotypes and genotypes. Such data is modelled as bi-partite graphs in which nodes correspond to the different data points, for instance mutations and diseases, and weighted edges represent associations between them. Bi-clustering is a special case of clustering designed for partitioning two different types of data simultaneously. We present a bi-clustering approach that solves the NP-hard weighted bi-cluster editing problem by transforming a given bi-partite graph into a disjoint union of bi-cliques. We contribute an exact algorithm based on fixed-parameter tractability. We first evaluated its performance on artificial graphs. Afterwards, as an example, we applied our Java implementation to data from genome-wide association studies (GWAS), aiming to discover new, previously unobserved geno-to-pheno associations. We believe that our results will serve as guidelines for further wet lab investigations. In general, our software can be applied to any kind of data that can be modelled as bi-partite graphs. To our knowledge, it is the fastest exact method for the weighted bi-cluster editing problem.
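The weighted bi-cluster editing objective described above can be illustrated with a minimal Java sketch. It is not the authors' fixed-parameter algorithm: it is a plain exhaustive search over all ways of grouping the nodes, usable only on toy instances. The class name, method names, and the weight convention (a positive weight is the cost of deleting an existing edge, a negative weight is the cost of inserting a missing one) are assumptions made for this illustration.

```java
import java.util.Arrays;

// Illustrative sketch only (hypothetical names); brute force, NOT the paper's FPT method.
final class BiClusterEditingSketch {

    // Cost of editing the graph into the bi-cliques implied by the cluster labels.
    // Assumed convention: w[u][v] > 0 means edge present (deletion costs w[u][v]),
    // w[u][v] < 0 means edge absent (insertion costs -w[u][v]).
    static double editingCost(double[][] w, int[] leftLabel, int[] rightLabel) {
        double cost = 0.0;
        for (int u = 0; u < w.length; u++) {
            for (int v = 0; v < w[u].length; v++) {
                boolean sameCluster = leftLabel[u] == rightLabel[v];
                if (sameCluster && w[u][v] < 0) {
                    cost += -w[u][v];          // insert the missing edge
                } else if (!sameCluster && w[u][v] > 0) {
                    cost += w[u][v];           // delete the existing edge
                }
            }
        }
        return cost;
    }

    // Enumerates every partition of the left and right nodes exactly once
    // (a node may join an existing cluster or open a new one) and keeps the cheapest.
    static double search(double[][] w, int[] labels, int pos, int usedLabels) {
        int n = w.length;
        if (pos == labels.length) {
            int[] left = Arrays.copyOfRange(labels, 0, n);
            int[] right = Arrays.copyOfRange(labels, n, labels.length);
            return editingCost(w, left, right);
        }
        double best = Double.POSITIVE_INFINITY;
        for (int c = 0; c <= usedLabels; c++) {
            labels[pos] = c;
            int used = Math.max(usedLabels, c + 1);
            best = Math.min(best, search(w, labels, pos + 1, used));
        }
        return best;
    }

    // Minimum total editing cost for a small weight matrix (rows: left nodes, columns: right nodes).
    static double solve(double[][] w) {
        int total = w.length + w[0].length;
        return search(w, new int[total], 0, 0);
    }

    public static void main(String[] args) {
        // Toy instance: edges u0-v0, u0-v1, u1-v0 exist; u1-v1 is cheap to insert; v2 is unrelated.
        double[][] w = { {2, 2, -3}, {2, -1, -3} };
        System.out.println("minimum editing cost = " + solve(w));
    }
}
```

For the toy matrix in `main`, the cheapest solution inserts the single missing edge between the second left node and the second right node, giving a cost of 1.0 and one bi-clique plus one isolated node. The search space grows with the number of set partitions of all nodes, which is why an exact method for realistic inputs needs something like the fixed-parameter approach the abstract refers to.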
first_indexed 2024-12-16T07:45:41Z
format Article
id doaj.art-d14562d1129641e39a31ddd0b591a334
institution Directory Open Access Journal
issn 1613-4516
language English
last_indexed 2024-12-16T07:45:41Z
publishDate 2012-06-01
publisher De Gruyter
record_format Article
series Journal of Integrative Bioinformatics
spelling doaj.art-d14562d1129641e39a31ddd0b591a334 2022-12-21T22:38:59Z eng De Gruyter Journal of Integrative Bioinformatics 1613-4516 2012-06-01 925367 10.1515/jib-2012-197 biecoll-jib-2012-197
Integrated simultaneous analysis of different biomedical data types with exact weighted bi-cluster editing
Sun Peng 0
Guo Jiong 1
Baumbach Jan 2
0 Computational Systems Biology group, Max Planck Institute for Informatics, Campus E1.4, 66123 Saarbrücken, Germany
1 Cluster of Excellence for Multimodal Computing and Interaction, Saarland University, Campus E1.7, 66123 Saarbrücken, Germany
2 Computational Systems Biology group, Max Planck Institute for Informatics, Campus E1.4, 66123 Saarbrücken, Germany
https://doi.org/10.1515/jib-2012-197
title Integrated simultaneous analysis of different biomedical data types with exact weighted bi-cluster editing
url https://doi.org/10.1515/jib-2012-197