Ensemble Neighborhood Search (ENS) for biclustering of gene expression microarray data and single cell RNA sequencing data
Background: Ensemble biclustering comprises a class of biclustering algorithms that generates a consensus, better-quality partition/s as output. This concept has emerged from the fusion of existing biclustering methods hybridized upon selected aspects. The design of the methodology enriches the exis...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2022-05-01
|
Series: | Journal of King Saud University: Computer and Information Sciences |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S1319157819310134 |
_version_ | 1811308924429664256 |
---|---|
author | Bhawani Sankar Biswal Anjali Mohapatra Swati Vipsita |
author_facet | Bhawani Sankar Biswal Anjali Mohapatra Swati Vipsita |
author_sort | Bhawani Sankar Biswal |
collection | DOAJ |
description | Background: Ensemble biclustering comprises a class of biclustering algorithms that generates a consensus, better-quality partition/s as output. This concept has emerged from the fusion of existing biclustering methods hybridized upon selected aspects. The design of the methodology enriches the existing methods furnishing with new properties. Usually biclustering of gene expression microarray data indulges in simultaneous clustering of the expression profiles under specific conditions and determines local two-way clustering models. In general, biclustering solutions rely upon different parameters like biclusters numbers, random initialization etc. However ensemble techniques are proposed to either reduce or eliminate the impact of such parameters on the output bicluster. Methods: In this paper, the authors propose a novel ensemble biclustering approach “Ensemble Neighborhood search (ENS)” based on the concept of neighborhood search. Simulation results verify that the proposed approach appears to be more flexible and adaptive in comparison to the existing competitive methods on high-dimensional gene expression microarray data as well as on scRNA-seq datasets. Conclusion: The performance of the proposed framework demonstrates its effectiveness with the other state-of-the-art schemes. The proposed framework is tested on five different microarray datasets and one single cell RNA sequence(scRNA-seq) dataset. Experimental results reveal that the proposed architecture achieves the prevention of unusual data loss and delivers the output refined as the per user standards. Also this framework preforms effectively on high sparsity scRNA-seq data where most of the algorithms fail to do so as these datasets contain massive zeros within. BicAT analysis of the ENS output validates ENS method as computationally effective and can be used to improve the quality of the biclusters. Finally, the results are statistically significant as shown in the ANOVA table. Hence this ENS method can be considered as a reliable framework and can be preferable over the traditional biclustering approaches to analyze the gene expression microarray data and high sparsity scRNA-seq data. The source code of the ENS algorithm can be accessed at https://github.com/c114002/Research/blob/master/ENS_Code.zip. |
first_indexed | 2024-04-13T09:33:08Z |
format | Article |
id | doaj.art-58008d46b48549a1830a294542ae4d28 |
institution | Directory Open Access Journal |
issn | 1319-1578 |
language | English |
last_indexed | 2024-04-13T09:33:08Z |
publishDate | 2022-05-01 |
publisher | Elsevier |
record_format | Article |
series | Journal of King Saud University: Computer and Information Sciences |
spelling | doaj.art-58008d46b48549a1830a294542ae4d282022-12-22T02:52:11ZengElsevierJournal of King Saud University: Computer and Information Sciences1319-15782022-05-0134522442251Ensemble Neighborhood Search (ENS) for biclustering of gene expression microarray data and single cell RNA sequencing dataBhawani Sankar Biswal0Anjali Mohapatra1Swati Vipsita2DST-FIST Bioinformatics Laboratory, Department of Computer Science and Engineering, International Institute of Information Technology, Bhubaneswar, India; Corresponding author.DST-FIST Bioinformatics Laboratory, Department of Computer Science and Engineering, International Institute of Information Technology, Bhubaneswar, IndiaDST-FIST Bioinformatics Laboratory, Department of Computer Science and Engineering, International Institute of Information Technology, Bhubaneswar, IndiaBackground: Ensemble biclustering comprises a class of biclustering algorithms that generates a consensus, better-quality partition/s as output. This concept has emerged from the fusion of existing biclustering methods hybridized upon selected aspects. The design of the methodology enriches the existing methods furnishing with new properties. Usually biclustering of gene expression microarray data indulges in simultaneous clustering of the expression profiles under specific conditions and determines local two-way clustering models. In general, biclustering solutions rely upon different parameters like biclusters numbers, random initialization etc. However ensemble techniques are proposed to either reduce or eliminate the impact of such parameters on the output bicluster. Methods: In this paper, the authors propose a novel ensemble biclustering approach “Ensemble Neighborhood search (ENS)” based on the concept of neighborhood search. Simulation results verify that the proposed approach appears to be more flexible and adaptive in comparison to the existing competitive methods on high-dimensional gene expression microarray data as well as on scRNA-seq datasets. Conclusion: The performance of the proposed framework demonstrates its effectiveness with the other state-of-the-art schemes. The proposed framework is tested on five different microarray datasets and one single cell RNA sequence(scRNA-seq) dataset. Experimental results reveal that the proposed architecture achieves the prevention of unusual data loss and delivers the output refined as the per user standards. Also this framework preforms effectively on high sparsity scRNA-seq data where most of the algorithms fail to do so as these datasets contain massive zeros within. BicAT analysis of the ENS output validates ENS method as computationally effective and can be used to improve the quality of the biclusters. Finally, the results are statistically significant as shown in the ANOVA table. Hence this ENS method can be considered as a reliable framework and can be preferable over the traditional biclustering approaches to analyze the gene expression microarray data and high sparsity scRNA-seq data. The source code of the ENS algorithm can be accessed at https://github.com/c114002/Research/blob/master/ENS_Code.zip.http://www.sciencedirect.com/science/article/pii/S1319157819310134MicroarrayEnsemble biclusteringENSNeighborhood searchscRNA-seq data |
spellingShingle | Bhawani Sankar Biswal Anjali Mohapatra Swati Vipsita Ensemble Neighborhood Search (ENS) for biclustering of gene expression microarray data and single cell RNA sequencing data Journal of King Saud University: Computer and Information Sciences Microarray Ensemble biclustering ENS Neighborhood search scRNA-seq data |
title | Ensemble Neighborhood Search (ENS) for biclustering of gene expression microarray data and single cell RNA sequencing data |
title_full | Ensemble Neighborhood Search (ENS) for biclustering of gene expression microarray data and single cell RNA sequencing data |
title_fullStr | Ensemble Neighborhood Search (ENS) for biclustering of gene expression microarray data and single cell RNA sequencing data |
title_full_unstemmed | Ensemble Neighborhood Search (ENS) for biclustering of gene expression microarray data and single cell RNA sequencing data |
title_short | Ensemble Neighborhood Search (ENS) for biclustering of gene expression microarray data and single cell RNA sequencing data |
title_sort | ensemble neighborhood search ens for biclustering of gene expression microarray data and single cell rna sequencing data |
topic | Microarray Ensemble biclustering ENS Neighborhood search scRNA-seq data |
url | http://www.sciencedirect.com/science/article/pii/S1319157819310134 |
work_keys_str_mv | AT bhawanisankarbiswal ensembleneighborhoodsearchensforbiclusteringofgeneexpressionmicroarraydataandsinglecellrnasequencingdata AT anjalimohapatra ensembleneighborhoodsearchensforbiclusteringofgeneexpressionmicroarraydataandsinglecellrnasequencingdata AT swativipsita ensembleneighborhoodsearchensforbiclusteringofgeneexpressionmicroarraydataandsinglecellrnasequencingdata |