Ensemble Neighborhood Search (ENS) for biclustering of gene expression microarray data and single cell RNA sequencing data

Background: Ensemble biclustering comprises a class of biclustering algorithms that generates a consensus, better-quality partition/s as output. This concept has emerged from the fusion of existing biclustering methods hybridized upon selected aspects. The design of the methodology enriches the exis...

Full description

Bibliographic Details
Main Authors: Bhawani Sankar Biswal, Anjali Mohapatra, Swati Vipsita
Format: Article
Language:English
Published: Elsevier 2022-05-01
Series:Journal of King Saud University: Computer and Information Sciences
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1319157819310134
_version_ 1811308924429664256
author Bhawani Sankar Biswal
Anjali Mohapatra
Swati Vipsita
author_facet Bhawani Sankar Biswal
Anjali Mohapatra
Swati Vipsita
author_sort Bhawani Sankar Biswal
collection DOAJ
description Background: Ensemble biclustering comprises a class of biclustering algorithms that generates a consensus, better-quality partition/s as output. This concept has emerged from the fusion of existing biclustering methods hybridized upon selected aspects. The design of the methodology enriches the existing methods furnishing with new properties. Usually biclustering of gene expression microarray data indulges in simultaneous clustering of the expression profiles under specific conditions and determines local two-way clustering models. In general, biclustering solutions rely upon different parameters like biclusters numbers, random initialization etc. However ensemble techniques are proposed to either reduce or eliminate the impact of such parameters on the output bicluster. Methods: In this paper, the authors propose a novel ensemble biclustering approach “Ensemble Neighborhood search (ENS)” based on the concept of neighborhood search. Simulation results verify that the proposed approach appears to be more flexible and adaptive in comparison to the existing competitive methods on high-dimensional gene expression microarray data as well as on scRNA-seq datasets. Conclusion: The performance of the proposed framework demonstrates its effectiveness with the other state-of-the-art schemes. The proposed framework is tested on five different microarray datasets and one single cell RNA sequence(scRNA-seq) dataset. Experimental results reveal that the proposed architecture achieves the prevention of unusual data loss and delivers the output refined as the per user standards. Also this framework preforms effectively on high sparsity scRNA-seq data where most of the algorithms fail to do so as these datasets contain massive zeros within. BicAT analysis of the ENS output validates ENS method as computationally effective and can be used to improve the quality of the biclusters. Finally, the results are statistically significant as shown in the ANOVA table. Hence this ENS method can be considered as a reliable framework and can be preferable over the traditional biclustering approaches to analyze the gene expression microarray data and high sparsity scRNA-seq data. The source code of the ENS algorithm can be accessed at  https://github.com/c114002/Research/blob/master/ENS_Code.zip.
first_indexed 2024-04-13T09:33:08Z
format Article
id doaj.art-58008d46b48549a1830a294542ae4d28
institution Directory Open Access Journal
issn 1319-1578
language English
last_indexed 2024-04-13T09:33:08Z
publishDate 2022-05-01
publisher Elsevier
record_format Article
series Journal of King Saud University: Computer and Information Sciences
spelling doaj.art-58008d46b48549a1830a294542ae4d282022-12-22T02:52:11ZengElsevierJournal of King Saud University: Computer and Information Sciences1319-15782022-05-0134522442251Ensemble Neighborhood Search (ENS) for biclustering of gene expression microarray data and single cell RNA sequencing dataBhawani Sankar Biswal0Anjali Mohapatra1Swati Vipsita2DST-FIST Bioinformatics Laboratory, Department of Computer Science and Engineering, International Institute of Information Technology, Bhubaneswar, India; Corresponding author.DST-FIST Bioinformatics Laboratory, Department of Computer Science and Engineering, International Institute of Information Technology, Bhubaneswar, IndiaDST-FIST Bioinformatics Laboratory, Department of Computer Science and Engineering, International Institute of Information Technology, Bhubaneswar, IndiaBackground: Ensemble biclustering comprises a class of biclustering algorithms that generates a consensus, better-quality partition/s as output. This concept has emerged from the fusion of existing biclustering methods hybridized upon selected aspects. The design of the methodology enriches the existing methods furnishing with new properties. Usually biclustering of gene expression microarray data indulges in simultaneous clustering of the expression profiles under specific conditions and determines local two-way clustering models. In general, biclustering solutions rely upon different parameters like biclusters numbers, random initialization etc. However ensemble techniques are proposed to either reduce or eliminate the impact of such parameters on the output bicluster. Methods: In this paper, the authors propose a novel ensemble biclustering approach “Ensemble Neighborhood search (ENS)” based on the concept of neighborhood search. Simulation results verify that the proposed approach appears to be more flexible and adaptive in comparison to the existing competitive methods on high-dimensional gene expression microarray data as well as on scRNA-seq datasets. Conclusion: The performance of the proposed framework demonstrates its effectiveness with the other state-of-the-art schemes. The proposed framework is tested on five different microarray datasets and one single cell RNA sequence(scRNA-seq) dataset. Experimental results reveal that the proposed architecture achieves the prevention of unusual data loss and delivers the output refined as the per user standards. Also this framework preforms effectively on high sparsity scRNA-seq data where most of the algorithms fail to do so as these datasets contain massive zeros within. BicAT analysis of the ENS output validates ENS method as computationally effective and can be used to improve the quality of the biclusters. Finally, the results are statistically significant as shown in the ANOVA table. Hence this ENS method can be considered as a reliable framework and can be preferable over the traditional biclustering approaches to analyze the gene expression microarray data and high sparsity scRNA-seq data. The source code of the ENS algorithm can be accessed at  https://github.com/c114002/Research/blob/master/ENS_Code.zip.http://www.sciencedirect.com/science/article/pii/S1319157819310134MicroarrayEnsemble biclusteringENSNeighborhood searchscRNA-seq data
spellingShingle Bhawani Sankar Biswal
Anjali Mohapatra
Swati Vipsita
Ensemble Neighborhood Search (ENS) for biclustering of gene expression microarray data and single cell RNA sequencing data
Journal of King Saud University: Computer and Information Sciences
Microarray
Ensemble biclustering
ENS
Neighborhood search
scRNA-seq data
title Ensemble Neighborhood Search (ENS) for biclustering of gene expression microarray data and single cell RNA sequencing data
title_full Ensemble Neighborhood Search (ENS) for biclustering of gene expression microarray data and single cell RNA sequencing data
title_fullStr Ensemble Neighborhood Search (ENS) for biclustering of gene expression microarray data and single cell RNA sequencing data
title_full_unstemmed Ensemble Neighborhood Search (ENS) for biclustering of gene expression microarray data and single cell RNA sequencing data
title_short Ensemble Neighborhood Search (ENS) for biclustering of gene expression microarray data and single cell RNA sequencing data
title_sort ensemble neighborhood search ens for biclustering of gene expression microarray data and single cell rna sequencing data
topic Microarray
Ensemble biclustering
ENS
Neighborhood search
scRNA-seq data
url http://www.sciencedirect.com/science/article/pii/S1319157819310134
work_keys_str_mv AT bhawanisankarbiswal ensembleneighborhoodsearchensforbiclusteringofgeneexpressionmicroarraydataandsinglecellrnasequencingdata
AT anjalimohapatra ensembleneighborhoodsearchensforbiclusteringofgeneexpressionmicroarraydataandsinglecellrnasequencingdata
AT swativipsita ensembleneighborhoodsearchensforbiclusteringofgeneexpressionmicroarraydataandsinglecellrnasequencingdata