Cluster Structure Inference Based on Clustering Stability with Applications to Microarray Data Analysis

<p/> <p>This paper focuses on the stability-based approach for estimating the number of clusters <inline-formula><graphic file="1687-6180-2004-545761-i1.gif"/></inline-formula> in microarray data. The cluster stability approach amounts to performing clustering...

Full description

Bibliographic Details
Main Authors:	Giurcăneanu Ciprian Doru, Tăbuş Ioan
Format:	Article
Language:	English
Published:	SpringerOpen 2004-01-01
Series:	EURASIP Journal on Advances in Signal Processing
Subjects:	clustering stability number of clusters hierarchical clustering methods similarity indices partition-distance microarray data
Online Access:	http://dx.doi.org/10.1155/S1110865704309078

_version_	1811332812152766464
author	Giurcăneanu Ciprian Doru Tăbuş Ioan
author_facet	Giurcăneanu Ciprian Doru Tăbuş Ioan
author_sort	Giurcăneanu Ciprian Doru
collection	DOAJ
description	<p/> <p>This paper focuses on the stability-based approach for estimating the number of clusters <inline-formula><graphic file="1687-6180-2004-545761-i1.gif"/></inline-formula> in microarray data. The cluster stability approach amounts to performing clustering successively over random subsets of the available data and evaluating an index which expresses the similarity of the successive partitions obtained. We present a method for automatically estimating <inline-formula><graphic file="1687-6180-2004-545761-i2.gif"/></inline-formula> by starting from the distribution of the similarity index. We investigate how the selection of the hierarchical clustering (HC) method, respectively, the similarity index, influences the estimation accuracy. The paper introduces a new similarity index based on a partition distance. The performance of the new index and that of other well-known indices are experimentally evaluated by comparing the true data partition with the partition obtained at each level of an HC tree. A case study is conducted with a publicly available Leukemia dataset.</p>
first_indexed	2024-04-13T16:42:35Z
format	Article
id	doaj.art-396cb173935e4a2da7f118d8f22630d4
institution	Directory Open Access Journal
issn	1687-6172 1687-6180
language	English
last_indexed	2024-04-13T16:42:35Z
publishDate	2004-01-01
publisher	SpringerOpen
record_format	Article
series	EURASIP Journal on Advances in Signal Processing
spelling	doaj.art-396cb173935e4a2da7f118d8f22630d42022-12-22T02:39:10ZengSpringerOpenEURASIP Journal on Advances in Signal Processing1687-61721687-61802004-01-0120041545761Cluster Structure Inference Based on Clustering Stability with Applications to Microarray Data AnalysisGiurcăneanu Ciprian DoruTăbuş Ioan<p/> <p>This paper focuses on the stability-based approach for estimating the number of clusters <inline-formula><graphic file="1687-6180-2004-545761-i1.gif"/></inline-formula> in microarray data. The cluster stability approach amounts to performing clustering successively over random subsets of the available data and evaluating an index which expresses the similarity of the successive partitions obtained. We present a method for automatically estimating <inline-formula><graphic file="1687-6180-2004-545761-i2.gif"/></inline-formula> by starting from the distribution of the similarity index. We investigate how the selection of the hierarchical clustering (HC) method, respectively, the similarity index, influences the estimation accuracy. The paper introduces a new similarity index based on a partition distance. The performance of the new index and that of other well-known indices are experimentally evaluated by comparing the true data partition with the partition obtained at each level of an HC tree. A case study is conducted with a publicly available Leukemia dataset.</p>http://dx.doi.org/10.1155/S1110865704309078clustering stabilitynumber of clustershierarchical clustering methodssimilarity indicespartition-distancemicroarray data
spellingShingle	Giurcăneanu Ciprian Doru Tăbuş Ioan Cluster Structure Inference Based on Clustering Stability with Applications to Microarray Data Analysis EURASIP Journal on Advances in Signal Processing clustering stability number of clusters hierarchical clustering methods similarity indices partition-distance microarray data
title	Cluster Structure Inference Based on Clustering Stability with Applications to Microarray Data Analysis
title_full	Cluster Structure Inference Based on Clustering Stability with Applications to Microarray Data Analysis
title_fullStr	Cluster Structure Inference Based on Clustering Stability with Applications to Microarray Data Analysis
title_full_unstemmed	Cluster Structure Inference Based on Clustering Stability with Applications to Microarray Data Analysis
title_short	Cluster Structure Inference Based on Clustering Stability with Applications to Microarray Data Analysis
title_sort	cluster structure inference based on clustering stability with applications to microarray data analysis
topic	clustering stability number of clusters hierarchical clustering methods similarity indices partition-distance microarray data
url	http://dx.doi.org/10.1155/S1110865704309078
work_keys_str_mv	AT giurc259neanucipriandoru clusterstructureinferencebasedonclusteringstabilitywithapplicationstomicroarraydataanalysis AT t259bu351ioan clusterstructureinferencebasedonclusteringstabilitywithapplicationstomicroarraydataanalysis

Cluster Structure Inference Based on Clustering Stability with Applications to Microarray Data Analysis

Similar Items