An introduction to gene expression data clustering algorithims

DNA microarray, also called oligonucleotide array, gene array, DNA chip, gene microarray, or genome chip, is a high-throughput technology that allows detection of expression of thousands of genes simultaneously. Among a variety of its experimental protocols, one is to first fix thousands of DNA pro...

Full description

Bibliographic Details
Main Author:	Guo, Ling Qiong
Other Authors:	Chen Xin
Format:	Thesis
Language:	English
Published:	2010
Subjects:	DRNTU::Science::Mathematics::Applied mathematics
Online Access:	http://hdl.handle.net/10356/41849

_version_	1811691828621082624
author	Guo, Ling Qiong
author2	Chen Xin
author_facet	Chen Xin Guo, Ling Qiong
author_sort	Guo, Ling Qiong
collection	NTU
description	DNA microarray, also called oligonucleotide array, gene array, DNA chip, gene microarray, or genome chip, is a high-throughput technology that allows detection of expression of thousands of genes simultaneously. Among a variety of its experimental protocols, one is to first fix thousands of DNA probes on an objective buttress by situ-synthesis or micro-print to produce a two-dimension DNA probe matrix, and then hybridize these DNA probes with a biological (cDNA or cRNA) sample. Microarray expression data is hence collected by measuring the relative abundances of the hybridized probes, to quantify the activities of genes under a biological experiment. Therefore, as commonly believed, the biological signals of genes are hidden in microarray expression data. However, detecting these hidden biological signals presents a big challenge to the microarray research community. Cluster analysis is one of analytical techniques to detect the biological signals hidden in microarray expression data. Given an expression dataset, it aims to find groups of genes whose expression profiles are similar to each other within a group while dissimilar between groups. Clustering often serves as a very preliminary step in gene expression analysis, and allows us to gain valuable insights into the underlying biological mechanism. Clustering is a hard while well-studied problem. There are many algorithms that have been developed in the literature, some of which are intended in particular for gene expression data clustering. The main goal of this thesis is to discuss many popular clustering algorithms which have been extensively applied to microarray expression data for biological discovery, including hierarchical clustering, K-means clustering, SOM-based clustering and model-based clustering.
first_indexed	2024-10-01T06:26:06Z
format	Thesis
id	ntu-10356/41849
institution	Nanyang Technological University
language	English
last_indexed	2024-10-01T06:26:06Z
publishDate	2010
record_format	dspace
spelling	ntu-10356/418492023-02-28T23:50:08Z An introduction to gene expression data clustering algorithims Guo, Ling Qiong Chen Xin School of Physical and Mathematical Sciences DRNTU::Science::Mathematics::Applied mathematics DNA microarray, also called oligonucleotide array, gene array, DNA chip, gene microarray, or genome chip, is a high-throughput technology that allows detection of expression of thousands of genes simultaneously. Among a variety of its experimental protocols, one is to first fix thousands of DNA probes on an objective buttress by situ-synthesis or micro-print to produce a two-dimension DNA probe matrix, and then hybridize these DNA probes with a biological (cDNA or cRNA) sample. Microarray expression data is hence collected by measuring the relative abundances of the hybridized probes, to quantify the activities of genes under a biological experiment. Therefore, as commonly believed, the biological signals of genes are hidden in microarray expression data. However, detecting these hidden biological signals presents a big challenge to the microarray research community. Cluster analysis is one of analytical techniques to detect the biological signals hidden in microarray expression data. Given an expression dataset, it aims to find groups of genes whose expression profiles are similar to each other within a group while dissimilar between groups. Clustering often serves as a very preliminary step in gene expression analysis, and allows us to gain valuable insights into the underlying biological mechanism. Clustering is a hard while well-studied problem. There are many algorithms that have been developed in the literature, some of which are intended in particular for gene expression data clustering. The main goal of this thesis is to discuss many popular clustering algorithms which have been extensively applied to microarray expression data for biological discovery, including hierarchical clustering, K-means clustering, SOM-based clustering and model-based clustering. Master of Science 2010-08-18T07:33:22Z 2010-08-18T07:33:22Z 2008 2008 Thesis http://hdl.handle.net/10356/41849 en 63 p. application/pdf
spellingShingle	DRNTU::Science::Mathematics::Applied mathematics Guo, Ling Qiong An introduction to gene expression data clustering algorithims
title	An introduction to gene expression data clustering algorithims
title_full	An introduction to gene expression data clustering algorithims
title_fullStr	An introduction to gene expression data clustering algorithims
title_full_unstemmed	An introduction to gene expression data clustering algorithims
title_short	An introduction to gene expression data clustering algorithims
title_sort	introduction to gene expression data clustering algorithims
topic	DRNTU::Science::Mathematics::Applied mathematics
url	http://hdl.handle.net/10356/41849
work_keys_str_mv	AT guolingqiong anintroductiontogeneexpressiondataclusteringalgorithims AT guolingqiong introductiontogeneexpressiondataclusteringalgorithims

An introduction to gene expression data clustering algorithims

Similar Items