A theoretical analysis of gene selection.

A great deal of recent research has focused on the challenging task of selecting differentially expressed genes from microarray data ('gene selection'). Numerous gene selection algorithms have been proposed in the literature, but it is often unclear exactly how these algorithms respond to...

Full description

Bibliographic Details
Main Authors: Mukherjee, S, Roberts, S
Format: Journal article
Language:English
Published: 2004
_version_ 1797073642865033216
author Mukherjee, S
Roberts, S
author_facet Mukherjee, S
Roberts, S
author_sort Mukherjee, S
collection OXFORD
description A great deal of recent research has focused on the challenging task of selecting differentially expressed genes from microarray data ('gene selection'). Numerous gene selection algorithms have been proposed in the literature, but it is often unclear exactly how these algorithms respond to conditions like small sample-sizes or differing variances. Choosing an appropriate algorithm can therefore be difficult in many cases. In this paper we propose a theoretical analysis of gene selection, in which the probability of successfully selecting relevant genes, using a given gene ranking function, is explicitly calculated in terms of population parameters. The theory developed is applicable to any ranking function which has a known sampling distribution, or one which can be approximated analytically. In contrast to empirical methods, the analysis can easily be used to examine the behaviour of gene selection algorithms under a wide variety of conditions, even when the numbers of genes involved runs into the tens of thousands. The utility of our approach is illustrated by comparing three well-known gene ranking functions.
first_indexed 2024-03-06T23:24:59Z
format Journal article
id oxford-uuid:6a0cf6ef-0093-4126-ab20-f7f611deab05
institution University of Oxford
language English
last_indexed 2024-03-06T23:24:59Z
publishDate 2004
record_format dspace
spelling oxford-uuid:6a0cf6ef-0093-4126-ab20-f7f611deab052022-03-26T18:54:58ZA theoretical analysis of gene selection.Journal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:6a0cf6ef-0093-4126-ab20-f7f611deab05EnglishSymplectic Elements at Oxford2004Mukherjee, SRoberts, SA great deal of recent research has focused on the challenging task of selecting differentially expressed genes from microarray data ('gene selection'). Numerous gene selection algorithms have been proposed in the literature, but it is often unclear exactly how these algorithms respond to conditions like small sample-sizes or differing variances. Choosing an appropriate algorithm can therefore be difficult in many cases. In this paper we propose a theoretical analysis of gene selection, in which the probability of successfully selecting relevant genes, using a given gene ranking function, is explicitly calculated in terms of population parameters. The theory developed is applicable to any ranking function which has a known sampling distribution, or one which can be approximated analytically. In contrast to empirical methods, the analysis can easily be used to examine the behaviour of gene selection algorithms under a wide variety of conditions, even when the numbers of genes involved runs into the tens of thousands. The utility of our approach is illustrated by comparing three well-known gene ranking functions.
spellingShingle Mukherjee, S
Roberts, S
A theoretical analysis of gene selection.
title A theoretical analysis of gene selection.
title_full A theoretical analysis of gene selection.
title_fullStr A theoretical analysis of gene selection.
title_full_unstemmed A theoretical analysis of gene selection.
title_short A theoretical analysis of gene selection.
title_sort theoretical analysis of gene selection
work_keys_str_mv AT mukherjees atheoreticalanalysisofgeneselection
AT robertss atheoreticalanalysisofgeneselection
AT mukherjees theoreticalanalysisofgeneselection
AT robertss theoreticalanalysisofgeneselection