An improved procedure for gene selection from microarray experiments using false discovery rate criterion

<p>Abstract</p> <p>Background</p> <p>A large number of genes usually show differential expressions in a microarray experiment with two types of tissues, and the <it>p</it>-values of a proper statistical test are often used to quantify the significance of the...

Full description

Bibliographic Details
Main Authors: Yang Mark CK, Yang James J
Format: Article
Language:English
Published: BMC 2006-01-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/7/15
_version_ 1818857100497911808
author Yang Mark CK
Yang James J
author_facet Yang Mark CK
Yang James J
author_sort Yang Mark CK
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>A large number of genes usually show differential expressions in a microarray experiment with two types of tissues, and the <it>p</it>-values of a proper statistical test are often used to quantify the significance of these differences. The genes with small <it>p</it>-values are then picked as the genes responsible for the differences in the tissue RNA expressions. One key question is what should be the threshold to consider the <it>p</it>-values small. There is always a trade off between this threshold and the rate of false claims. Recent statistical literature shows that the false discovery rate (FDR) criterion is a powerful and reasonable criterion to pick those genes with differential expression. Moreover, the power of detection can be increased by knowing the number of non-differential expression genes. While this number is unknown in practice, there are methods to estimate it from data. The purpose of this paper is to present a new method of estimating this number and use it for the FDR procedure construction.</p> <p>Results</p> <p>A combination of test functions is used to estimate the number of differentially expressed genes. Simulation study shows that the proposed method has a higher power to detect these genes than other existing methods, while still keeping the FDR under control. The improvement can be substantial if the proportion of true differentially expressed genes is large. This procedure has also been tested with good results using a real dataset.</p> <p>Conclusion</p> <p>For a given expected FDR, the method proposed in this paper has better power to pick genes that show differentiation in their expression than two other well known methods.</p>
first_indexed 2024-12-19T08:35:01Z
format Article
id doaj.art-138eed9df2e84851990034f97ac20c82
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-12-19T08:35:01Z
publishDate 2006-01-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-138eed9df2e84851990034f97ac20c822022-12-21T20:29:04ZengBMCBMC Bioinformatics1471-21052006-01-01711510.1186/1471-2105-7-15An improved procedure for gene selection from microarray experiments using false discovery rate criterionYang Mark CKYang James J<p>Abstract</p> <p>Background</p> <p>A large number of genes usually show differential expressions in a microarray experiment with two types of tissues, and the <it>p</it>-values of a proper statistical test are often used to quantify the significance of these differences. The genes with small <it>p</it>-values are then picked as the genes responsible for the differences in the tissue RNA expressions. One key question is what should be the threshold to consider the <it>p</it>-values small. There is always a trade off between this threshold and the rate of false claims. Recent statistical literature shows that the false discovery rate (FDR) criterion is a powerful and reasonable criterion to pick those genes with differential expression. Moreover, the power of detection can be increased by knowing the number of non-differential expression genes. While this number is unknown in practice, there are methods to estimate it from data. The purpose of this paper is to present a new method of estimating this number and use it for the FDR procedure construction.</p> <p>Results</p> <p>A combination of test functions is used to estimate the number of differentially expressed genes. Simulation study shows that the proposed method has a higher power to detect these genes than other existing methods, while still keeping the FDR under control. The improvement can be substantial if the proportion of true differentially expressed genes is large. This procedure has also been tested with good results using a real dataset.</p> <p>Conclusion</p> <p>For a given expected FDR, the method proposed in this paper has better power to pick genes that show differentiation in their expression than two other well known methods.</p>http://www.biomedcentral.com/1471-2105/7/15
spellingShingle Yang Mark CK
Yang James J
An improved procedure for gene selection from microarray experiments using false discovery rate criterion
BMC Bioinformatics
title An improved procedure for gene selection from microarray experiments using false discovery rate criterion
title_full An improved procedure for gene selection from microarray experiments using false discovery rate criterion
title_fullStr An improved procedure for gene selection from microarray experiments using false discovery rate criterion
title_full_unstemmed An improved procedure for gene selection from microarray experiments using false discovery rate criterion
title_short An improved procedure for gene selection from microarray experiments using false discovery rate criterion
title_sort improved procedure for gene selection from microarray experiments using false discovery rate criterion
url http://www.biomedcentral.com/1471-2105/7/15
work_keys_str_mv AT yangmarkck animprovedprocedureforgeneselectionfrommicroarrayexperimentsusingfalsediscoveryratecriterion
AT yangjamesj animprovedprocedureforgeneselectionfrommicroarrayexperimentsusingfalsediscoveryratecriterion
AT yangmarkck improvedprocedureforgeneselectionfrommicroarrayexperimentsusingfalsediscoveryratecriterion
AT yangjamesj improvedprocedureforgeneselectionfrommicroarrayexperimentsusingfalsediscoveryratecriterion