The statistics of identifying differentially expressed genes in Expresso and TM4: a comparison
<p>Abstract</p> <p>Background</p> <p>Analysis of DNA microarray data takes as input spot intensity measurements from scanner software and returns differential expression of genes between two conditions, together with a statistical significance assessment. This process t...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2006-04-01
|
Series: | BMC Bioinformatics |
Online Access: | http://www.biomedcentral.com/1471-2105/7/215 |
_version_ | 1818050683785445376 |
---|---|
author | Heath Lenwood S Sha Wei Li Pinghua Mane Shrinivasrao P Sioson Allan A Bohnert Hans J Grene Ruth |
author_facet | Heath Lenwood S Sha Wei Li Pinghua Mane Shrinivasrao P Sioson Allan A Bohnert Hans J Grene Ruth |
author_sort | Heath Lenwood S |
collection | DOAJ |
description | <p>Abstract</p> <p>Background</p> <p>Analysis of DNA microarray data takes as input spot intensity measurements from scanner software and returns differential expression of genes between two conditions, together with a statistical significance assessment. This process typically consists of two steps: data normalization and identification of differentially expressed genes through statistical analysis. The Expresso microarray experiment management system implements these steps with a two-stage, log-linear ANOVA mixed model technique, tailored to individual experimental designs. The complement of tools in TM4, on the other hand, is based on a number of preset design choices that limit its flexibility. In the TM4 microarray analysis suite, normalization, filter, and analysis methods form an analysis pipeline. TM4 computes integrated intensity values (IIV) from the average intensities and spot pixel counts returned by the scanner software as input to its normalization steps. By contrast, Expresso can use either IIV data or median intensity values (MIV). Here, we compare Expresso and TM4 analysis of two experiments and assess the results against qRT-PCR data.</p> <p>Results</p> <p>The Expresso analysis using MIV data consistently identifies more genes as differentially expressed, when compared to Expresso analysis with IIV data. The typical TM4 normalization and filtering pipeline corrects systematic intensity-specific bias on a per microarray basis. Subsequent statistical analysis with Expresso or a TM4 <it>t</it>-test can effectively identify differentially expressed genes. The best agreement with qRT-PCR data is obtained through the use of Expresso analysis and MIV data.</p> <p>Conclusion</p> <p>The results of this research are of practical value to biologists who analyze microarray data sets. The TM4 normalization and filtering pipeline corrects microarray-specific systematic bias and complements the normalization stage in Expresso analysis. The results of Expresso using MIV data have the best agreement with qRT-PCR results. In one experiment, MIV is a better choice than IIV as input to data normalization and statistical analysis methods, as it yields as greater number of statistically significant differentially expressed genes; TM4 does not support the choice of MIV input data. Overall, the more flexible and extensive statistical models of Expresso achieve more accurate analytical results, when judged by the yardstick of qRT-PCR data, in the context of an experimental design of modest complexity.</p> |
first_indexed | 2024-12-10T10:57:23Z |
format | Article |
id | doaj.art-f329bd05be2641c99f7499df8d196a6f |
institution | Directory Open Access Journal |
issn | 1471-2105 |
language | English |
last_indexed | 2024-12-10T10:57:23Z |
publishDate | 2006-04-01 |
publisher | BMC |
record_format | Article |
series | BMC Bioinformatics |
spelling | doaj.art-f329bd05be2641c99f7499df8d196a6f2022-12-22T01:51:50ZengBMCBMC Bioinformatics1471-21052006-04-017121510.1186/1471-2105-7-215The statistics of identifying differentially expressed genes in Expresso and TM4: a comparisonHeath Lenwood SSha WeiLi PinghuaMane Shrinivasrao PSioson Allan ABohnert Hans JGrene Ruth<p>Abstract</p> <p>Background</p> <p>Analysis of DNA microarray data takes as input spot intensity measurements from scanner software and returns differential expression of genes between two conditions, together with a statistical significance assessment. This process typically consists of two steps: data normalization and identification of differentially expressed genes through statistical analysis. The Expresso microarray experiment management system implements these steps with a two-stage, log-linear ANOVA mixed model technique, tailored to individual experimental designs. The complement of tools in TM4, on the other hand, is based on a number of preset design choices that limit its flexibility. In the TM4 microarray analysis suite, normalization, filter, and analysis methods form an analysis pipeline. TM4 computes integrated intensity values (IIV) from the average intensities and spot pixel counts returned by the scanner software as input to its normalization steps. By contrast, Expresso can use either IIV data or median intensity values (MIV). Here, we compare Expresso and TM4 analysis of two experiments and assess the results against qRT-PCR data.</p> <p>Results</p> <p>The Expresso analysis using MIV data consistently identifies more genes as differentially expressed, when compared to Expresso analysis with IIV data. The typical TM4 normalization and filtering pipeline corrects systematic intensity-specific bias on a per microarray basis. Subsequent statistical analysis with Expresso or a TM4 <it>t</it>-test can effectively identify differentially expressed genes. The best agreement with qRT-PCR data is obtained through the use of Expresso analysis and MIV data.</p> <p>Conclusion</p> <p>The results of this research are of practical value to biologists who analyze microarray data sets. The TM4 normalization and filtering pipeline corrects microarray-specific systematic bias and complements the normalization stage in Expresso analysis. The results of Expresso using MIV data have the best agreement with qRT-PCR results. In one experiment, MIV is a better choice than IIV as input to data normalization and statistical analysis methods, as it yields as greater number of statistically significant differentially expressed genes; TM4 does not support the choice of MIV input data. Overall, the more flexible and extensive statistical models of Expresso achieve more accurate analytical results, when judged by the yardstick of qRT-PCR data, in the context of an experimental design of modest complexity.</p>http://www.biomedcentral.com/1471-2105/7/215 |
spellingShingle | Heath Lenwood S Sha Wei Li Pinghua Mane Shrinivasrao P Sioson Allan A Bohnert Hans J Grene Ruth The statistics of identifying differentially expressed genes in Expresso and TM4: a comparison BMC Bioinformatics |
title | The statistics of identifying differentially expressed genes in Expresso and TM4: a comparison |
title_full | The statistics of identifying differentially expressed genes in Expresso and TM4: a comparison |
title_fullStr | The statistics of identifying differentially expressed genes in Expresso and TM4: a comparison |
title_full_unstemmed | The statistics of identifying differentially expressed genes in Expresso and TM4: a comparison |
title_short | The statistics of identifying differentially expressed genes in Expresso and TM4: a comparison |
title_sort | statistics of identifying differentially expressed genes in expresso and tm4 a comparison |
url | http://www.biomedcentral.com/1471-2105/7/215 |
work_keys_str_mv | AT heathlenwoods thestatisticsofidentifyingdifferentiallyexpressedgenesinexpressoandtm4acomparison AT shawei thestatisticsofidentifyingdifferentiallyexpressedgenesinexpressoandtm4acomparison AT lipinghua thestatisticsofidentifyingdifferentiallyexpressedgenesinexpressoandtm4acomparison AT maneshrinivasraop thestatisticsofidentifyingdifferentiallyexpressedgenesinexpressoandtm4acomparison AT siosonallana thestatisticsofidentifyingdifferentiallyexpressedgenesinexpressoandtm4acomparison AT bohnerthansj thestatisticsofidentifyingdifferentiallyexpressedgenesinexpressoandtm4acomparison AT greneruth thestatisticsofidentifyingdifferentiallyexpressedgenesinexpressoandtm4acomparison AT heathlenwoods statisticsofidentifyingdifferentiallyexpressedgenesinexpressoandtm4acomparison AT shawei statisticsofidentifyingdifferentiallyexpressedgenesinexpressoandtm4acomparison AT lipinghua statisticsofidentifyingdifferentiallyexpressedgenesinexpressoandtm4acomparison AT maneshrinivasraop statisticsofidentifyingdifferentiallyexpressedgenesinexpressoandtm4acomparison AT siosonallana statisticsofidentifyingdifferentiallyexpressedgenesinexpressoandtm4acomparison AT bohnerthansj statisticsofidentifyingdifferentiallyexpressedgenesinexpressoandtm4acomparison AT greneruth statisticsofidentifyingdifferentiallyexpressedgenesinexpressoandtm4acomparison |