Transcriptome software results show significant variation among different commercial pipelines

Abstract Background We have been documenting the biological responses to low levels of radiation (natural background) and very low level radiation (below background), and thus these studies are testing mild external stimuli to which we would expect relatively mild biological responses. We recently p...

Full description

Bibliographic Details
Main Authors: Cung Nawl Thawng, Geoffrey Battle Smith
Format: Article
Language:English
Published: BMC 2023-11-01
Series:BMC Genomics
Subjects:
Online Access:https://doi.org/10.1186/s12864-023-09683-w
_version_ 1797637097330311168
author Cung Nawl Thawng
Geoffrey Battle Smith
author_facet Cung Nawl Thawng
Geoffrey Battle Smith
author_sort Cung Nawl Thawng
collection DOAJ
description Abstract Background We have been documenting the biological responses to low levels of radiation (natural background) and very low level radiation (below background), and thus these studies are testing mild external stimuli to which we would expect relatively mild biological responses. We recently published a transcriptome software comparison study based on RNA-Seqs from a below background radiation treatment of two model organisms, E. coli and C. elegans (Thawng and Smith, BMC Genomics 23:452, 2022). We reported DNAstar-D (Deseq2 in the DNAstar software pipeline) to be the more conservative, realistic tool for differential gene expression compared to other transcriptome software packages (CLC, Partek and DNAstar-E (using edgeR). Here we report two follow-up studies (one with a new model organism, Aedes aegypti and another software package (Azenta) on transcriptome responses from varying dose rates using three different sources of natural radiation. Results When E. coli was exposed to varying levels of K40, we again found that the DNAstar-D pipeline yielded a more conservative number of DEGs and a lower fold-difference than the CLC pipeline and DNAstar-E run in parallel. After a 30 read minimum cutoff criterion was applied to the data, the number of significant DEGs ranged from 0 to 81 with DNAstar-D, while the number of significant DEGs ranged from 4 to 117 and 14 to 139 using DNAstar-E and the CLC pipelines, respectively. In terms of the extent of expression, the highest foldchange DEG was observed in DNAstar-E with 19.7-fold followed by 12.5-fold in CLC and 4.3-fold in DNAstar-D. In a recently completed study with Ae. Aegypti and using another software package (Azenta), we analyzed the RNA-Seq response to similar sources of low-level radiation and again found the DNAstar-D pipeline to give the more conservative number and fold-expression of DEGs compared to other softwares. The number of significant DEGs ranged 31–221 in Azenta and 31 to 237 in CLC, 19–252 in DNAstar-E and 0–67 in DNAStar-D. The highest fold-change of DEGs were found in CLC (1,350.9-fold), with DNAstar-E (5.9 -fold) and Azenta (5.5-fold) intermediate, and the lowest levels of expression (4-fold) found in DNAstar-D. Conclusions This study once again highlights the importance of choosing appropriate software for transcriptome analysis. Using three different biological models (bacteria, nematode and mosquito) in four different studies testing very low levels of radiation (Van Voorhies et al., Front Public Health 8:581796, 2020; Thawng and Smith, BMC Genomics 23:452, 2022; current study), the CLC software package resulted in what appears to be an exaggerated gene expression response in terms of numbers of DEGs and extent of expression. Setting a 30-read cutoff diminishes this exaggerated response in most of the software tested. We have further affirmed that DNAstar-Deseq2 gives a more conservative transcriptome expression pattern which appears more suitable for studies expecting subtle gene expression patterns.
first_indexed 2024-03-11T12:43:32Z
format Article
id doaj.art-379a6e194d834e5b81767053a6eead63
institution Directory Open Access Journal
issn 1471-2164
language English
last_indexed 2024-03-11T12:43:32Z
publishDate 2023-11-01
publisher BMC
record_format Article
series BMC Genomics
spelling doaj.art-379a6e194d834e5b81767053a6eead632023-11-05T12:08:32ZengBMCBMC Genomics1471-21642023-11-0124111210.1186/s12864-023-09683-wTranscriptome software results show significant variation among different commercial pipelinesCung Nawl Thawng0Geoffrey Battle Smith1Biology Department and Molecular Biology Program, New Mexico State UniversityBiology Department and Molecular Biology Program, New Mexico State UniversityAbstract Background We have been documenting the biological responses to low levels of radiation (natural background) and very low level radiation (below background), and thus these studies are testing mild external stimuli to which we would expect relatively mild biological responses. We recently published a transcriptome software comparison study based on RNA-Seqs from a below background radiation treatment of two model organisms, E. coli and C. elegans (Thawng and Smith, BMC Genomics 23:452, 2022). We reported DNAstar-D (Deseq2 in the DNAstar software pipeline) to be the more conservative, realistic tool for differential gene expression compared to other transcriptome software packages (CLC, Partek and DNAstar-E (using edgeR). Here we report two follow-up studies (one with a new model organism, Aedes aegypti and another software package (Azenta) on transcriptome responses from varying dose rates using three different sources of natural radiation. Results When E. coli was exposed to varying levels of K40, we again found that the DNAstar-D pipeline yielded a more conservative number of DEGs and a lower fold-difference than the CLC pipeline and DNAstar-E run in parallel. After a 30 read minimum cutoff criterion was applied to the data, the number of significant DEGs ranged from 0 to 81 with DNAstar-D, while the number of significant DEGs ranged from 4 to 117 and 14 to 139 using DNAstar-E and the CLC pipelines, respectively. In terms of the extent of expression, the highest foldchange DEG was observed in DNAstar-E with 19.7-fold followed by 12.5-fold in CLC and 4.3-fold in DNAstar-D. In a recently completed study with Ae. Aegypti and using another software package (Azenta), we analyzed the RNA-Seq response to similar sources of low-level radiation and again found the DNAstar-D pipeline to give the more conservative number and fold-expression of DEGs compared to other softwares. The number of significant DEGs ranged 31–221 in Azenta and 31 to 237 in CLC, 19–252 in DNAstar-E and 0–67 in DNAStar-D. The highest fold-change of DEGs were found in CLC (1,350.9-fold), with DNAstar-E (5.9 -fold) and Azenta (5.5-fold) intermediate, and the lowest levels of expression (4-fold) found in DNAstar-D. Conclusions This study once again highlights the importance of choosing appropriate software for transcriptome analysis. Using three different biological models (bacteria, nematode and mosquito) in four different studies testing very low levels of radiation (Van Voorhies et al., Front Public Health 8:581796, 2020; Thawng and Smith, BMC Genomics 23:452, 2022; current study), the CLC software package resulted in what appears to be an exaggerated gene expression response in terms of numbers of DEGs and extent of expression. Setting a 30-read cutoff diminishes this exaggerated response in most of the software tested. We have further affirmed that DNAstar-Deseq2 gives a more conservative transcriptome expression pattern which appears more suitable for studies expecting subtle gene expression patterns.https://doi.org/10.1186/s12864-023-09683-wTranscriptome softwareRNA-SeqLow radiationPipelineFold-changesModel organisms
spellingShingle Cung Nawl Thawng
Geoffrey Battle Smith
Transcriptome software results show significant variation among different commercial pipelines
BMC Genomics
Transcriptome software
RNA-Seq
Low radiation
Pipeline
Fold-changes
Model organisms
title Transcriptome software results show significant variation among different commercial pipelines
title_full Transcriptome software results show significant variation among different commercial pipelines
title_fullStr Transcriptome software results show significant variation among different commercial pipelines
title_full_unstemmed Transcriptome software results show significant variation among different commercial pipelines
title_short Transcriptome software results show significant variation among different commercial pipelines
title_sort transcriptome software results show significant variation among different commercial pipelines
topic Transcriptome software
RNA-Seq
Low radiation
Pipeline
Fold-changes
Model organisms
url https://doi.org/10.1186/s12864-023-09683-w
work_keys_str_mv AT cungnawlthawng transcriptomesoftwareresultsshowsignificantvariationamongdifferentcommercialpipelines
AT geoffreybattlesmith transcriptomesoftwareresultsshowsignificantvariationamongdifferentcommercialpipelines