The design of experiments for the transcriptome studies by high-throughput sequencing methods
The common questions in the design of the highthroughput sequencing experiments using RNA-Seq or Ribo-Seq methods are reviewed. The ENCODE guidelines (2011) as well as the recently published advances in the design of the studies of mammalian, animal and plant transcriptomes are also summarized in th...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Siberian Branch of the Russian Academy of Sciences, Federal Research Center Institute of Cytology and Genetics, The Vavilov Society of Geneticists and Breeders
2016-05-01
|
Series: | Вавиловский журнал генетики и селекции |
Subjects: | |
Online Access: | https://vavilov.elpub.ru/jour/article/view/594 |
_version_ | 1797214121003843584 |
---|---|
author | P. N. Menshanov N. N. Dygalo |
author_facet | P. N. Menshanov N. N. Dygalo |
author_sort | P. N. Menshanov |
collection | DOAJ |
description | The common questions in the design of the highthroughput sequencing experiments using RNA-Seq or Ribo-Seq methods are reviewed. The ENCODE guidelines (2011) as well as the recently published advances in the design of the studies of mammalian, animal and plant transcriptomes are also summarized in this review. The optimal limit of the sequencing depth does exist for the identification of almost all actively transcribed genes. This limit depends on the transcriptome size in the biological object studied. Additional sequencing does not provide any substantial additional information about the transcriptome complexity. For mammals, the optimal limit of the sequencing depth for the identification of the actively transcribed genes is equal to ~ 2 × 109 bp per biological sample. For other species, the optimal limit of the sequencing depth per biological sample is determined similarly for mammals; however, the transcriptome size and the mean RNA content in the studied object should be taken into account, in comparison to the mammalian transcriptomes. The discovery of differentially expressed genes, as well as the identification of splicing sites in the mRNA could be enhanced by increasing the number of biological samples analyzed per each experimental group. The minimal number of biological replicates per experimental group is equal to 2. However, the optimal number of biological replicates per experimental group is equal to 5–8 (similar to the experiments quantifying the expression of single genes by qRT-PCR). For the transcriptome studies, it is recommended to use the sequencing technologies that have the accuracy of sequencing ≥ 0.999 per bp. For RNASeq, it is also recommended to use the technologies that are able to produce reads equal to or larger than 75 bp, to minimize the cost of the effective identification of the sequences. The relative cost for the sequencing of the control samples could be reduced by increasing the number of experimental groups in the experiment or by combining several independent experiments with similar control groups. The present notes could be utilized during the design step in the experimental studies devoted to the research of transcriptomes. |
first_indexed | 2024-03-07T16:06:47Z |
format | Article |
id | doaj.art-0a5d4e71507e4ac2a2fd84274a7290b3 |
institution | Directory Open Access Journal |
issn | 2500-3259 |
language | English |
last_indexed | 2024-04-24T11:09:08Z |
publishDate | 2016-05-01 |
publisher | Siberian Branch of the Russian Academy of Sciences, Federal Research Center Institute of Cytology and Genetics, The Vavilov Society of Geneticists and Breeders |
record_format | Article |
series | Вавиловский журнал генетики и селекции |
spelling | doaj.art-0a5d4e71507e4ac2a2fd84274a7290b32024-04-11T15:30:56ZengSiberian Branch of the Russian Academy of Sciences, Federal Research Center Institute of Cytology and Genetics, The Vavilov Society of Geneticists and BreedersВавиловский журнал генетики и селекции2500-32592016-05-0120224725410.18699/VJ16.148473The design of experiments for the transcriptome studies by high-throughput sequencing methodsP. N. Menshanov0N. N. Dygalo1Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia Novosibirsk State University, Novosibirsk, RussiaInstitute of Cytology and Genetics SB RAS, Novosibirsk, Russia Novosibirsk State University, Novosibirsk, RussiaThe common questions in the design of the highthroughput sequencing experiments using RNA-Seq or Ribo-Seq methods are reviewed. The ENCODE guidelines (2011) as well as the recently published advances in the design of the studies of mammalian, animal and plant transcriptomes are also summarized in this review. The optimal limit of the sequencing depth does exist for the identification of almost all actively transcribed genes. This limit depends on the transcriptome size in the biological object studied. Additional sequencing does not provide any substantial additional information about the transcriptome complexity. For mammals, the optimal limit of the sequencing depth for the identification of the actively transcribed genes is equal to ~ 2 × 109 bp per biological sample. For other species, the optimal limit of the sequencing depth per biological sample is determined similarly for mammals; however, the transcriptome size and the mean RNA content in the studied object should be taken into account, in comparison to the mammalian transcriptomes. The discovery of differentially expressed genes, as well as the identification of splicing sites in the mRNA could be enhanced by increasing the number of biological samples analyzed per each experimental group. The minimal number of biological replicates per experimental group is equal to 2. However, the optimal number of biological replicates per experimental group is equal to 5–8 (similar to the experiments quantifying the expression of single genes by qRT-PCR). For the transcriptome studies, it is recommended to use the sequencing technologies that have the accuracy of sequencing ≥ 0.999 per bp. For RNASeq, it is also recommended to use the technologies that are able to produce reads equal to or larger than 75 bp, to minimize the cost of the effective identification of the sequences. The relative cost for the sequencing of the control samples could be reduced by increasing the number of experimental groups in the experiment or by combining several independent experiments with similar control groups. The present notes could be utilized during the design step in the experimental studies devoted to the research of transcriptomes.https://vavilov.elpub.ru/jour/article/view/594high-throughput sequencingtranscriptomerna-seqribo-seqdesign of the experiment |
spellingShingle | P. N. Menshanov N. N. Dygalo The design of experiments for the transcriptome studies by high-throughput sequencing methods Вавиловский журнал генетики и селекции high-throughput sequencing transcriptome rna-seq ribo-seq design of the experiment |
title | The design of experiments for the transcriptome studies by high-throughput sequencing methods |
title_full | The design of experiments for the transcriptome studies by high-throughput sequencing methods |
title_fullStr | The design of experiments for the transcriptome studies by high-throughput sequencing methods |
title_full_unstemmed | The design of experiments for the transcriptome studies by high-throughput sequencing methods |
title_short | The design of experiments for the transcriptome studies by high-throughput sequencing methods |
title_sort | design of experiments for the transcriptome studies by high throughput sequencing methods |
topic | high-throughput sequencing transcriptome rna-seq ribo-seq design of the experiment |
url | https://vavilov.elpub.ru/jour/article/view/594 |
work_keys_str_mv | AT pnmenshanov thedesignofexperimentsforthetranscriptomestudiesbyhighthroughputsequencingmethods AT nndygalo thedesignofexperimentsforthetranscriptomestudiesbyhighthroughputsequencingmethods AT pnmenshanov designofexperimentsforthetranscriptomestudiesbyhighthroughputsequencingmethods AT nndygalo designofexperimentsforthetranscriptomestudiesbyhighthroughputsequencingmethods |