The design of experiments for the transcriptome studies by high-throughput sequencing methods

The common questions in the design of high-throughput sequencing experiments using RNA-Seq or Ribo-Seq methods are reviewed. The ENCODE guidelines (2011), as well as recently published advances in the design of studies of mammalian, animal and plant transcriptomes, are also summarized in th...

Bibliographic Details
Main Authors: P. N. Menshanov, N. N. Dygalo
Format: Article
Language: English
Published: Siberian Branch of the Russian Academy of Sciences, Federal Research Center Institute of Cytology and Genetics, The Vavilov Society of Geneticists and Breeders 2016-05-01
Series: Вавиловский журнал генетики и селекции (Vavilov Journal of Genetics and Breeding)
Subjects:
Online Access: https://vavilov.elpub.ru/jour/article/view/594
collection DOAJ
description The common questions in the design of high-throughput sequencing experiments using RNA-Seq or Ribo-Seq methods are reviewed. The ENCODE guidelines (2011), as well as recently published advances in the design of studies of mammalian, animal and plant transcriptomes, are also summarized in this review. An optimal limit of the sequencing depth exists for the identification of almost all actively transcribed genes. This limit depends on the transcriptome size of the biological object studied; sequencing beyond it does not provide substantial additional information about transcriptome complexity. For mammals, the optimal sequencing depth for the identification of actively transcribed genes is ~2 × 10⁹ bp per biological sample. For other species, the optimal sequencing depth per biological sample is determined in the same way as for mammals, but the transcriptome size and the mean RNA content of the studied object should be taken into account relative to mammalian transcriptomes. The discovery of differentially expressed genes, as well as the identification of splicing sites in mRNA, can be enhanced by increasing the number of biological samples analyzed per experimental group. The minimal number of biological replicates per experimental group is 2; the optimal number is 5–8 (similar to experiments quantifying the expression of single genes by qRT-PCR). For transcriptome studies, it is recommended to use sequencing technologies with an accuracy of ≥ 0.999 per bp. For RNA-Seq, it is also recommended to use technologies that produce reads of 75 bp or longer, to minimize the cost of effective sequence identification. The relative cost of sequencing the control samples can be reduced by increasing the number of experimental groups in the experiment or by combining several independent experiments with similar control groups. These notes can be used at the design stage of experimental studies of transcriptomes.
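The figures quoted in the description can be turned into rough planning arithmetic. The following is a minimal sketch, not the authors' method: the function names are hypothetical, sequencing errors are assumed to be independent across bases, and every sample (control or experimental) is assumed to be sequenced to the same depth with the same number of replicates per group. Only the three constants come from the abstract.

```python
# Constants taken from the abstract; everything else is an illustrative assumption.
MAMMAL_DEPTH_BP = 2_000_000_000   # ~2 x 10^9 bp per biological sample (mammals)
MIN_READ_LENGTH_BP = 75           # recommended minimum read length for RNA-Seq
PER_BASE_ACCURACY = 0.999         # recommended minimum per-base accuracy


def reads_per_sample(depth_bp: int, read_length_bp: int) -> int:
    """Number of single-end reads needed to reach depth_bp (rounded up)."""
    if read_length_bp <= 0:
        raise ValueError("read_length_bp must be positive")
    return -(-depth_bp // read_length_bp)  # integer ceiling division


def total_bases(n_groups: int, replicates_per_group: int, depth_bp: int) -> int:
    """Total bases to sequence for the whole experiment (equal depth per sample)."""
    return n_groups * replicates_per_group * depth_bp


def error_free_read_fraction(read_length_bp: int, accuracy: float) -> float:
    """Expected fraction of reads with no errors, assuming independent per-base errors."""
    return accuracy ** read_length_bp


def control_share(n_experimental_groups: int) -> float:
    """Fraction of sequencing spent on one shared control group
    (assumes equal replicates and equal depth in every group)."""
    return 1.0 / (n_experimental_groups + 1)


if __name__ == "__main__":
    print("reads per sample:", reads_per_sample(MAMMAL_DEPTH_BP, MIN_READ_LENGTH_BP))
    print("total bases, 2 groups x 5 replicates:", total_bases(2, 5, MAMMAL_DEPTH_BP))
    print("error-free read fraction:",
          round(error_free_read_fraction(MIN_READ_LENGTH_BP, PER_BASE_ACCURACY), 4))
    for k in (1, 2, 3, 4):
        print(f"{k} experimental group(s): control share = {control_share(k):.2f}")
```

Under these assumptions, the share of sequencing spent on controls falls from one half with a single experimental group to one quarter with three, which illustrates the cost argument made at the end of the description.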
first_indexed 2024-03-07T16:06:47Z
id doaj.art-0a5d4e71507e4ac2a2fd84274a7290b3
institution Directory Open Access Journal
issn 2500-3259
last_indexed 2024-04-24T11:09:08Z
spelling doaj.art-0a5d4e71507e4ac2a2fd84274a7290b3 | 2024-04-11T15:30:56Z | eng | Siberian Branch of the Russian Academy of Sciences, Federal Research Center Institute of Cytology and Genetics, The Vavilov Society of Geneticists and Breeders | Вавиловский журнал генетики и селекции | 2500-3259 | 2016-05-01 | vol. 20, no. 2, pp. 247–254 | 10.18699/VJ16.148 | 473 | The design of experiments for the transcriptome studies by high-throughput sequencing methods | P. N. Menshanov (Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia; Novosibirsk State University, Novosibirsk, Russia) | N. N. Dygalo (Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia; Novosibirsk State University, Novosibirsk, Russia) | https://vavilov.elpub.ru/jour/article/view/594 | high-throughput sequencing, transcriptome, rna-seq, ribo-seq, design of the experiment
topic high-throughput sequencing
transcriptome
rna-seq
ribo-seq
design of the experiment