Transcriptomic dataset of cultivated (Sesamum indicum), wild (S. mulayanum), and interspecific hybrid sesame in response to induced Macrophomina phaseolina infection

We report transcriptome sequencing data for control and infected sesame genotypes. Sesame is an emerging oilseed crop [1]. The destructive soil-borne fungus Macrophomina phaseolina (Tassi) Goid causes charcoal rot of sesame, leading to high (>50%) yield loss. Most of the high-yielding s...


Bibliographic Details
Main Authors: Debabrata Dutta, Vivek Kumar Awon, Gaurab Gangopadhyay
Format: Article
Language: English
Published: Elsevier 2020-12-01
Series: Data in Brief
Subjects: Sesame; Macrophomina infection; Transcriptome; Dataset; SSR; SNP
Online Access: http://www.sciencedirect.com/science/article/pii/S2352340920313305
collection DOAJ
description We report transcriptome sequencing data for control and infected sesame genotypes. Sesame is an emerging oilseed crop [1]. The destructive soil-borne fungus Macrophomina phaseolina (Tassi) Goid causes charcoal rot of sesame, leading to high (>50%) yield loss. Most of the high-yielding sesame cultivars (Sesamum indicum) of India are susceptible to charcoal rot. Wild sesame, Sesamum mulayanum, shows a high degree of tolerance to many pathogens [2]. We earlier developed an interspecific hybrid between Indian cultivated sesame and S. mulayanum. The two parents and the F6 recombinant constitute the three experimental genotypes in the present report. Seedlings were infected with M. phaseolina, and transcriptome data for the infected and control (mock-inoculated) samples are presented. RNA-seq on the Illumina NovaSeq 6000 platform generated 2.9 × 10^8 paired-end reads. We deposited the data in the NCBI Sequence Read Archive (SRA) under accession number PRJNA642699. De novo assembly of the clean reads generated 106,295 unigenes with an average length of 1,342 bp, covering 1.42 × 10^8 nucleotides. Screening the 106,295 unigenes with MISA and SAMtools identified 26,880 simple sequence repeats (SSRs), 90,181 single nucleotide polymorphisms (SNPs), and 25,063 insertions/deletions (InDels). Among the SSRs, apart from mono-nucleotide repeats, di-nucleotide repeats (42.51%) were the most abundant, followed by tri-nucleotide repeats (14.28%). We subsequently designed 22,494 primer pairs based on perfect di- and tri-nucleotide SSRs. Among the SNPs, transitions (Ts, 60%) were the most abundant substitution type, followed by transversions (Tv, 40%), with a Ts/Tv ratio of 1.48. The development of genic-SSR markers and SNP information will pave the way for molecular marker-assisted breeding of sesame for tolerance to charcoal rot.
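The summary statistics quoted in the description can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not the authors' pipeline: the function names and the toy SNP list are hypothetical, and it assumes the usual definitions (a transition swaps purine for purine or pyrimidine for pyrimidine; a transversion swaps between the two classes).

```python
# Illustrative sketch only; names and toy data are hypothetical,
# not taken from the dataset or the authors' workflow.
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_snp(ref: str, alt: str) -> str:
    """Return 'Ts' for a transition and 'Tv' for a transversion."""
    ref, alt = ref.upper(), alt.upper()
    bases = PURINES | PYRIMIDINES
    if ref not in bases or alt not in bases or ref == alt:
        raise ValueError(f"not a single-base substitution: {ref}>{alt}")
    # Both bases in the same chemical class means a transition.
    same_class = {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES
    return "Ts" if same_class else "Tv"


def ts_tv_ratio(snps) -> float:
    """Compute the Ts/Tv ratio from an iterable of (ref, alt) pairs."""
    counts = {"Ts": 0, "Tv": 0}
    for ref, alt in snps:
        counts[classify_snp(ref, alt)] += 1
    if counts["Tv"] == 0:
        raise ValueError("no transversions; ratio is undefined")
    return counts["Ts"] / counts["Tv"]


def transition_fraction(ratio: float) -> float:
    """Convert a Ts/Tv ratio into the fraction of SNPs that are transitions."""
    return ratio / (1.0 + ratio)


# Hypothetical toy input: three transitions and two transversions.
toy_snps = [("A", "G"), ("C", "T"), ("G", "A"), ("A", "C"), ("G", "T")]
ratio = ts_tv_ratio(toy_snps)

# Figures quoted in the description.
reported_fraction = transition_fraction(1.48)  # close to the rounded 60%
total_bases = 106_295 * 1_342                  # unigenes x mean length (bp)

print(f"toy Ts/Tv ratio: {ratio:.2f}")
print(f"transition share implied by Ts/Tv = 1.48: {reported_fraction:.1%}")
print(f"approximate assembled bases: {total_bases:.3e}")
```

A Ts/Tv ratio of 1.48 implies that about 59.7% of SNPs are transitions, which agrees with the rounded 60%/40% split in the description. Likewise, 106,295 unigenes at a mean length of 1,342 bp gives roughly 1.4 × 10^8 nucleotides, in line with the reported assembly size.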
first_indexed 2024-12-21T17:39:20Z
format Article
id doaj.art-412e58f76243450c8b1bb6d7e0011631
institution Directory Open Access Journal
issn 2352-3409
language English
last_indexed 2024-12-21T17:39:20Z
publishDate 2020-12-01
publisher Elsevier
record_format Article
series Data in Brief
spelling doaj.art-412e58f76243450c8b1bb6d7e0011631; 2022-12-21T18:55:41Z; eng; Elsevier; Data in Brief; ISSN 2352-3409; 2020-12-01; vol. 33; article 106448
Authors: Debabrata Dutta (0), Vivek Kumar Awon (1), Gaurab Gangopadhyay (2, corresponding author)
Affiliation (all authors): Division of Plant Biology, Bose Institute (Main Campus), 93/1 APC Road, Kolkata - 700009, India
title Transcriptomic dataset of cultivated (Sesamum indicum), wild (S. mulayanum), and interspecific hybrid sesame in response to induced Macrophomina phaseolina infection
topic Sesame
Macrophomina infection
Transcriptome
Dataset
SSR
SNP
url http://www.sciencedirect.com/science/article/pii/S2352340920313305