Mining sequence variations in representative polyploid sugarcane germplasm accessions

Abstract Background Sugarcane (Saccharum spp.) is one of the most important economic crops because of its high sugar production and biofuel potential. Due to the high polyploid level and complex genome of sugarcane, it has been a huge challenge to investigate genomic sequence variations, which are c...

Full description

Bibliographic Details
Main Authors: Xiping Yang, Jian Song, Qian You, Dev R. Paudel, Jisen Zhang, Jianping Wang
Format: Article
Language:English
Published: BMC 2017-08-01
Series:BMC Genomics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12864-017-3980-3
_version_ 1819067216605216768
author Xiping Yang
Jian Song
Qian You
Dev R. Paudel
Jisen Zhang
Jianping Wang
author_facet Xiping Yang
Jian Song
Qian You
Dev R. Paudel
Jisen Zhang
Jianping Wang
author_sort Xiping Yang
collection DOAJ
description Abstract Background Sugarcane (Saccharum spp.) is one of the most important economic crops because of its high sugar production and biofuel potential. Due to the high polyploid level and complex genome of sugarcane, it has been a huge challenge to investigate genomic sequence variations, which are critical for identifying alleles contributing to important agronomic traits. In order to mine the genetic variations in sugarcane, genotyping by sequencing (GBS), was used to genotype 14 representative Saccharum complex accessions. GBS is a method to generate a large number of markers, enabled by next generation sequencing (NGS) and the genome complexity reduction using restriction enzymes. Results To use GBS for high throughput genotyping highly polyploid sugarcane, the GBS analysis pipelines in 14 Saccharum complex accessions were established by evaluating different alignment methods, sequence variants callers, and sequence depth for single nucleotide polymorphism (SNP) filtering. By using the established pipeline, a total of 76,251 non-redundant SNPs, 5642 InDels, 6380 presence/absence variants (PAVs), and 826 copy number variations (CNVs) were detected among the 14 accessions. In addition, non-reference based universal network enabled analysis kit and Stacks de novo called 34,353 and 109,043 SNPs, respectively. In the 14 accessions, the percentages of single dose SNPs ranged from 38.3% to 62.3% with an average of 49.6%, much more than the portions of multiple dosage SNPs. Concordantly called SNPs were used to evaluate the phylogenetic relationship among the 14 accessions. The results showed that the divergence time between the Erianthus genus and the Saccharum genus was more than 10 million years ago (MYA). The Saccharum species separated from their common ancestors ranging from 0.19 to 1.65 MYA. Conclusions The GBS pipelines including the reference sequences, alignment methods, sequence variant callers, and sequence depth were recommended and discussed for the Saccharum complex and other related species. A large number of sequence variations were discovered in the Saccharum complex, including SNPs, InDels, PAVs, and CNVs. Genome-wide SNPs were further used to illustrate sequence features of polyploid species and demonstrated the divergence of different species in the Saccharum complex. The results of this study showed that GBS was an effective NGS-based method to discover genomic sequence variations in highly polyploid and heterozygous species.
first_indexed 2024-12-21T16:14:44Z
format Article
id doaj.art-ddd5c0a5a0dd48d587f5aa28ea3d2e10
institution Directory Open Access Journal
issn 1471-2164
language English
last_indexed 2024-12-21T16:14:44Z
publishDate 2017-08-01
publisher BMC
record_format Article
series BMC Genomics
spelling doaj.art-ddd5c0a5a0dd48d587f5aa28ea3d2e102022-12-21T18:57:43ZengBMCBMC Genomics1471-21642017-08-0118111610.1186/s12864-017-3980-3Mining sequence variations in representative polyploid sugarcane germplasm accessionsXiping Yang0Jian Song1Qian You2Dev R. Paudel3Jisen Zhang4Jianping Wang5Department of Agronomy, University of FloridaDepartment of Agronomy, University of FloridaDepartment of Agronomy, University of FloridaDepartment of Agronomy, University of FloridaFAFU and UIUC-SIB Joint Center for Genomics and Biotechnology, Haixia Institute of Science and Techonology, Fujian Agriculture and Forestry UniversityDepartment of Agronomy, University of FloridaAbstract Background Sugarcane (Saccharum spp.) is one of the most important economic crops because of its high sugar production and biofuel potential. Due to the high polyploid level and complex genome of sugarcane, it has been a huge challenge to investigate genomic sequence variations, which are critical for identifying alleles contributing to important agronomic traits. In order to mine the genetic variations in sugarcane, genotyping by sequencing (GBS), was used to genotype 14 representative Saccharum complex accessions. GBS is a method to generate a large number of markers, enabled by next generation sequencing (NGS) and the genome complexity reduction using restriction enzymes. Results To use GBS for high throughput genotyping highly polyploid sugarcane, the GBS analysis pipelines in 14 Saccharum complex accessions were established by evaluating different alignment methods, sequence variants callers, and sequence depth for single nucleotide polymorphism (SNP) filtering. By using the established pipeline, a total of 76,251 non-redundant SNPs, 5642 InDels, 6380 presence/absence variants (PAVs), and 826 copy number variations (CNVs) were detected among the 14 accessions. In addition, non-reference based universal network enabled analysis kit and Stacks de novo called 34,353 and 109,043 SNPs, respectively. In the 14 accessions, the percentages of single dose SNPs ranged from 38.3% to 62.3% with an average of 49.6%, much more than the portions of multiple dosage SNPs. Concordantly called SNPs were used to evaluate the phylogenetic relationship among the 14 accessions. The results showed that the divergence time between the Erianthus genus and the Saccharum genus was more than 10 million years ago (MYA). The Saccharum species separated from their common ancestors ranging from 0.19 to 1.65 MYA. Conclusions The GBS pipelines including the reference sequences, alignment methods, sequence variant callers, and sequence depth were recommended and discussed for the Saccharum complex and other related species. A large number of sequence variations were discovered in the Saccharum complex, including SNPs, InDels, PAVs, and CNVs. Genome-wide SNPs were further used to illustrate sequence features of polyploid species and demonstrated the divergence of different species in the Saccharum complex. The results of this study showed that GBS was an effective NGS-based method to discover genomic sequence variations in highly polyploid and heterozygous species.http://link.springer.com/article/10.1186/s12864-017-3980-3SugarcaneSaccharum ComplexSequence variationsGenotyping by sequencing (GBS)PolyploidSequence features
spellingShingle Xiping Yang
Jian Song
Qian You
Dev R. Paudel
Jisen Zhang
Jianping Wang
Mining sequence variations in representative polyploid sugarcane germplasm accessions
BMC Genomics
Sugarcane
Saccharum Complex
Sequence variations
Genotyping by sequencing (GBS)
Polyploid
Sequence features
title Mining sequence variations in representative polyploid sugarcane germplasm accessions
title_full Mining sequence variations in representative polyploid sugarcane germplasm accessions
title_fullStr Mining sequence variations in representative polyploid sugarcane germplasm accessions
title_full_unstemmed Mining sequence variations in representative polyploid sugarcane germplasm accessions
title_short Mining sequence variations in representative polyploid sugarcane germplasm accessions
title_sort mining sequence variations in representative polyploid sugarcane germplasm accessions
topic Sugarcane
Saccharum Complex
Sequence variations
Genotyping by sequencing (GBS)
Polyploid
Sequence features
url http://link.springer.com/article/10.1186/s12864-017-3980-3
work_keys_str_mv AT xipingyang miningsequencevariationsinrepresentativepolyploidsugarcanegermplasmaccessions
AT jiansong miningsequencevariationsinrepresentativepolyploidsugarcanegermplasmaccessions
AT qianyou miningsequencevariationsinrepresentativepolyploidsugarcanegermplasmaccessions
AT devrpaudel miningsequencevariationsinrepresentativepolyploidsugarcanegermplasmaccessions
AT jisenzhang miningsequencevariationsinrepresentativepolyploidsugarcanegermplasmaccessions
AT jianpingwang miningsequencevariationsinrepresentativepolyploidsugarcanegermplasmaccessions