Mining sequence variations in representative polyploid sugarcane germplasm accessions
Abstract Background Sugarcane (Saccharum spp.) is one of the most important economic crops because of its high sugar production and biofuel potential. Due to the high polyploid level and complex genome of sugarcane, it has been a huge challenge to investigate genomic sequence variations, which are c...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2017-08-01
|
Series: | BMC Genomics |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s12864-017-3980-3 |
_version_ | 1819067216605216768 |
---|---|
author | Xiping Yang Jian Song Qian You Dev R. Paudel Jisen Zhang Jianping Wang |
author_facet | Xiping Yang Jian Song Qian You Dev R. Paudel Jisen Zhang Jianping Wang |
author_sort | Xiping Yang |
collection | DOAJ |
description | Abstract Background Sugarcane (Saccharum spp.) is one of the most important economic crops because of its high sugar production and biofuel potential. Due to the high polyploid level and complex genome of sugarcane, it has been a huge challenge to investigate genomic sequence variations, which are critical for identifying alleles contributing to important agronomic traits. In order to mine the genetic variations in sugarcane, genotyping by sequencing (GBS), was used to genotype 14 representative Saccharum complex accessions. GBS is a method to generate a large number of markers, enabled by next generation sequencing (NGS) and the genome complexity reduction using restriction enzymes. Results To use GBS for high throughput genotyping highly polyploid sugarcane, the GBS analysis pipelines in 14 Saccharum complex accessions were established by evaluating different alignment methods, sequence variants callers, and sequence depth for single nucleotide polymorphism (SNP) filtering. By using the established pipeline, a total of 76,251 non-redundant SNPs, 5642 InDels, 6380 presence/absence variants (PAVs), and 826 copy number variations (CNVs) were detected among the 14 accessions. In addition, non-reference based universal network enabled analysis kit and Stacks de novo called 34,353 and 109,043 SNPs, respectively. In the 14 accessions, the percentages of single dose SNPs ranged from 38.3% to 62.3% with an average of 49.6%, much more than the portions of multiple dosage SNPs. Concordantly called SNPs were used to evaluate the phylogenetic relationship among the 14 accessions. The results showed that the divergence time between the Erianthus genus and the Saccharum genus was more than 10 million years ago (MYA). The Saccharum species separated from their common ancestors ranging from 0.19 to 1.65 MYA. Conclusions The GBS pipelines including the reference sequences, alignment methods, sequence variant callers, and sequence depth were recommended and discussed for the Saccharum complex and other related species. A large number of sequence variations were discovered in the Saccharum complex, including SNPs, InDels, PAVs, and CNVs. Genome-wide SNPs were further used to illustrate sequence features of polyploid species and demonstrated the divergence of different species in the Saccharum complex. The results of this study showed that GBS was an effective NGS-based method to discover genomic sequence variations in highly polyploid and heterozygous species. |
first_indexed | 2024-12-21T16:14:44Z |
format | Article |
id | doaj.art-ddd5c0a5a0dd48d587f5aa28ea3d2e10 |
institution | Directory Open Access Journal |
issn | 1471-2164 |
language | English |
last_indexed | 2024-12-21T16:14:44Z |
publishDate | 2017-08-01 |
publisher | BMC |
record_format | Article |
series | BMC Genomics |
spelling | doaj.art-ddd5c0a5a0dd48d587f5aa28ea3d2e102022-12-21T18:57:43ZengBMCBMC Genomics1471-21642017-08-0118111610.1186/s12864-017-3980-3Mining sequence variations in representative polyploid sugarcane germplasm accessionsXiping Yang0Jian Song1Qian You2Dev R. Paudel3Jisen Zhang4Jianping Wang5Department of Agronomy, University of FloridaDepartment of Agronomy, University of FloridaDepartment of Agronomy, University of FloridaDepartment of Agronomy, University of FloridaFAFU and UIUC-SIB Joint Center for Genomics and Biotechnology, Haixia Institute of Science and Techonology, Fujian Agriculture and Forestry UniversityDepartment of Agronomy, University of FloridaAbstract Background Sugarcane (Saccharum spp.) is one of the most important economic crops because of its high sugar production and biofuel potential. Due to the high polyploid level and complex genome of sugarcane, it has been a huge challenge to investigate genomic sequence variations, which are critical for identifying alleles contributing to important agronomic traits. In order to mine the genetic variations in sugarcane, genotyping by sequencing (GBS), was used to genotype 14 representative Saccharum complex accessions. GBS is a method to generate a large number of markers, enabled by next generation sequencing (NGS) and the genome complexity reduction using restriction enzymes. Results To use GBS for high throughput genotyping highly polyploid sugarcane, the GBS analysis pipelines in 14 Saccharum complex accessions were established by evaluating different alignment methods, sequence variants callers, and sequence depth for single nucleotide polymorphism (SNP) filtering. By using the established pipeline, a total of 76,251 non-redundant SNPs, 5642 InDels, 6380 presence/absence variants (PAVs), and 826 copy number variations (CNVs) were detected among the 14 accessions. In addition, non-reference based universal network enabled analysis kit and Stacks de novo called 34,353 and 109,043 SNPs, respectively. In the 14 accessions, the percentages of single dose SNPs ranged from 38.3% to 62.3% with an average of 49.6%, much more than the portions of multiple dosage SNPs. Concordantly called SNPs were used to evaluate the phylogenetic relationship among the 14 accessions. The results showed that the divergence time between the Erianthus genus and the Saccharum genus was more than 10 million years ago (MYA). The Saccharum species separated from their common ancestors ranging from 0.19 to 1.65 MYA. Conclusions The GBS pipelines including the reference sequences, alignment methods, sequence variant callers, and sequence depth were recommended and discussed for the Saccharum complex and other related species. A large number of sequence variations were discovered in the Saccharum complex, including SNPs, InDels, PAVs, and CNVs. Genome-wide SNPs were further used to illustrate sequence features of polyploid species and demonstrated the divergence of different species in the Saccharum complex. The results of this study showed that GBS was an effective NGS-based method to discover genomic sequence variations in highly polyploid and heterozygous species.http://link.springer.com/article/10.1186/s12864-017-3980-3SugarcaneSaccharum ComplexSequence variationsGenotyping by sequencing (GBS)PolyploidSequence features |
spellingShingle | Xiping Yang Jian Song Qian You Dev R. Paudel Jisen Zhang Jianping Wang Mining sequence variations in representative polyploid sugarcane germplasm accessions BMC Genomics Sugarcane Saccharum Complex Sequence variations Genotyping by sequencing (GBS) Polyploid Sequence features |
title | Mining sequence variations in representative polyploid sugarcane germplasm accessions |
title_full | Mining sequence variations in representative polyploid sugarcane germplasm accessions |
title_fullStr | Mining sequence variations in representative polyploid sugarcane germplasm accessions |
title_full_unstemmed | Mining sequence variations in representative polyploid sugarcane germplasm accessions |
title_short | Mining sequence variations in representative polyploid sugarcane germplasm accessions |
title_sort | mining sequence variations in representative polyploid sugarcane germplasm accessions |
topic | Sugarcane Saccharum Complex Sequence variations Genotyping by sequencing (GBS) Polyploid Sequence features |
url | http://link.springer.com/article/10.1186/s12864-017-3980-3 |
work_keys_str_mv | AT xipingyang miningsequencevariationsinrepresentativepolyploidsugarcanegermplasmaccessions AT jiansong miningsequencevariationsinrepresentativepolyploidsugarcanegermplasmaccessions AT qianyou miningsequencevariationsinrepresentativepolyploidsugarcanegermplasmaccessions AT devrpaudel miningsequencevariationsinrepresentativepolyploidsugarcanegermplasmaccessions AT jisenzhang miningsequencevariationsinrepresentativepolyploidsugarcanegermplasmaccessions AT jianpingwang miningsequencevariationsinrepresentativepolyploidsugarcanegermplasmaccessions |