Lifting of the 1,000 wheat exome project SNPs from Triticum aestivum cv. Chinese Spring assembly RefSeq v1.0 to RefSeq v2.1
Abstract Objective The 1,000 wheat exome project captured the single nucleotide variants in the coding regions of a diverse set of 890 wheat accessions to analyse the contribution of introgression to adaptation of wheat. However, this highly useful single nucleotide polymorphism (SNP) dataset is bas...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2023-09-01
|
Series: | BMC Research Notes |
Subjects: | |
Online Access: | https://doi.org/10.1186/s13104-023-06496-8 |
_version_ | 1827635673685819392 |
---|---|
author | Akshaya Vasudevan Sylvie Cloutier |
author_facet | Akshaya Vasudevan Sylvie Cloutier |
author_sort | Akshaya Vasudevan |
collection | DOAJ |
description | Abstract Objective The 1,000 wheat exome project captured the single nucleotide variants in the coding regions of a diverse set of 890 wheat accessions to analyse the contribution of introgression to adaptation of wheat. However, this highly useful single nucleotide polymorphism (SNP) dataset is based on RefSeq v1.0 of the International Wheat Genome Sequencing Consortium (IWGSC) assembly of the bread wheat genome of Chinese Spring. This reference sequence has recently been updated using optical maps and long-read sequencing to produce the improved RefSeq v2.1. Our objective was to develop a reliable high-density SNP dataset positioned onto RefSeq v2.1 because it is the current standard reference sequence used by wheat researchers. Results The 3,039,822 SNPs originally positioned on RefSeq v1.0 were projected to v2.1 using Liftoff with four different flanking regions, and 2,946,536 SNPs were consistently lifted to the same location irrespective of the flanking region lengths. Of these, 2,799,166 were located on the ‘+’ ve strand. The distribution of the SNPs across the 21 chromosomes on RefSeq v2.1 was similar to that of RefSeq v1.0. Among the SNPs that were based on unanchored scaffolds in RefSeq v1.0, 11,938 were projected to one of the 21 pseudomolecules in the upgraded assembly. This SNP dataset constitutes a much-needed standardized resource for the wheat research community. |
first_indexed | 2024-03-09T15:30:50Z |
format | Article |
id | doaj.art-0ecb4fcac7bb403080412a1a1de6a090 |
institution | Directory Open Access Journal |
issn | 1756-0500 |
language | English |
last_indexed | 2024-03-09T15:30:50Z |
publishDate | 2023-09-01 |
publisher | BMC |
record_format | Article |
series | BMC Research Notes |
spelling | doaj.art-0ecb4fcac7bb403080412a1a1de6a0902023-11-26T12:14:24ZengBMCBMC Research Notes1756-05002023-09-011611510.1186/s13104-023-06496-8Lifting of the 1,000 wheat exome project SNPs from Triticum aestivum cv. Chinese Spring assembly RefSeq v1.0 to RefSeq v2.1Akshaya Vasudevan0Sylvie Cloutier1Agriculture and Agri-Food Canada, Ottawa Research and Development CentreAgriculture and Agri-Food Canada, Ottawa Research and Development CentreAbstract Objective The 1,000 wheat exome project captured the single nucleotide variants in the coding regions of a diverse set of 890 wheat accessions to analyse the contribution of introgression to adaptation of wheat. However, this highly useful single nucleotide polymorphism (SNP) dataset is based on RefSeq v1.0 of the International Wheat Genome Sequencing Consortium (IWGSC) assembly of the bread wheat genome of Chinese Spring. This reference sequence has recently been updated using optical maps and long-read sequencing to produce the improved RefSeq v2.1. Our objective was to develop a reliable high-density SNP dataset positioned onto RefSeq v2.1 because it is the current standard reference sequence used by wheat researchers. Results The 3,039,822 SNPs originally positioned on RefSeq v1.0 were projected to v2.1 using Liftoff with four different flanking regions, and 2,946,536 SNPs were consistently lifted to the same location irrespective of the flanking region lengths. Of these, 2,799,166 were located on the ‘+’ ve strand. The distribution of the SNPs across the 21 chromosomes on RefSeq v2.1 was similar to that of RefSeq v1.0. Among the SNPs that were based on unanchored scaffolds in RefSeq v1.0, 11,938 were projected to one of the 21 pseudomolecules in the upgraded assembly. This SNP dataset constitutes a much-needed standardized resource for the wheat research community.https://doi.org/10.1186/s13104-023-06496-8LiftoffSNP liftingRefSeq v2.1Chinese Spring reference genome |
spellingShingle | Akshaya Vasudevan Sylvie Cloutier Lifting of the 1,000 wheat exome project SNPs from Triticum aestivum cv. Chinese Spring assembly RefSeq v1.0 to RefSeq v2.1 BMC Research Notes Liftoff SNP lifting RefSeq v2.1 Chinese Spring reference genome |
title | Lifting of the 1,000 wheat exome project SNPs from Triticum aestivum cv. Chinese Spring assembly RefSeq v1.0 to RefSeq v2.1 |
title_full | Lifting of the 1,000 wheat exome project SNPs from Triticum aestivum cv. Chinese Spring assembly RefSeq v1.0 to RefSeq v2.1 |
title_fullStr | Lifting of the 1,000 wheat exome project SNPs from Triticum aestivum cv. Chinese Spring assembly RefSeq v1.0 to RefSeq v2.1 |
title_full_unstemmed | Lifting of the 1,000 wheat exome project SNPs from Triticum aestivum cv. Chinese Spring assembly RefSeq v1.0 to RefSeq v2.1 |
title_short | Lifting of the 1,000 wheat exome project SNPs from Triticum aestivum cv. Chinese Spring assembly RefSeq v1.0 to RefSeq v2.1 |
title_sort | lifting of the 1 000 wheat exome project snps from triticum aestivum cv chinese spring assembly refseq v1 0 to refseq v2 1 |
topic | Liftoff SNP lifting RefSeq v2.1 Chinese Spring reference genome |
url | https://doi.org/10.1186/s13104-023-06496-8 |
work_keys_str_mv | AT akshayavasudevan liftingofthe1000wheatexomeprojectsnpsfromtriticumaestivumcvchinesespringassemblyrefseqv10torefseqv21 AT sylviecloutier liftingofthe1000wheatexomeprojectsnpsfromtriticumaestivumcvchinesespringassemblyrefseqv10torefseqv21 |