Lifting of the 1,000 wheat exome project SNPs from Triticum aestivum cv. Chinese Spring assembly RefSeq v1.0 to RefSeq v2.1

Abstract Objective The 1,000 wheat exome project captured the single nucleotide variants in the coding regions of a diverse set of 890 wheat accessions to analyse the contribution of introgression to adaptation of wheat. However, this highly useful single nucleotide polymorphism (SNP) dataset is bas...

Full description

Bibliographic Details
Main Authors: Akshaya Vasudevan, Sylvie Cloutier
Format: Article
Language:English
Published: BMC 2023-09-01
Series:BMC Research Notes
Subjects:
Online Access:https://doi.org/10.1186/s13104-023-06496-8
_version_ 1827635673685819392
author Akshaya Vasudevan
Sylvie Cloutier
author_facet Akshaya Vasudevan
Sylvie Cloutier
author_sort Akshaya Vasudevan
collection DOAJ
description Abstract Objective The 1,000 wheat exome project captured the single nucleotide variants in the coding regions of a diverse set of 890 wheat accessions to analyse the contribution of introgression to adaptation of wheat. However, this highly useful single nucleotide polymorphism (SNP) dataset is based on RefSeq v1.0 of the International Wheat Genome Sequencing Consortium (IWGSC) assembly of the bread wheat genome of Chinese Spring. This reference sequence has recently been updated using optical maps and long-read sequencing to produce the improved RefSeq v2.1. Our objective was to develop a reliable high-density SNP dataset positioned onto RefSeq v2.1 because it is the current standard reference sequence used by wheat researchers. Results The 3,039,822 SNPs originally positioned on RefSeq v1.0 were projected to v2.1 using Liftoff with four different flanking regions, and 2,946,536 SNPs were consistently lifted to the same location irrespective of the flanking region lengths. Of these, 2,799,166 were located on the ‘+’ ve strand. The distribution of the SNPs across the 21 chromosomes on RefSeq v2.1 was similar to that of RefSeq v1.0. Among the SNPs that were based on unanchored scaffolds in RefSeq v1.0, 11,938 were projected to one of the 21 pseudomolecules in the upgraded assembly. This SNP dataset constitutes a much-needed standardized resource for the wheat research community.
first_indexed 2024-03-09T15:30:50Z
format Article
id doaj.art-0ecb4fcac7bb403080412a1a1de6a090
institution Directory Open Access Journal
issn 1756-0500
language English
last_indexed 2024-03-09T15:30:50Z
publishDate 2023-09-01
publisher BMC
record_format Article
series BMC Research Notes
spelling doaj.art-0ecb4fcac7bb403080412a1a1de6a0902023-11-26T12:14:24ZengBMCBMC Research Notes1756-05002023-09-011611510.1186/s13104-023-06496-8Lifting of the 1,000 wheat exome project SNPs from Triticum aestivum cv. Chinese Spring assembly RefSeq v1.0 to RefSeq v2.1Akshaya Vasudevan0Sylvie Cloutier1Agriculture and Agri-Food Canada, Ottawa Research and Development CentreAgriculture and Agri-Food Canada, Ottawa Research and Development CentreAbstract Objective The 1,000 wheat exome project captured the single nucleotide variants in the coding regions of a diverse set of 890 wheat accessions to analyse the contribution of introgression to adaptation of wheat. However, this highly useful single nucleotide polymorphism (SNP) dataset is based on RefSeq v1.0 of the International Wheat Genome Sequencing Consortium (IWGSC) assembly of the bread wheat genome of Chinese Spring. This reference sequence has recently been updated using optical maps and long-read sequencing to produce the improved RefSeq v2.1. Our objective was to develop a reliable high-density SNP dataset positioned onto RefSeq v2.1 because it is the current standard reference sequence used by wheat researchers. Results The 3,039,822 SNPs originally positioned on RefSeq v1.0 were projected to v2.1 using Liftoff with four different flanking regions, and 2,946,536 SNPs were consistently lifted to the same location irrespective of the flanking region lengths. Of these, 2,799,166 were located on the ‘+’ ve strand. The distribution of the SNPs across the 21 chromosomes on RefSeq v2.1 was similar to that of RefSeq v1.0. Among the SNPs that were based on unanchored scaffolds in RefSeq v1.0, 11,938 were projected to one of the 21 pseudomolecules in the upgraded assembly. This SNP dataset constitutes a much-needed standardized resource for the wheat research community.https://doi.org/10.1186/s13104-023-06496-8LiftoffSNP liftingRefSeq v2.1Chinese Spring reference genome
spellingShingle Akshaya Vasudevan
Sylvie Cloutier
Lifting of the 1,000 wheat exome project SNPs from Triticum aestivum cv. Chinese Spring assembly RefSeq v1.0 to RefSeq v2.1
BMC Research Notes
Liftoff
SNP lifting
RefSeq v2.1
Chinese Spring reference genome
title Lifting of the 1,000 wheat exome project SNPs from Triticum aestivum cv. Chinese Spring assembly RefSeq v1.0 to RefSeq v2.1
title_full Lifting of the 1,000 wheat exome project SNPs from Triticum aestivum cv. Chinese Spring assembly RefSeq v1.0 to RefSeq v2.1
title_fullStr Lifting of the 1,000 wheat exome project SNPs from Triticum aestivum cv. Chinese Spring assembly RefSeq v1.0 to RefSeq v2.1
title_full_unstemmed Lifting of the 1,000 wheat exome project SNPs from Triticum aestivum cv. Chinese Spring assembly RefSeq v1.0 to RefSeq v2.1
title_short Lifting of the 1,000 wheat exome project SNPs from Triticum aestivum cv. Chinese Spring assembly RefSeq v1.0 to RefSeq v2.1
title_sort lifting of the 1 000 wheat exome project snps from triticum aestivum cv chinese spring assembly refseq v1 0 to refseq v2 1
topic Liftoff
SNP lifting
RefSeq v2.1
Chinese Spring reference genome
url https://doi.org/10.1186/s13104-023-06496-8
work_keys_str_mv AT akshayavasudevan liftingofthe1000wheatexomeprojectsnpsfromtriticumaestivumcvchinesespringassemblyrefseqv10torefseqv21
AT sylviecloutier liftingofthe1000wheatexomeprojectsnpsfromtriticumaestivumcvchinesespringassemblyrefseqv10torefseqv21