Evaluation and Recommendations for Routine Genotyping Using Skim Whole Genome Re-sequencing in Canola

Whole genome sequencing offers genome-wide, unbiased markers and inexpensive library preparation. With the cost of sequencing decreasing rapidly, many plant genomes of modest size are amenable to skim whole genome resequencing (skim WGR). The use of skim WGR in diverse sample sets without the use o...

Bibliographic Details
Main Authors: M. Michelle Malmberg, Denise M. Barbulescu, Michelle C. Drayton, Maiko Shinozuka, Preeti Thakur, Yvonne O. Ogaji, German C. Spangenberg, Hans D. Daetwyler, Noel O. I. Cogan
Format: Article
Language: English
Published: Frontiers Media S.A. 2018-12-01
Series: Frontiers in Plant Science
Subjects: GBS; low coverage; Brassica napus; doubled haploid; plant
Online Access: https://www.frontiersin.org/article/10.3389/fpls.2018.01809/full
author M. Michelle Malmberg
Denise M. Barbulescu
Michelle C. Drayton
Maiko Shinozuka
Preeti Thakur
Yvonne O. Ogaji
German C. Spangenberg
Hans D. Daetwyler
Noel O. I. Cogan
collection DOAJ
description Whole genome sequencing offers genome-wide, unbiased markers and inexpensive library preparation. With the cost of sequencing decreasing rapidly, many plant genomes of modest size are amenable to skim whole genome resequencing (skim WGR). The use of skim WGR in diverse sample sets without imputation was evaluated in silico in 149 canola samples representative of global diversity. Fastq files with an average of 10x coverage of the reference genome were used to generate skim samples representing 0.25x, 0.5x, 1x, 2x, 3x, 4x, and 5x sequencing coverage. Genotyping a pre-defined list of SNPs was compared with de novo SNP discovery. As skim WGR is expected to result in some degree of insufficient allele sampling, all skim coverage levels were filtered at a range of minimum read depths, from a relaxed minimum read depth of 2 to a stringent minimum read depth of 5, resulting in 28 list-based SNP sets. As a broad recommendation, genotyping pre-defined SNPs at between 1x and 2x coverage with relatively stringent depth filtering is appropriate for a diverse sample set of canola, because it balances marker number, sufficient accuracy, and sequencing cost, although the best choice depends on the intended application. This recommendation was examined experimentally in two sample sets with different genetic backgrounds: 1x coverage of 1,590 individuals from 84 Australian spring-type four-parent crosses aimed at maximizing diversity, as well as one commercial F1 hybrid, and 2x coverage of 379 doubled haploids (DHs) derived from a subset of the four-parent crosses. To determine optimal coverage in a simpler genetic background, the DH sample sequence coverage was further downsampled in silico. The flexible and cost-effective nature of the protocol makes it highly applicable across a range of species and purposes.
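The arithmetic behind the in silico design in the description can be sketched as follows. This is only an illustration, not the authors' pipeline: the coverage levels and the 10x starting coverage come from the description, while the integer depth thresholds 2, 3, 4, and 5 are an assumption (they are consistent with the stated total of 28 sets), and the function names are hypothetical.

```python
from itertools import product

# Values taken from the description.
FULL_COVERAGE = 10.0                          # average coverage of the source Fastq files
SKIM_COVERAGES = [0.25, 0.5, 1, 2, 3, 4, 5]   # target skim coverage levels

# Assumption: the filters step through every integer depth from 2 to 5.
MIN_READ_DEPTHS = [2, 3, 4, 5]


def subsample_fraction(target, full=FULL_COVERAGE):
    """Fraction of reads to keep so that `full` coverage drops to `target`."""
    if not 0 < target <= full:
        raise ValueError("target coverage must be in (0, full]")
    return target / full


def list_based_snp_sets():
    """Every (coverage, minimum read depth) combination to be filtered."""
    return list(product(SKIM_COVERAGES, MIN_READ_DEPTHS))


if __name__ == "__main__":
    for coverage in SKIM_COVERAGES:
        print(f"{coverage}x: keep {subsample_fraction(coverage):.3f} of reads")
    print(len(list_based_snp_sets()), "list-based SNP sets")
```

Under these assumptions, 7 coverage levels combined with 4 depth thresholds give the 28 list-based SNP sets mentioned in the description.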
first_indexed 2024-12-11T07:35:02Z
format Article
id doaj.art-d3f18ae88b3d460cb7a64a750ebc0ce3
institution Directory Open Access Journal
issn 1664-462X
language English
last_indexed 2024-12-11T07:35:02Z
publishDate 2018-12-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Plant Science
spelling doaj.art-d3f18ae88b3d460cb7a64a750ebc0ce3
2022-12-22T01:15:43Z
eng
Frontiers Media S.A.
Frontiers in Plant Science
1664-462X
2018-12-01
9
10.3389/fpls.2018.01809
412949
Evaluation and Recommendations for Routine Genotyping Using Skim Whole Genome Re-sequencing in Canola
M. Michelle Malmberg (Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia; School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia)
Denise M. Barbulescu (Agriculture Victoria, Grains Innovation Park, Horsham, VIC, Australia)
Michelle C. Drayton (Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia)
Maiko Shinozuka (Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia)
Preeti Thakur (Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia)
Yvonne O. Ogaji (Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia)
German C. Spangenberg (Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia; School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia)
Hans D. Daetwyler (Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia; School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia)
Noel O. I. Cogan (Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia; School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia)
https://www.frontiersin.org/article/10.3389/fpls.2018.01809/full
GBS
low coverage
Brassica napus
doubled haploid
plant
title Evaluation and Recommendations for Routine Genotyping Using Skim Whole Genome Re-sequencing in Canola
topic GBS
low coverage
Brassica napus
doubled haploid
plant
url https://www.frontiersin.org/article/10.3389/fpls.2018.01809/full