Evaluation and Recommendations for Routine Genotyping Using Skim Whole Genome Re-sequencing in Canola
Whole genome sequencing offers genome wide, unbiased markers, and inexpensive library preparation. With the cost of sequencing decreasing rapidly, many plant genomes of modest size are amenable to skim whole genome resequencing (skim WGR). The use of skim WGR in diverse sample sets without the use o...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2018-12-01
|
Series: | Frontiers in Plant Science |
Subjects: | |
Online Access: | https://www.frontiersin.org/article/10.3389/fpls.2018.01809/full |
_version_ | 1818128550923862016 |
---|---|
author | M. Michelle Malmberg M. Michelle Malmberg Denise M. Barbulescu Michelle C. Drayton Maiko Shinozuka Preeti Thakur Yvonne O. Ogaji German C. Spangenberg German C. Spangenberg Hans D. Daetwyler Hans D. Daetwyler Noel O. I. Cogan Noel O. I. Cogan |
author_facet | M. Michelle Malmberg M. Michelle Malmberg Denise M. Barbulescu Michelle C. Drayton Maiko Shinozuka Preeti Thakur Yvonne O. Ogaji German C. Spangenberg German C. Spangenberg Hans D. Daetwyler Hans D. Daetwyler Noel O. I. Cogan Noel O. I. Cogan |
author_sort | M. Michelle Malmberg |
collection | DOAJ |
description | Whole genome sequencing offers genome wide, unbiased markers, and inexpensive library preparation. With the cost of sequencing decreasing rapidly, many plant genomes of modest size are amenable to skim whole genome resequencing (skim WGR). The use of skim WGR in diverse sample sets without the use of imputation was evaluated in silico in 149 canola samples representative of global diversity. Fastq files with an average of 10x coverage of the reference genome were used to generate skim samples representing 0.25x, 0.5x, 1x, 2x, 3x, 4x, and 5x sequencing coverage. Applying a pre-defined list of SNPs versus de novo SNP discovery was evaluated. As skim WGR is expected to result in some degree of insufficient allele sampling, all skim coverage levels were filtered at a range of minimum read depths from a relaxed minimum read depth of 2 to a stringent read depth of 5, resulting in 28 list-based SNP sets. As a broad recommendation, genotyping pre-defined SNPs between 1x and 2x coverage with relatively stringent depth filtering is appropriate for a diverse sample set of canola due to a balance between marker number, sufficient accuracy, and sequencing cost, but depends on the intended application. This was experimentally examined in two sample sets with different genetic backgrounds: 1x coverage of 1,590 individuals from 84 Australian spring type four-parent crosses aimed at maximizing diversity as well as one commercial F1 hybrid, and 2x coverage of 379 doubled haploids (DHs) derived from a subset of the four-parent crosses. To determine optimal coverage in a simpler genetic background, the DH sample sequence coverage was further down sampled in silico. The flexible and cost-effective nature of the protocol makes it highly applicable across a range of species and purposes. |
first_indexed | 2024-12-11T07:35:02Z |
format | Article |
id | doaj.art-d3f18ae88b3d460cb7a64a750ebc0ce3 |
institution | Directory Open Access Journal |
issn | 1664-462X |
language | English |
last_indexed | 2024-12-11T07:35:02Z |
publishDate | 2018-12-01 |
publisher | Frontiers Media S.A. |
record_format | Article |
series | Frontiers in Plant Science |
spelling | doaj.art-d3f18ae88b3d460cb7a64a750ebc0ce32022-12-22T01:15:43ZengFrontiers Media S.A.Frontiers in Plant Science1664-462X2018-12-01910.3389/fpls.2018.01809412949Evaluation and Recommendations for Routine Genotyping Using Skim Whole Genome Re-sequencing in CanolaM. Michelle Malmberg0M. Michelle Malmberg1Denise M. Barbulescu2Michelle C. Drayton3Maiko Shinozuka4Preeti Thakur5Yvonne O. Ogaji6German C. Spangenberg7German C. Spangenberg8Hans D. Daetwyler9Hans D. Daetwyler10Noel O. I. Cogan11Noel O. I. Cogan12Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, AustraliaSchool of Applied Systems Biology, La Trobe University, Bundoora, VIC, AustraliaAgriculture Victoria, Grains Innovation Park, Horsham, VIC, AustraliaAgriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, AustraliaAgriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, AustraliaAgriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, AustraliaAgriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, AustraliaAgriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, AustraliaSchool of Applied Systems Biology, La Trobe University, Bundoora, VIC, AustraliaAgriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, AustraliaSchool of Applied Systems Biology, La Trobe University, Bundoora, VIC, AustraliaAgriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, AustraliaSchool of Applied Systems Biology, La Trobe University, Bundoora, VIC, AustraliaWhole genome sequencing offers genome wide, unbiased markers, and inexpensive library preparation. With the cost of sequencing decreasing rapidly, many plant genomes of modest size are amenable to skim whole genome resequencing (skim WGR). The use of skim WGR in diverse sample sets without the use of imputation was evaluated in silico in 149 canola samples representative of global diversity. Fastq files with an average of 10x coverage of the reference genome were used to generate skim samples representing 0.25x, 0.5x, 1x, 2x, 3x, 4x, and 5x sequencing coverage. Applying a pre-defined list of SNPs versus de novo SNP discovery was evaluated. As skim WGR is expected to result in some degree of insufficient allele sampling, all skim coverage levels were filtered at a range of minimum read depths from a relaxed minimum read depth of 2 to a stringent read depth of 5, resulting in 28 list-based SNP sets. As a broad recommendation, genotyping pre-defined SNPs between 1x and 2x coverage with relatively stringent depth filtering is appropriate for a diverse sample set of canola due to a balance between marker number, sufficient accuracy, and sequencing cost, but depends on the intended application. This was experimentally examined in two sample sets with different genetic backgrounds: 1x coverage of 1,590 individuals from 84 Australian spring type four-parent crosses aimed at maximizing diversity as well as one commercial F1 hybrid, and 2x coverage of 379 doubled haploids (DHs) derived from a subset of the four-parent crosses. To determine optimal coverage in a simpler genetic background, the DH sample sequence coverage was further down sampled in silico. The flexible and cost-effective nature of the protocol makes it highly applicable across a range of species and purposes.https://www.frontiersin.org/article/10.3389/fpls.2018.01809/fullGBSlow coverageBrassica napusdoubled haploidplant |
spellingShingle | M. Michelle Malmberg M. Michelle Malmberg Denise M. Barbulescu Michelle C. Drayton Maiko Shinozuka Preeti Thakur Yvonne O. Ogaji German C. Spangenberg German C. Spangenberg Hans D. Daetwyler Hans D. Daetwyler Noel O. I. Cogan Noel O. I. Cogan Evaluation and Recommendations for Routine Genotyping Using Skim Whole Genome Re-sequencing in Canola Frontiers in Plant Science GBS low coverage Brassica napus doubled haploid plant |
title | Evaluation and Recommendations for Routine Genotyping Using Skim Whole Genome Re-sequencing in Canola |
title_full | Evaluation and Recommendations for Routine Genotyping Using Skim Whole Genome Re-sequencing in Canola |
title_fullStr | Evaluation and Recommendations for Routine Genotyping Using Skim Whole Genome Re-sequencing in Canola |
title_full_unstemmed | Evaluation and Recommendations for Routine Genotyping Using Skim Whole Genome Re-sequencing in Canola |
title_short | Evaluation and Recommendations for Routine Genotyping Using Skim Whole Genome Re-sequencing in Canola |
title_sort | evaluation and recommendations for routine genotyping using skim whole genome re sequencing in canola |
topic | GBS low coverage Brassica napus doubled haploid plant |
url | https://www.frontiersin.org/article/10.3389/fpls.2018.01809/full |
work_keys_str_mv | AT mmichellemalmberg evaluationandrecommendationsforroutinegenotypingusingskimwholegenomeresequencingincanola AT mmichellemalmberg evaluationandrecommendationsforroutinegenotypingusingskimwholegenomeresequencingincanola AT denisembarbulescu evaluationandrecommendationsforroutinegenotypingusingskimwholegenomeresequencingincanola AT michellecdrayton evaluationandrecommendationsforroutinegenotypingusingskimwholegenomeresequencingincanola AT maikoshinozuka evaluationandrecommendationsforroutinegenotypingusingskimwholegenomeresequencingincanola AT preetithakur evaluationandrecommendationsforroutinegenotypingusingskimwholegenomeresequencingincanola AT yvonneoogaji evaluationandrecommendationsforroutinegenotypingusingskimwholegenomeresequencingincanola AT germancspangenberg evaluationandrecommendationsforroutinegenotypingusingskimwholegenomeresequencingincanola AT germancspangenberg evaluationandrecommendationsforroutinegenotypingusingskimwholegenomeresequencingincanola AT hansddaetwyler evaluationandrecommendationsforroutinegenotypingusingskimwholegenomeresequencingincanola AT hansddaetwyler evaluationandrecommendationsforroutinegenotypingusingskimwholegenomeresequencingincanola AT noeloicogan evaluationandrecommendationsforroutinegenotypingusingskimwholegenomeresequencingincanola AT noeloicogan evaluationandrecommendationsforroutinegenotypingusingskimwholegenomeresequencingincanola |