Representing true plant genomes: haplotype-resolved hybrid pepper genome with trio-binning

As sequencing costs decrease and availability of high fidelity long-read sequencing increases, generating experiment specific de novo genome assemblies becomes feasible. In many crop species, obtaining the genome of a hybrid or heterozygous individual is necessary for systems that do not tolerate in...

Full description

Bibliographic Details
Main Authors: Emily E. Delorean, Ramey C. Youngblood, Sheron A. Simpson, Ashley N. Schoonmaker, Brian E. Scheffler, William B. Rutter, Amanda M. Hulse-Kemp
Format: Article
Language:English
Published: Frontiers Media S.A. 2023-11-01
Series:Frontiers in Plant Science
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fpls.2023.1184112/full
_version_ 1827761503561842688
author Emily E. Delorean
Emily E. Delorean
Ramey C. Youngblood
Sheron A. Simpson
Ashley N. Schoonmaker
Brian E. Scheffler
William B. Rutter
Amanda M. Hulse-Kemp
Amanda M. Hulse-Kemp
author_facet Emily E. Delorean
Emily E. Delorean
Ramey C. Youngblood
Sheron A. Simpson
Ashley N. Schoonmaker
Brian E. Scheffler
William B. Rutter
Amanda M. Hulse-Kemp
Amanda M. Hulse-Kemp
author_sort Emily E. Delorean
collection DOAJ
description As sequencing costs decrease and availability of high fidelity long-read sequencing increases, generating experiment specific de novo genome assemblies becomes feasible. In many crop species, obtaining the genome of a hybrid or heterozygous individual is necessary for systems that do not tolerate inbreeding or for investigating important biological questions, such as hybrid vigor. However, most genome assembly methods that have been used in plants result in a merged single sequence representation that is not a true biologically accurate representation of either haplotype within a diploid individual. The resulting genome assembly is often fragmented and exhibits a mosaic of the two haplotypes, referred to as haplotype-switching. Important haplotype level information, such as causal mutations and structural variation is therefore lost causing difficulties in interpreting downstream analyses. To overcome this challenge, we have applied a method developed for animal genome assembly called trio-binning to an intra-specific hybrid of chili pepper (Capsicum annuum L. cv. HDA149 x Capsicum annuum L. cv. HDA330). We tested all currently available softwares for performing trio-binning, combined with multiple scaffolding technologies including Bionano to determine the optimal method of producing the best haplotype-resolved assembly. Ultimately, we produced highly contiguous biologically true haplotype-resolved genome assemblies for each parent, with scaffold N50s of 266.0 Mb and 281.3 Mb, with 99.6% and 99.8% positioned into chromosomes respectively. The assemblies captured 3.10 Gb and 3.12 Gb of the estimated 3.5 Gb chili pepper genome size. These assemblies represent the complete genome structure of the intraspecific hybrid, as well as the two parental genomes, and show measurable improvements over the currently available reference genomes. Our manuscript provides a valuable guide on how to apply trio-binning to other plant genomes.
first_indexed 2024-03-11T10:12:44Z
format Article
id doaj.art-57f4b1036a084e05a92a545384b0c4b0
institution Directory Open Access Journal
issn 1664-462X
language English
last_indexed 2024-03-11T10:12:44Z
publishDate 2023-11-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Plant Science
spelling doaj.art-57f4b1036a084e05a92a545384b0c4b02023-11-16T13:29:00ZengFrontiers Media S.A.Frontiers in Plant Science1664-462X2023-11-011410.3389/fpls.2023.11841121184112Representing true plant genomes: haplotype-resolved hybrid pepper genome with trio-binningEmily E. Delorean0Emily E. Delorean1Ramey C. Youngblood2Sheron A. Simpson3Ashley N. Schoonmaker4Brian E. Scheffler5William B. Rutter6Amanda M. Hulse-Kemp7Amanda M. Hulse-Kemp8Genomics and Bioinformatics Research Unit, USDA-ARS, Raleigh, NC, United StatesCrop and Soil Sciences Department, North Carolina State University, Raleigh, NC, United StatesInstitute for Genomics, Biocomputing and Biotechnology, Mississippi State University, Starkville, MS, United StatesGenomics and Bioinformatics Research Unit, United States Department of Agriculture - Agriculture Research Service (USDA-ARS), Stoneville, MS, United StatesCrop and Soil Sciences Department, North Carolina State University, Raleigh, NC, United StatesGenomics and Bioinformatics Research Unit, United States Department of Agriculture - Agriculture Research Service (USDA-ARS), Stoneville, MS, United StatesUS Vegetable Laboratory, United States Department of Agriculture - Agriculture Research Service (USDA-ARS), Charleston, SC, United StatesGenomics and Bioinformatics Research Unit, USDA-ARS, Raleigh, NC, United StatesCrop and Soil Sciences Department, North Carolina State University, Raleigh, NC, United StatesAs sequencing costs decrease and availability of high fidelity long-read sequencing increases, generating experiment specific de novo genome assemblies becomes feasible. In many crop species, obtaining the genome of a hybrid or heterozygous individual is necessary for systems that do not tolerate inbreeding or for investigating important biological questions, such as hybrid vigor. However, most genome assembly methods that have been used in plants result in a merged single sequence representation that is not a true biologically accurate representation of either haplotype within a diploid individual. The resulting genome assembly is often fragmented and exhibits a mosaic of the two haplotypes, referred to as haplotype-switching. Important haplotype level information, such as causal mutations and structural variation is therefore lost causing difficulties in interpreting downstream analyses. To overcome this challenge, we have applied a method developed for animal genome assembly called trio-binning to an intra-specific hybrid of chili pepper (Capsicum annuum L. cv. HDA149 x Capsicum annuum L. cv. HDA330). We tested all currently available softwares for performing trio-binning, combined with multiple scaffolding technologies including Bionano to determine the optimal method of producing the best haplotype-resolved assembly. Ultimately, we produced highly contiguous biologically true haplotype-resolved genome assemblies for each parent, with scaffold N50s of 266.0 Mb and 281.3 Mb, with 99.6% and 99.8% positioned into chromosomes respectively. The assemblies captured 3.10 Gb and 3.12 Gb of the estimated 3.5 Gb chili pepper genome size. These assemblies represent the complete genome structure of the intraspecific hybrid, as well as the two parental genomes, and show measurable improvements over the currently available reference genomes. Our manuscript provides a valuable guide on how to apply trio-binning to other plant genomes.https://www.frontiersin.org/articles/10.3389/fpls.2023.1184112/fullhaplotypepeppergenome assemblytrio-binningHiFi
spellingShingle Emily E. Delorean
Emily E. Delorean
Ramey C. Youngblood
Sheron A. Simpson
Ashley N. Schoonmaker
Brian E. Scheffler
William B. Rutter
Amanda M. Hulse-Kemp
Amanda M. Hulse-Kemp
Representing true plant genomes: haplotype-resolved hybrid pepper genome with trio-binning
Frontiers in Plant Science
haplotype
pepper
genome assembly
trio-binning
HiFi
title Representing true plant genomes: haplotype-resolved hybrid pepper genome with trio-binning
title_full Representing true plant genomes: haplotype-resolved hybrid pepper genome with trio-binning
title_fullStr Representing true plant genomes: haplotype-resolved hybrid pepper genome with trio-binning
title_full_unstemmed Representing true plant genomes: haplotype-resolved hybrid pepper genome with trio-binning
title_short Representing true plant genomes: haplotype-resolved hybrid pepper genome with trio-binning
title_sort representing true plant genomes haplotype resolved hybrid pepper genome with trio binning
topic haplotype
pepper
genome assembly
trio-binning
HiFi
url https://www.frontiersin.org/articles/10.3389/fpls.2023.1184112/full
work_keys_str_mv AT emilyedelorean representingtrueplantgenomeshaplotyperesolvedhybridpeppergenomewithtriobinning
AT emilyedelorean representingtrueplantgenomeshaplotyperesolvedhybridpeppergenomewithtriobinning
AT rameycyoungblood representingtrueplantgenomeshaplotyperesolvedhybridpeppergenomewithtriobinning
AT sheronasimpson representingtrueplantgenomeshaplotyperesolvedhybridpeppergenomewithtriobinning
AT ashleynschoonmaker representingtrueplantgenomeshaplotyperesolvedhybridpeppergenomewithtriobinning
AT brianescheffler representingtrueplantgenomeshaplotyperesolvedhybridpeppergenomewithtriobinning
AT williambrutter representingtrueplantgenomeshaplotyperesolvedhybridpeppergenomewithtriobinning
AT amandamhulsekemp representingtrueplantgenomeshaplotyperesolvedhybridpeppergenomewithtriobinning
AT amandamhulsekemp representingtrueplantgenomeshaplotyperesolvedhybridpeppergenomewithtriobinning