A high-quality de novo genome assembly based on nanopore sequencing of a wild-caught coconut rhinoceros beetle (Oryctes rhinoceros)

Abstract Background An optimal starting point for relating genome function to organismal biology is a high-quality nuclear genome assembly, and long-read sequencing is revolutionizing the production of this genomic resource in insects. Despite this, nuclear genome assemblies have been under-represen...

Full description

Bibliographic Details
Main Authors: Igor Filipović, Gordana Rašić, James Hereward, Maria Gharuka, Gregor J. Devine, Michael J. Furlong, Kayvan Etebari
Format: Article
Language:English
Published: BMC 2022-06-01
Series:BMC Genomics
Subjects:
Online Access:https://doi.org/10.1186/s12864-022-08628-z
_version_ 1811257984560398336
author Igor Filipović
Gordana Rašić
James Hereward
Maria Gharuka
Gregor J. Devine
Michael J. Furlong
Kayvan Etebari
author_facet Igor Filipović
Gordana Rašić
James Hereward
Maria Gharuka
Gregor J. Devine
Michael J. Furlong
Kayvan Etebari
author_sort Igor Filipović
collection DOAJ
description Abstract Background An optimal starting point for relating genome function to organismal biology is a high-quality nuclear genome assembly, and long-read sequencing is revolutionizing the production of this genomic resource in insects. Despite this, nuclear genome assemblies have been under-represented for agricultural insect pests, particularly from the order Coleoptera. Here we present a de novo genome assembly and structural annotation for the coconut rhinoceros beetle, Oryctes rhinoceros (Coleoptera: Scarabaeidae), based on Oxford Nanopore Technologies (ONT) long-read data generated from a wild-caught female, as well as the assembly process that also led to the recovery of the complete circular genome assemblies of the beetle’s mitochondrial genome and that of the biocontrol agent, Oryctes rhinoceros nudivirus (OrNV). As an invasive pest of palm trees, O. rhinoceros is undergoing an expansion in its range across the Pacific Islands, requiring new approaches to management that may include strategies facilitated by genome assembly and annotation. Results High-quality DNA isolated from an adult female was used to create four ONT libraries that were sequenced using four MinION flow cells, producing a total of 27.2 Gb of high-quality long-read sequences. We employed an iterative assembly process and polishing with one lane of high-accuracy Illumina reads, obtaining a final size of the assembly of 377.36 Mb that had high contiguity (fragment N50 length = 12 Mb) and accuracy, as evidenced by the exceptionally high completeness of the benchmarked set of conserved single-copy orthologous genes (BUSCO completeness = 99.1%). These quality metrics place our assembly ahead of the published Coleopteran genomes, including that of an insect model, the red flour beetle (Tribolium castaneum). The structural annotation of the nuclear genome assembly contained a highly-accurate set of 16,371 protein-coding genes, with only 2.8% missing BUSCOs, and the expected number of non-coding RNAs. The number and structure of paralogous genes in a gene family like Sigma GST is lower than in another scarab beetle (Onthophagus taurus), but higher than in the red flour beetle (Tribolium castaneum), which suggests expansion of this GST class in Scarabaeidae. The quality of our gene models was also confirmed with the correct placement of O. rhinoceros among other members of the rhinoceros beetles (subfamily Dynastinae) in a phylogeny based on the sequences of 95 protein-coding genes in 373 beetle species from all major lineages of Coleoptera. Finally, we provide a list of 30 candidate dsRNA targets whose orthologs have been experimentally validated as highly effective targets for RNAi-based control of several beetles. Conclusions The genomic resources produced in this study form a foundation for further functional genetic research and management programs that may inform the control and surveillance of O. rhinoceros populations, and we demonstrate the efficacy of de novo genome assembly using long-read ONT data from a single field-caught insect.
first_indexed 2024-04-12T18:06:07Z
format Article
id doaj.art-4457976727d4473f9315a495bb7974de
institution Directory Open Access Journal
issn 1471-2164
language English
last_indexed 2024-04-12T18:06:07Z
publishDate 2022-06-01
publisher BMC
record_format Article
series BMC Genomics
spelling doaj.art-4457976727d4473f9315a495bb7974de2022-12-22T03:21:58ZengBMCBMC Genomics1471-21642022-06-0123111510.1186/s12864-022-08628-zA high-quality de novo genome assembly based on nanopore sequencing of a wild-caught coconut rhinoceros beetle (Oryctes rhinoceros)Igor Filipović0Gordana Rašić1James Hereward2Maria Gharuka3Gregor J. Devine4Michael J. Furlong5Kayvan Etebari6School of Biological Sciences, The University of QueenslandMosquito Control Laboratory, QIMR Berghofer Medical Research InstituteSchool of Biological Sciences, The University of QueenslandResearch Division, Ministry of Agriculture and LivestockMosquito Control Laboratory, QIMR Berghofer Medical Research InstituteSchool of Biological Sciences, The University of QueenslandSchool of Biological Sciences, The University of QueenslandAbstract Background An optimal starting point for relating genome function to organismal biology is a high-quality nuclear genome assembly, and long-read sequencing is revolutionizing the production of this genomic resource in insects. Despite this, nuclear genome assemblies have been under-represented for agricultural insect pests, particularly from the order Coleoptera. Here we present a de novo genome assembly and structural annotation for the coconut rhinoceros beetle, Oryctes rhinoceros (Coleoptera: Scarabaeidae), based on Oxford Nanopore Technologies (ONT) long-read data generated from a wild-caught female, as well as the assembly process that also led to the recovery of the complete circular genome assemblies of the beetle’s mitochondrial genome and that of the biocontrol agent, Oryctes rhinoceros nudivirus (OrNV). As an invasive pest of palm trees, O. rhinoceros is undergoing an expansion in its range across the Pacific Islands, requiring new approaches to management that may include strategies facilitated by genome assembly and annotation. Results High-quality DNA isolated from an adult female was used to create four ONT libraries that were sequenced using four MinION flow cells, producing a total of 27.2 Gb of high-quality long-read sequences. We employed an iterative assembly process and polishing with one lane of high-accuracy Illumina reads, obtaining a final size of the assembly of 377.36 Mb that had high contiguity (fragment N50 length = 12 Mb) and accuracy, as evidenced by the exceptionally high completeness of the benchmarked set of conserved single-copy orthologous genes (BUSCO completeness = 99.1%). These quality metrics place our assembly ahead of the published Coleopteran genomes, including that of an insect model, the red flour beetle (Tribolium castaneum). The structural annotation of the nuclear genome assembly contained a highly-accurate set of 16,371 protein-coding genes, with only 2.8% missing BUSCOs, and the expected number of non-coding RNAs. The number and structure of paralogous genes in a gene family like Sigma GST is lower than in another scarab beetle (Onthophagus taurus), but higher than in the red flour beetle (Tribolium castaneum), which suggests expansion of this GST class in Scarabaeidae. The quality of our gene models was also confirmed with the correct placement of O. rhinoceros among other members of the rhinoceros beetles (subfamily Dynastinae) in a phylogeny based on the sequences of 95 protein-coding genes in 373 beetle species from all major lineages of Coleoptera. Finally, we provide a list of 30 candidate dsRNA targets whose orthologs have been experimentally validated as highly effective targets for RNAi-based control of several beetles. Conclusions The genomic resources produced in this study form a foundation for further functional genetic research and management programs that may inform the control and surveillance of O. rhinoceros populations, and we demonstrate the efficacy of de novo genome assembly using long-read ONT data from a single field-caught insect.https://doi.org/10.1186/s12864-022-08628-zGenome assemblyGenome annotationSingle insect nanopore sequencingOryctes rhinocerosColeoptera
spellingShingle Igor Filipović
Gordana Rašić
James Hereward
Maria Gharuka
Gregor J. Devine
Michael J. Furlong
Kayvan Etebari
A high-quality de novo genome assembly based on nanopore sequencing of a wild-caught coconut rhinoceros beetle (Oryctes rhinoceros)
BMC Genomics
Genome assembly
Genome annotation
Single insect nanopore sequencing
Oryctes rhinoceros
Coleoptera
title A high-quality de novo genome assembly based on nanopore sequencing of a wild-caught coconut rhinoceros beetle (Oryctes rhinoceros)
title_full A high-quality de novo genome assembly based on nanopore sequencing of a wild-caught coconut rhinoceros beetle (Oryctes rhinoceros)
title_fullStr A high-quality de novo genome assembly based on nanopore sequencing of a wild-caught coconut rhinoceros beetle (Oryctes rhinoceros)
title_full_unstemmed A high-quality de novo genome assembly based on nanopore sequencing of a wild-caught coconut rhinoceros beetle (Oryctes rhinoceros)
title_short A high-quality de novo genome assembly based on nanopore sequencing of a wild-caught coconut rhinoceros beetle (Oryctes rhinoceros)
title_sort high quality de novo genome assembly based on nanopore sequencing of a wild caught coconut rhinoceros beetle oryctes rhinoceros
topic Genome assembly
Genome annotation
Single insect nanopore sequencing
Oryctes rhinoceros
Coleoptera
url https://doi.org/10.1186/s12864-022-08628-z
work_keys_str_mv AT igorfilipovic ahighqualitydenovogenomeassemblybasedonnanoporesequencingofawildcaughtcoconutrhinocerosbeetleoryctesrhinoceros
AT gordanarasic ahighqualitydenovogenomeassemblybasedonnanoporesequencingofawildcaughtcoconutrhinocerosbeetleoryctesrhinoceros
AT jameshereward ahighqualitydenovogenomeassemblybasedonnanoporesequencingofawildcaughtcoconutrhinocerosbeetleoryctesrhinoceros
AT mariagharuka ahighqualitydenovogenomeassemblybasedonnanoporesequencingofawildcaughtcoconutrhinocerosbeetleoryctesrhinoceros
AT gregorjdevine ahighqualitydenovogenomeassemblybasedonnanoporesequencingofawildcaughtcoconutrhinocerosbeetleoryctesrhinoceros
AT michaeljfurlong ahighqualitydenovogenomeassemblybasedonnanoporesequencingofawildcaughtcoconutrhinocerosbeetleoryctesrhinoceros
AT kayvanetebari ahighqualitydenovogenomeassemblybasedonnanoporesequencingofawildcaughtcoconutrhinocerosbeetleoryctesrhinoceros
AT igorfilipovic highqualitydenovogenomeassemblybasedonnanoporesequencingofawildcaughtcoconutrhinocerosbeetleoryctesrhinoceros
AT gordanarasic highqualitydenovogenomeassemblybasedonnanoporesequencingofawildcaughtcoconutrhinocerosbeetleoryctesrhinoceros
AT jameshereward highqualitydenovogenomeassemblybasedonnanoporesequencingofawildcaughtcoconutrhinocerosbeetleoryctesrhinoceros
AT mariagharuka highqualitydenovogenomeassemblybasedonnanoporesequencingofawildcaughtcoconutrhinocerosbeetleoryctesrhinoceros
AT gregorjdevine highqualitydenovogenomeassemblybasedonnanoporesequencingofawildcaughtcoconutrhinocerosbeetleoryctesrhinoceros
AT michaeljfurlong highqualitydenovogenomeassemblybasedonnanoporesequencingofawildcaughtcoconutrhinocerosbeetleoryctesrhinoceros
AT kayvanetebari highqualitydenovogenomeassemblybasedonnanoporesequencingofawildcaughtcoconutrhinocerosbeetleoryctesrhinoceros