Nanopore Sequencing and Hi-C Based De Novo Assembly of <i>Trachidermus fasciatus</i> Genome

<i>Trachidermus fasciatus</i> is a roughskin sculpin fish widespread across the coastal areas of East Asia. Due to environmental destruction and overfishing, the population of this species is under threat. In order to protect this endangered species, it is important to have the genome se...

Full description

Bibliographic Details
Main Authors: Gangcai Xie, Xu Zhang, Feng Lv, Mengmeng Sang, Hairong Hu, Jinqiu Wang, Dong Liu
Format: Article
Language:English
Published: MDPI AG 2021-05-01
Series:Genes
Subjects:
Online Access:https://www.mdpi.com/2073-4425/12/5/692
Description
Summary:<i>Trachidermus fasciatus</i> is a roughskin sculpin fish widespread across the coastal areas of East Asia. Due to environmental destruction and overfishing, the population of this species is under threat. In order to protect this endangered species, it is important to have the genome sequenced. Reference genomes are essential for studying population genetics, domestic farming, and genetic resource protection. However, currently, no reference genome is available for <i>Trachidermus fasciatus</i>, and this has greatly hindered the research on this species. In this study, we integrated nanopore long-read sequencing, Illumina short-read sequencing, and Hi-C methods to thoroughly assemble the <i>Trachidermus fasciatus</i> genome. Our results provided a chromosome-level high-quality genome assembly with a predicted genome size of 542.6 Mbp (2<i>n</i> = 40) and a scaffold N50 of 24.9 Mbp. The BUSCO value for genome assembly completeness was higher than 96%, and the single-base accuracy was 99.997%. Based on EVM-StringTie genome annotation, a total of 19,147 protein-coding genes were identified, including 35,093 mRNA transcripts. In addition, a novel gene-finding strategy named RNR was introduced, and in total, 51 (82) novel genes (transcripts) were identified. Lastly, we present here the first reference genome for <i>Trachidermus fasciatus</i>; this sequence is expected to greatly facilitate future research on this species.
ISSN:2073-4425