A High-Quality Reference Genome Assembly of <i>Prinsepia uniflora</i> (Rosaceae)

This study introduces a meticulously constructed genome assembly at the chromosome level for the Rosaceae family species <i>Prinsepia uniflora</i>, a traditional Chinese medicinal herb. The final assembly encompasses 1272.71 megabases (Mb) distributed across 16 pseudochromosomes, boastin...

Full description

Bibliographic Details
Main Authors: Lei Zhang, Chaopan Zhang, Yajing An, Qiang Zhu, Mingcheng Wang
Format: Article
Language:English
Published: MDPI AG 2023-11-01
Series:Genes
Subjects:
Online Access:https://www.mdpi.com/2073-4425/14/11/2035
_version_ 1797459207116554240
author Lei Zhang
Chaopan Zhang
Yajing An
Qiang Zhu
Mingcheng Wang
author_facet Lei Zhang
Chaopan Zhang
Yajing An
Qiang Zhu
Mingcheng Wang
author_sort Lei Zhang
collection DOAJ
description This study introduces a meticulously constructed genome assembly at the chromosome level for the Rosaceae family species <i>Prinsepia uniflora</i>, a traditional Chinese medicinal herb. The final assembly encompasses 1272.71 megabases (Mb) distributed across 16 pseudochromosomes, boasting contig and super-scaffold N50 values of 2.77 and 79.32 Mb, respectively. Annotated within this genome is a substantial 875.99 Mb of repetitive sequences, with transposable elements occupying 777.28 Mb, constituting 61.07% of the entire genome. Our predictive efforts identified 49,261 protein-coding genes within the repeat-masked assembly, with 45,256 (91.87%) having functional annotations, 5127 (10.41%) demonstrating tandem duplication, and 2373 (4.82%) classified as transcription factor genes. Additionally, our investigation unveiled 3080 non-coding RNAs spanning 0.51 Mb of the genome sequences. According to our evolutionary study, <i>P. uniflora</i> underwent recent whole-genome duplication following its separation from <i>Prunus salicina</i>. The presented reference-level genome assembly and annotation for <i>P. uniflora</i> will significantly facilitate the in-depth exploration of genomic information pertaining to this species, offering substantial utility in comparative genomics and evolutionary analyses involving Rosaceae species.
first_indexed 2024-03-09T16:48:05Z
format Article
id doaj.art-9e12fced87e9446281d0afa0bfc7ed7e
institution Directory Open Access Journal
issn 2073-4425
language English
last_indexed 2024-03-09T16:48:05Z
publishDate 2023-11-01
publisher MDPI AG
record_format Article
series Genes
spelling doaj.art-9e12fced87e9446281d0afa0bfc7ed7e2023-11-24T14:43:52ZengMDPI AGGenes2073-44252023-11-011411203510.3390/genes14112035A High-Quality Reference Genome Assembly of <i>Prinsepia uniflora</i> (Rosaceae)Lei Zhang0Chaopan Zhang1Yajing An2Qiang Zhu3Mingcheng Wang4Key Laboratory of Ecological Protection of Agro-Pastoral Ecotones in the Yellow River Basin, National Ethnic Affairs Commission of the People’s Republic of China, College of Biological Science & Engineering, North Minzu University, Yinchuan 750021, ChinaKey Laboratory of Ecological Protection of Agro-Pastoral Ecotones in the Yellow River Basin, National Ethnic Affairs Commission of the People’s Republic of China, College of Biological Science & Engineering, North Minzu University, Yinchuan 750021, ChinaKey Laboratory of Ecological Protection of Agro-Pastoral Ecotones in the Yellow River Basin, National Ethnic Affairs Commission of the People’s Republic of China, College of Biological Science & Engineering, North Minzu University, Yinchuan 750021, ChinaState Key Laboratory of Efficient Production of Forest Resources, Ningxia Forestry Institute, Yinchuan 750001, ChinaInstitute for Advanced Study, Chengdu University, No. 2025 Chengluo Road, Chengdu 610106, ChinaThis study introduces a meticulously constructed genome assembly at the chromosome level for the Rosaceae family species <i>Prinsepia uniflora</i>, a traditional Chinese medicinal herb. The final assembly encompasses 1272.71 megabases (Mb) distributed across 16 pseudochromosomes, boasting contig and super-scaffold N50 values of 2.77 and 79.32 Mb, respectively. Annotated within this genome is a substantial 875.99 Mb of repetitive sequences, with transposable elements occupying 777.28 Mb, constituting 61.07% of the entire genome. Our predictive efforts identified 49,261 protein-coding genes within the repeat-masked assembly, with 45,256 (91.87%) having functional annotations, 5127 (10.41%) demonstrating tandem duplication, and 2373 (4.82%) classified as transcription factor genes. Additionally, our investigation unveiled 3080 non-coding RNAs spanning 0.51 Mb of the genome sequences. According to our evolutionary study, <i>P. uniflora</i> underwent recent whole-genome duplication following its separation from <i>Prunus salicina</i>. The presented reference-level genome assembly and annotation for <i>P. uniflora</i> will significantly facilitate the in-depth exploration of genomic information pertaining to this species, offering substantial utility in comparative genomics and evolutionary analyses involving Rosaceae species.https://www.mdpi.com/2073-4425/14/11/2035<i>Prinsepia uniflora</i>medicinal plantPacBio high-fidelity sequencingchromosome-level genome assemblygenome annotation
spellingShingle Lei Zhang
Chaopan Zhang
Yajing An
Qiang Zhu
Mingcheng Wang
A High-Quality Reference Genome Assembly of <i>Prinsepia uniflora</i> (Rosaceae)
Genes
<i>Prinsepia uniflora</i>
medicinal plant
PacBio high-fidelity sequencing
chromosome-level genome assembly
genome annotation
title A High-Quality Reference Genome Assembly of <i>Prinsepia uniflora</i> (Rosaceae)
title_full A High-Quality Reference Genome Assembly of <i>Prinsepia uniflora</i> (Rosaceae)
title_fullStr A High-Quality Reference Genome Assembly of <i>Prinsepia uniflora</i> (Rosaceae)
title_full_unstemmed A High-Quality Reference Genome Assembly of <i>Prinsepia uniflora</i> (Rosaceae)
title_short A High-Quality Reference Genome Assembly of <i>Prinsepia uniflora</i> (Rosaceae)
title_sort high quality reference genome assembly of i prinsepia uniflora i rosaceae
topic <i>Prinsepia uniflora</i>
medicinal plant
PacBio high-fidelity sequencing
chromosome-level genome assembly
genome annotation
url https://www.mdpi.com/2073-4425/14/11/2035
work_keys_str_mv AT leizhang ahighqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae
AT chaopanzhang ahighqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae
AT yajingan ahighqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae
AT qiangzhu ahighqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae
AT mingchengwang ahighqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae
AT leizhang highqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae
AT chaopanzhang highqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae
AT yajingan highqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae
AT qiangzhu highqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae
AT mingchengwang highqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae