A High-Quality Reference Genome Assembly of <i>Prinsepia uniflora</i> (Rosaceae)
This study introduces a meticulously constructed genome assembly at the chromosome level for the Rosaceae family species <i>Prinsepia uniflora</i>, a traditional Chinese medicinal herb. The final assembly encompasses 1272.71 megabases (Mb) distributed across 16 pseudochromosomes, boastin...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-11-01
|
Series: | Genes |
Subjects: | |
Online Access: | https://www.mdpi.com/2073-4425/14/11/2035 |
_version_ | 1797459207116554240 |
---|---|
author | Lei Zhang Chaopan Zhang Yajing An Qiang Zhu Mingcheng Wang |
author_facet | Lei Zhang Chaopan Zhang Yajing An Qiang Zhu Mingcheng Wang |
author_sort | Lei Zhang |
collection | DOAJ |
description | This study introduces a meticulously constructed genome assembly at the chromosome level for the Rosaceae family species <i>Prinsepia uniflora</i>, a traditional Chinese medicinal herb. The final assembly encompasses 1272.71 megabases (Mb) distributed across 16 pseudochromosomes, boasting contig and super-scaffold N50 values of 2.77 and 79.32 Mb, respectively. Annotated within this genome is a substantial 875.99 Mb of repetitive sequences, with transposable elements occupying 777.28 Mb, constituting 61.07% of the entire genome. Our predictive efforts identified 49,261 protein-coding genes within the repeat-masked assembly, with 45,256 (91.87%) having functional annotations, 5127 (10.41%) demonstrating tandem duplication, and 2373 (4.82%) classified as transcription factor genes. Additionally, our investigation unveiled 3080 non-coding RNAs spanning 0.51 Mb of the genome sequences. According to our evolutionary study, <i>P. uniflora</i> underwent recent whole-genome duplication following its separation from <i>Prunus salicina</i>. The presented reference-level genome assembly and annotation for <i>P. uniflora</i> will significantly facilitate the in-depth exploration of genomic information pertaining to this species, offering substantial utility in comparative genomics and evolutionary analyses involving Rosaceae species. |
first_indexed | 2024-03-09T16:48:05Z |
format | Article |
id | doaj.art-9e12fced87e9446281d0afa0bfc7ed7e |
institution | Directory Open Access Journal |
issn | 2073-4425 |
language | English |
last_indexed | 2024-03-09T16:48:05Z |
publishDate | 2023-11-01 |
publisher | MDPI AG |
record_format | Article |
series | Genes |
spelling | doaj.art-9e12fced87e9446281d0afa0bfc7ed7e2023-11-24T14:43:52ZengMDPI AGGenes2073-44252023-11-011411203510.3390/genes14112035A High-Quality Reference Genome Assembly of <i>Prinsepia uniflora</i> (Rosaceae)Lei Zhang0Chaopan Zhang1Yajing An2Qiang Zhu3Mingcheng Wang4Key Laboratory of Ecological Protection of Agro-Pastoral Ecotones in the Yellow River Basin, National Ethnic Affairs Commission of the People’s Republic of China, College of Biological Science & Engineering, North Minzu University, Yinchuan 750021, ChinaKey Laboratory of Ecological Protection of Agro-Pastoral Ecotones in the Yellow River Basin, National Ethnic Affairs Commission of the People’s Republic of China, College of Biological Science & Engineering, North Minzu University, Yinchuan 750021, ChinaKey Laboratory of Ecological Protection of Agro-Pastoral Ecotones in the Yellow River Basin, National Ethnic Affairs Commission of the People’s Republic of China, College of Biological Science & Engineering, North Minzu University, Yinchuan 750021, ChinaState Key Laboratory of Efficient Production of Forest Resources, Ningxia Forestry Institute, Yinchuan 750001, ChinaInstitute for Advanced Study, Chengdu University, No. 2025 Chengluo Road, Chengdu 610106, ChinaThis study introduces a meticulously constructed genome assembly at the chromosome level for the Rosaceae family species <i>Prinsepia uniflora</i>, a traditional Chinese medicinal herb. The final assembly encompasses 1272.71 megabases (Mb) distributed across 16 pseudochromosomes, boasting contig and super-scaffold N50 values of 2.77 and 79.32 Mb, respectively. Annotated within this genome is a substantial 875.99 Mb of repetitive sequences, with transposable elements occupying 777.28 Mb, constituting 61.07% of the entire genome. Our predictive efforts identified 49,261 protein-coding genes within the repeat-masked assembly, with 45,256 (91.87%) having functional annotations, 5127 (10.41%) demonstrating tandem duplication, and 2373 (4.82%) classified as transcription factor genes. Additionally, our investigation unveiled 3080 non-coding RNAs spanning 0.51 Mb of the genome sequences. According to our evolutionary study, <i>P. uniflora</i> underwent recent whole-genome duplication following its separation from <i>Prunus salicina</i>. The presented reference-level genome assembly and annotation for <i>P. uniflora</i> will significantly facilitate the in-depth exploration of genomic information pertaining to this species, offering substantial utility in comparative genomics and evolutionary analyses involving Rosaceae species.https://www.mdpi.com/2073-4425/14/11/2035<i>Prinsepia uniflora</i>medicinal plantPacBio high-fidelity sequencingchromosome-level genome assemblygenome annotation |
spellingShingle | Lei Zhang Chaopan Zhang Yajing An Qiang Zhu Mingcheng Wang A High-Quality Reference Genome Assembly of <i>Prinsepia uniflora</i> (Rosaceae) Genes <i>Prinsepia uniflora</i> medicinal plant PacBio high-fidelity sequencing chromosome-level genome assembly genome annotation |
title | A High-Quality Reference Genome Assembly of <i>Prinsepia uniflora</i> (Rosaceae) |
title_full | A High-Quality Reference Genome Assembly of <i>Prinsepia uniflora</i> (Rosaceae) |
title_fullStr | A High-Quality Reference Genome Assembly of <i>Prinsepia uniflora</i> (Rosaceae) |
title_full_unstemmed | A High-Quality Reference Genome Assembly of <i>Prinsepia uniflora</i> (Rosaceae) |
title_short | A High-Quality Reference Genome Assembly of <i>Prinsepia uniflora</i> (Rosaceae) |
title_sort | high quality reference genome assembly of i prinsepia uniflora i rosaceae |
topic | <i>Prinsepia uniflora</i> medicinal plant PacBio high-fidelity sequencing chromosome-level genome assembly genome annotation |
url | https://www.mdpi.com/2073-4425/14/11/2035 |
work_keys_str_mv | AT leizhang ahighqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae AT chaopanzhang ahighqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae AT yajingan ahighqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae AT qiangzhu ahighqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae AT mingchengwang ahighqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae AT leizhang highqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae AT chaopanzhang highqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae AT yajingan highqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae AT qiangzhu highqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae AT mingchengwang highqualityreferencegenomeassemblyofiprinsepiauniflorairosaceae |