Complete Chloroplast Genome of Pinus massoniana (Pinaceae): Gene Rearrangements, Loss of ndh Genes, and Short Inverted Repeats Contraction, Expansion

The chloroplast genome (CPG) of Pinus massoniana (genus Pinus, Pinaceae), a primary source of turpentine, was sequenced and analyzed with respect to gene rearrangements, loss of ndh genes, and the contraction and expansion of short inverted repeats (IRs). P. massoniana CPG has a ty...

Bibliographic Details
Main Authors: ZhouXian Ni, YouJu Ye, Tiandao Bai, Meng Xu, Li-An Xu
Format: Article
Language: English
Published: MDPI AG 2017-09-01
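Series: Molecules
Subjects: conifer species, genome annotation, structural inversion, comparative genomics, phylogenetic analysis
Online Access: https://www.mdpi.com/1420-3049/22/9/1528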
author ZhouXian Ni
YouJu Ye
Tiandao Bai
Meng Xu
Li-An Xu
collection DOAJ
description The chloroplast genome (CPG) of Pinus massoniana (genus Pinus, Pinaceae), a primary source of turpentine, was sequenced and analyzed with respect to gene rearrangements, loss of ndh genes, and the contraction and expansion of short inverted repeats (IRs). The P. massoniana CPG has a typical quadripartite structure comprising a large single-copy (LSC) region (65,563 bp), a small single-copy (SSC) region (53,230 bp), and two IRs (IRa and IRb, 485 bp). A total of 108 unique genes were identified, including 73 protein-coding genes, 31 tRNAs, and 4 rRNAs. Most of the 81 simple sequence repeats (SSRs) identified in the CPG were mononucleotide motifs of the A/T type located in non-coding regions. Comparisons with related species revealed a 21,556 bp inversion in the LSC region. The P. massoniana CPG retains none of the 11 ndh genes intact: four have been lost completely, five remain as truncated pseudogenes, and the other two are pseudogenes because of short insertions or deletions. A pair of short IRs was found instead of large IRs, and their size varies among pine species as a result of short insertions or deletions and non-synchronized variation between “IRa” and “IRb”. Phylogenetic analyses based on whole CPG sequences of 16 conifers indicated that whole CPG sequences are a powerful tool for phylogenetic analysis.
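The figures quoted in the description can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the published record: the dictionary and function names are illustrative, and it assumes that the 485 bp figure applies to each of the two IR copies. The region sum is only arithmetic on the values given in the abstract and may differ from the assembled genome length reported in the full article.

```python
# Values copied from the abstract; all names here are illustrative only.
# Assumption: the 485 bp figure applies to each IR copy (IRa and IRb).
REGION_BP = {"LSC": 65_563, "SSC": 53_230, "IRa": 485, "IRb": 485}
GENE_COUNTS = {"protein_coding": 73, "tRNA": 31, "rRNA": 4}
NDH_STATUS = {"lost": 4, "truncated_pseudogene": 5, "indel_pseudogene": 2}


def summarize():
    """Return simple totals derived from the abstract's figures."""
    region_sum = sum(REGION_BP.values())
    ir_total = REGION_BP["IRa"] + REGION_BP["IRb"]
    return {
        "region_sum_bp": region_sum,            # sum of the four stated regions
        "ir_fraction": ir_total / region_sum,   # share of that sum held by the IRs
        "unique_genes": sum(GENE_COUNTS.values()),
        "ndh_genes": sum(NDH_STATUS.values()),
    }


if __name__ == "__main__":
    for key, value in summarize().items():
        print(f"{key}: {value}")
```

Under these assumptions the gene categories add up to the 108 unique genes stated in the abstract, the three ndh categories add up to 11, and the two IRs together account for less than 1% of the summed region lengths, which illustrates how short they are compared with the single-copy regions.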
first_indexed 2024-12-23T13:34:06Z
format Article
id doaj.art-8ee8102739f54dd7950b171b6c68d23a
institution Directory of Open Access Journals
issn 1420-3049
language English
last_indexed 2024-12-23T13:34:06Z
publishDate 2017-09-01
publisher MDPI AG
record_format Article
series Molecules
spelling doaj.art-8ee8102739f54dd7950b171b6c68d23a
2022-12-21T17:45:04Z
eng
MDPI AG
Molecules, ISSN 1420-3049
2017-09-01, volume 22, issue 9, article 1528
DOI 10.3390/molecules22091528
Complete Chloroplast Genome of Pinus massoniana (Pinaceae): Gene Rearrangements, Loss of ndh Genes, and Short Inverted Repeats Contraction, Expansion
ZhouXian Ni, YouJu Ye, Tiandao Bai, Meng Xu, Li-An Xu
Affiliation (all five authors): Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing 210037, China
https://www.mdpi.com/1420-3049/22/9/1528
title Complete Chloroplast Genome of Pinus massoniana (Pinaceae): Gene Rearrangements, Loss of ndh Genes, and Short Inverted Repeats Contraction, Expansion
topic conifer species
genome annotation
structural inversion
comparative genomics
phylogenetic analysis
url https://www.mdpi.com/1420-3049/22/9/1528