Complete Chloroplast Genome of Pinus massoniana (Pinaceae): Gene Rearrangements, Loss of ndh Genes, and Short Inverted Repeats Contraction, Expansion
The chloroplast genome (CPG) of Pinus massoniana belonging to the genus Pinus (Pinaceae), which is a primary source of turpentine, was sequenced and analyzed in terms of gene rearrangements, ndh genes loss, and the contraction and expansion of short inverted repeats (IRs). P. massoniana CPG has a ty...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2017-09-01
|
Series: | Molecules |
Subjects: | |
Online Access: | https://www.mdpi.com/1420-3049/22/9/1528 |
_version_ | 1819238305091289088 |
---|---|
author | ZhouXian Ni YouJu Ye Tiandao Bai Meng Xu Li-An Xu |
author_facet | ZhouXian Ni YouJu Ye Tiandao Bai Meng Xu Li-An Xu |
author_sort | ZhouXian Ni |
collection | DOAJ |
description | The chloroplast genome (CPG) of Pinus massoniana belonging to the genus Pinus (Pinaceae), which is a primary source of turpentine, was sequenced and analyzed in terms of gene rearrangements, ndh genes loss, and the contraction and expansion of short inverted repeats (IRs). P. massoniana CPG has a typical quadripartite structure that includes large single copy (LSC) (65,563 bp), small single copy (SSC) (53,230 bp) and two IRs (IRa and IRb, 485 bp). The 108 unique genes were identified, including 73 protein-coding genes, 31 tRNAs, and 4 rRNAs. Most of the 81 simple sequence repeats (SSRs) identified in CPG were mononucleotides motifs of A/T types and located in non-coding regions. Comparisons with related species revealed an inversion (21,556 bp) in the LSC region; P. massoniana CPG lacks all 11 intact ndh genes (four ndh genes lost completely; the five remained truncated as pseudogenes; and the other two ndh genes remain as pseudogenes because of short insertions or deletions). A pair of short IRs was found instead of large IRs, and size variations among pine species were observed, which resulted from short insertions or deletions and non-synchronized variations between “IRa” and “IRb”. The results of phylogenetic analyses based on whole CPG sequences of 16 conifers indicated that the whole CPG sequences could be used as a powerful tool in phylogenetic analyses. |
first_indexed | 2024-12-23T13:34:06Z |
format | Article |
id | doaj.art-8ee8102739f54dd7950b171b6c68d23a |
institution | Directory Open Access Journal |
issn | 1420-3049 |
language | English |
last_indexed | 2024-12-23T13:34:06Z |
publishDate | 2017-09-01 |
publisher | MDPI AG |
record_format | Article |
series | Molecules |
spelling | doaj.art-8ee8102739f54dd7950b171b6c68d23a2022-12-21T17:45:04ZengMDPI AGMolecules1420-30492017-09-01229152810.3390/molecules22091528molecules22091528Complete Chloroplast Genome of Pinus massoniana (Pinaceae): Gene Rearrangements, Loss of ndh Genes, and Short Inverted Repeats Contraction, ExpansionZhouXian Ni0YouJu Ye1Tiandao Bai2Meng Xu3Li-An Xu4Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing 210037, ChinaCo-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing 210037, ChinaCo-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing 210037, ChinaCo-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing 210037, ChinaCo-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing 210037, ChinaThe chloroplast genome (CPG) of Pinus massoniana belonging to the genus Pinus (Pinaceae), which is a primary source of turpentine, was sequenced and analyzed in terms of gene rearrangements, ndh genes loss, and the contraction and expansion of short inverted repeats (IRs). P. massoniana CPG has a typical quadripartite structure that includes large single copy (LSC) (65,563 bp), small single copy (SSC) (53,230 bp) and two IRs (IRa and IRb, 485 bp). The 108 unique genes were identified, including 73 protein-coding genes, 31 tRNAs, and 4 rRNAs. Most of the 81 simple sequence repeats (SSRs) identified in CPG were mononucleotides motifs of A/T types and located in non-coding regions. Comparisons with related species revealed an inversion (21,556 bp) in the LSC region; P. massoniana CPG lacks all 11 intact ndh genes (four ndh genes lost completely; the five remained truncated as pseudogenes; and the other two ndh genes remain as pseudogenes because of short insertions or deletions). A pair of short IRs was found instead of large IRs, and size variations among pine species were observed, which resulted from short insertions or deletions and non-synchronized variations between “IRa” and “IRb”. The results of phylogenetic analyses based on whole CPG sequences of 16 conifers indicated that the whole CPG sequences could be used as a powerful tool in phylogenetic analyses.https://www.mdpi.com/1420-3049/22/9/1528conifer speciesgenome annotationstructural inversioncomparative genomicsphylogenetic analysis |
spellingShingle | ZhouXian Ni YouJu Ye Tiandao Bai Meng Xu Li-An Xu Complete Chloroplast Genome of Pinus massoniana (Pinaceae): Gene Rearrangements, Loss of ndh Genes, and Short Inverted Repeats Contraction, Expansion Molecules conifer species genome annotation structural inversion comparative genomics phylogenetic analysis |
title | Complete Chloroplast Genome of Pinus massoniana (Pinaceae): Gene Rearrangements, Loss of ndh Genes, and Short Inverted Repeats Contraction, Expansion |
title_full | Complete Chloroplast Genome of Pinus massoniana (Pinaceae): Gene Rearrangements, Loss of ndh Genes, and Short Inverted Repeats Contraction, Expansion |
title_fullStr | Complete Chloroplast Genome of Pinus massoniana (Pinaceae): Gene Rearrangements, Loss of ndh Genes, and Short Inverted Repeats Contraction, Expansion |
title_full_unstemmed | Complete Chloroplast Genome of Pinus massoniana (Pinaceae): Gene Rearrangements, Loss of ndh Genes, and Short Inverted Repeats Contraction, Expansion |
title_short | Complete Chloroplast Genome of Pinus massoniana (Pinaceae): Gene Rearrangements, Loss of ndh Genes, and Short Inverted Repeats Contraction, Expansion |
title_sort | complete chloroplast genome of pinus massoniana pinaceae gene rearrangements loss of ndh genes and short inverted repeats contraction expansion |
topic | conifer species genome annotation structural inversion comparative genomics phylogenetic analysis |
url | https://www.mdpi.com/1420-3049/22/9/1528 |
work_keys_str_mv | AT zhouxianni completechloroplastgenomeofpinusmassonianapinaceaegenerearrangementslossofndhgenesandshortinvertedrepeatscontractionexpansion AT youjuye completechloroplastgenomeofpinusmassonianapinaceaegenerearrangementslossofndhgenesandshortinvertedrepeatscontractionexpansion AT tiandaobai completechloroplastgenomeofpinusmassonianapinaceaegenerearrangementslossofndhgenesandshortinvertedrepeatscontractionexpansion AT mengxu completechloroplastgenomeofpinusmassonianapinaceaegenerearrangementslossofndhgenesandshortinvertedrepeatscontractionexpansion AT lianxu completechloroplastgenomeofpinusmassonianapinaceaegenerearrangementslossofndhgenesandshortinvertedrepeatscontractionexpansion |