Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players.

Pathways leading to formation of non-coding RNA and protein genes are varied and complex. We report finding a conserved repeat sequence present in human and chimpanzee genomes that appears to have originated from a common primate ancestor. This sequence is repeatedly copied in human chromosome 22 (c...

Full description

Bibliographic Details
Main Author: Nicholas Delihas
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0230236
_version_ 1818583757735591936
author Nicholas Delihas
author_facet Nicholas Delihas
author_sort Nicholas Delihas
collection DOAJ
description Pathways leading to formation of non-coding RNA and protein genes are varied and complex. We report finding a conserved repeat sequence present in human and chimpanzee genomes that appears to have originated from a common primate ancestor. This sequence is repeatedly copied in human chromosome 22 (chr22) low copy repeats (LCR22) or segmental duplications and forms twenty-one different genes, which include the human long intergenic non-coding RNA (lincRNA) family FAM230, a newly discovered lincRNA gene family termed conserved long intergenic non-coding RNAs (clincRNA), pseudogene families, as well as the gamma-glutamyltransferase (GGT) protein gene family and the RNA pseudogenes that originate from GGT sequences. Of particular interest are the GGT5 and USP18 protein genes that appear to have formed from an homologous repeat sequence that also forms the clincRNA gene family. The data point to ancestral DNA sequences, conserved through evolution and duplicated in humans by chromosomal repeat sequences that may serve as functional genomic elements in the development of diverse genes.
first_indexed 2024-12-16T08:10:21Z
format Article
id doaj.art-c7e325f6b73b4bf9a6a77a0b446c5445
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-16T08:10:21Z
publishDate 2020-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-c7e325f6b73b4bf9a6a77a0b446c54452022-12-21T22:38:21ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-01153e023023610.1371/journal.pone.0230236Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players.Nicholas DelihasPathways leading to formation of non-coding RNA and protein genes are varied and complex. We report finding a conserved repeat sequence present in human and chimpanzee genomes that appears to have originated from a common primate ancestor. This sequence is repeatedly copied in human chromosome 22 (chr22) low copy repeats (LCR22) or segmental duplications and forms twenty-one different genes, which include the human long intergenic non-coding RNA (lincRNA) family FAM230, a newly discovered lincRNA gene family termed conserved long intergenic non-coding RNAs (clincRNA), pseudogene families, as well as the gamma-glutamyltransferase (GGT) protein gene family and the RNA pseudogenes that originate from GGT sequences. Of particular interest are the GGT5 and USP18 protein genes that appear to have formed from an homologous repeat sequence that also forms the clincRNA gene family. The data point to ancestral DNA sequences, conserved through evolution and duplicated in humans by chromosomal repeat sequences that may serve as functional genomic elements in the development of diverse genes.https://doi.org/10.1371/journal.pone.0230236
spellingShingle Nicholas Delihas
Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players.
PLoS ONE
title Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players.
title_full Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players.
title_fullStr Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players.
title_full_unstemmed Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players.
title_short Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players.
title_sort formation of human long intergenic non coding rna genes pseudogenes and protein genes ancestral sequences are key players
url https://doi.org/10.1371/journal.pone.0230236
work_keys_str_mv AT nicholasdelihas formationofhumanlongintergenicnoncodingrnagenespseudogenesandproteingenesancestralsequencesarekeyplayers