Computational identification and functional predictions of long noncoding RNA in Zea mays.

BACKGROUND: Computational analysis of cDNA sequences from multiple organisms suggests that a large portion of transcribed DNA does not code for a functional protein. In mammals, noncoding transcription is abundant, and often results in functional RNA molecules that do not appear to encode proteins....

Full description

Bibliographic Details
Main Authors: Susan Boerner, Karen M McGinnis
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2012-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3420876?pdf=render
_version_ 1818943648316784640
author Susan Boerner
Karen M McGinnis
author_facet Susan Boerner
Karen M McGinnis
author_sort Susan Boerner
collection DOAJ
description BACKGROUND: Computational analysis of cDNA sequences from multiple organisms suggests that a large portion of transcribed DNA does not code for a functional protein. In mammals, noncoding transcription is abundant, and often results in functional RNA molecules that do not appear to encode proteins. Many long noncoding RNAs (lncRNAs) appear to have epigenetic regulatory function in humans, including HOTAIR and XIST. While epigenetic gene regulation is clearly an essential mechanism in plants, relatively little is known about the presence or function of lncRNAs in plants. METHODOLOGY/PRINCIPAL FINDINGS: To explore the connection between lncRNA and epigenetic regulation of gene expression in plants, a computational pipeline using the programming language Python has been developed and applied to maize full length cDNA sequences to identify, classify, and localize potential lncRNAs. The pipeline was used in parallel with an SVM tool for identifying ncRNAs to identify the maximal number of ncRNAs in the dataset. Although the available library of sequences was small and potentially biased toward protein coding transcripts, 15% of the sequences were predicted to be noncoding. Approximately 60% of these sequences appear to act as precursors for small RNA molecules and may function to regulate gene expression via a small RNA dependent mechanism. ncRNAs were predicted to originate from both genic and intergenic loci. Of the lncRNAs that originated from genic loci, ∼20% were antisense to the host gene loci. CONCLUSIONS/SIGNIFICANCE: Consistent with similar studies in other organisms, noncoding transcription appears to be widespread in the maize genome. Computational predictions indicate that maize lncRNAs may function to regulate expression of other genes through multiple RNA mediated mechanisms.
first_indexed 2024-12-20T07:30:40Z
format Article
id doaj.art-fbcfec5b05e04a50802d3e5dc7d2646b
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-20T07:30:40Z
publishDate 2012-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-fbcfec5b05e04a50802d3e5dc7d2646b2022-12-21T19:48:26ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-0178e4304710.1371/journal.pone.0043047Computational identification and functional predictions of long noncoding RNA in Zea mays.Susan BoernerKaren M McGinnisBACKGROUND: Computational analysis of cDNA sequences from multiple organisms suggests that a large portion of transcribed DNA does not code for a functional protein. In mammals, noncoding transcription is abundant, and often results in functional RNA molecules that do not appear to encode proteins. Many long noncoding RNAs (lncRNAs) appear to have epigenetic regulatory function in humans, including HOTAIR and XIST. While epigenetic gene regulation is clearly an essential mechanism in plants, relatively little is known about the presence or function of lncRNAs in plants. METHODOLOGY/PRINCIPAL FINDINGS: To explore the connection between lncRNA and epigenetic regulation of gene expression in plants, a computational pipeline using the programming language Python has been developed and applied to maize full length cDNA sequences to identify, classify, and localize potential lncRNAs. The pipeline was used in parallel with an SVM tool for identifying ncRNAs to identify the maximal number of ncRNAs in the dataset. Although the available library of sequences was small and potentially biased toward protein coding transcripts, 15% of the sequences were predicted to be noncoding. Approximately 60% of these sequences appear to act as precursors for small RNA molecules and may function to regulate gene expression via a small RNA dependent mechanism. ncRNAs were predicted to originate from both genic and intergenic loci. Of the lncRNAs that originated from genic loci, ∼20% were antisense to the host gene loci. CONCLUSIONS/SIGNIFICANCE: Consistent with similar studies in other organisms, noncoding transcription appears to be widespread in the maize genome. Computational predictions indicate that maize lncRNAs may function to regulate expression of other genes through multiple RNA mediated mechanisms.http://europepmc.org/articles/PMC3420876?pdf=render
spellingShingle Susan Boerner
Karen M McGinnis
Computational identification and functional predictions of long noncoding RNA in Zea mays.
PLoS ONE
title Computational identification and functional predictions of long noncoding RNA in Zea mays.
title_full Computational identification and functional predictions of long noncoding RNA in Zea mays.
title_fullStr Computational identification and functional predictions of long noncoding RNA in Zea mays.
title_full_unstemmed Computational identification and functional predictions of long noncoding RNA in Zea mays.
title_short Computational identification and functional predictions of long noncoding RNA in Zea mays.
title_sort computational identification and functional predictions of long noncoding rna in zea mays
url http://europepmc.org/articles/PMC3420876?pdf=render
work_keys_str_mv AT susanboerner computationalidentificationandfunctionalpredictionsoflongnoncodingrnainzeamays
AT karenmmcginnis computationalidentificationandfunctionalpredictionsoflongnoncodingrnainzeamays