The sequence structures of human microRNA molecules and their implications.

The count of the nucleotides in a cloned, short genomic sequence has become an important criterion to annotate such a sequence as a miRNA molecule. While the majority of human mature miRNA sequences consist of 22 nucleotides, there exists discrepancy in the characteristic lengths of the miRNA sequen...


Bibliographic Details
Main Authors: Zhide Fang, Ruofei Du, Andrea Edwards, Erik K Flemington, Kun Zhang
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2013-01-01
Series: PLoS ONE
Online Access: http://europepmc.org/articles/PMC3548844?pdf=render
collection DOAJ
description The count of the nucleotides in a cloned, short genomic sequence has become an important criterion for annotating such a sequence as a miRNA molecule. While the majority of human mature miRNA sequences consist of 22 nucleotides, there are discrepancies in the characteristic lengths of the miRNA sequences. There is also a lack of systematic studies on this length distribution and on the biological factors that are related to or may affect this length. In this paper, we intend to fill this gap by investigating the sequence structure of human miRNA molecules using statistical tools. We demonstrate that the traditional discrete probability distributions do not model the length distribution of the human mature miRNAs well, and we obtain a statistical distribution model with a decent fit. We observe that the four nucleotide bases in a miRNA sequence are not randomly distributed, implying that structural patterns such as dinucleotide (or trinucleotide or higher-order) patterns may exist. Furthermore, we study the relationships of this length distribution to multiple important factors such as evolutionary conservation, tumorigenesis, the length of precursor loop structures, and the number of predicted targets. The association between the miRNA sequence length and the distributions of target site counts in corresponding predicted genes is also presented. This study results in several novel findings worthy of further investigation, including: (1) rapid evolution introduces variation to the miRNA sequence length distribution; (2) miRNAs with extreme sequence lengths are unlikely to be cancer-related; and (3) the miRNA sequence length is positively correlated with the precursor length and the number of predicted target genes.
id doaj.art-c09b2b5b53184c99933f9e0343396082
institution Directory Open Access Journal
issn 1932-6203
doi 10.1371/journal.pone.0054215
citation PLoS ONE 8(1): e54215 (2013-01-01)