A comparative method for finding and folding RNA secondary structures within protein-coding regions.

Existing computational methods for RNA secondary-structure prediction tacitly assume RNA to only encode functional RNA structures. However, experimental studies have revealed that some RNA sequences, e.g. compact viral genomes, can simultaneously encode functional RNA structures as well as proteins,...

Full description

Bibliographic Details
Main Authors: Pedersen, J, Meyer, I, Forsberg, R, Simmonds, P, Hein, J
Format: Journal article
Language:English
Published: 2004
_version_ 1797069315411804160
author Pedersen, J
Meyer, I
Forsberg, R
Simmonds, P
Hein, J
author_facet Pedersen, J
Meyer, I
Forsberg, R
Simmonds, P
Hein, J
author_sort Pedersen, J
collection OXFORD
description Existing computational methods for RNA secondary-structure prediction tacitly assume RNA to only encode functional RNA structures. However, experimental studies have revealed that some RNA sequences, e.g. compact viral genomes, can simultaneously encode functional RNA structures as well as proteins, and evidence is accumulating that this phenomenon may also be found in Eukaryotes. We here present the first comparative method, called RNA-DECODER, which explicitly takes the known protein-coding context of an RNA-sequence alignment into account in order to predict evolutionarily conserved secondary-structure elements, which may span both coding and non-coding regions. RNA-DECODER employs a stochastic context-free grammar together with a set of carefully devised phylogenetic substitution-models, which can disentangle and evaluate the different kinds of overlapping evolutionary constraints which arise. We show that RNA-DECODER's parameters can be automatically trained to successfully fold known secondary structures within the HCV genome. We scan the genomes of HCV and polio virus for conserved secondary-structure elements, and analyze performance as a function of available evolutionary information. On known secondary structures, RNA-DECODER shows a sensitivity similar to the programs MFOLD, PFOLD and RNAALIFOLD. When scanning the entire genomes of HCV and polio virus for structure elements, RNA-DECODER's results indicate a markedly higher specificity than MFOLD, PFOLD and RNAALIFOLD.
first_indexed 2024-03-06T22:22:37Z
format Journal article
id oxford-uuid:5594557f-4b57-4264-a5a8-798c2511aa02
institution University of Oxford
language English
last_indexed 2024-03-06T22:22:37Z
publishDate 2004
record_format dspace
spelling oxford-uuid:5594557f-4b57-4264-a5a8-798c2511aa022022-03-26T16:44:48ZA comparative method for finding and folding RNA secondary structures within protein-coding regions.Journal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:5594557f-4b57-4264-a5a8-798c2511aa02EnglishSymplectic Elements at Oxford2004Pedersen, JMeyer, IForsberg, RSimmonds, PHein, JExisting computational methods for RNA secondary-structure prediction tacitly assume RNA to only encode functional RNA structures. However, experimental studies have revealed that some RNA sequences, e.g. compact viral genomes, can simultaneously encode functional RNA structures as well as proteins, and evidence is accumulating that this phenomenon may also be found in Eukaryotes. We here present the first comparative method, called RNA-DECODER, which explicitly takes the known protein-coding context of an RNA-sequence alignment into account in order to predict evolutionarily conserved secondary-structure elements, which may span both coding and non-coding regions. RNA-DECODER employs a stochastic context-free grammar together with a set of carefully devised phylogenetic substitution-models, which can disentangle and evaluate the different kinds of overlapping evolutionary constraints which arise. We show that RNA-DECODER's parameters can be automatically trained to successfully fold known secondary structures within the HCV genome. We scan the genomes of HCV and polio virus for conserved secondary-structure elements, and analyze performance as a function of available evolutionary information. On known secondary structures, RNA-DECODER shows a sensitivity similar to the programs MFOLD, PFOLD and RNAALIFOLD. When scanning the entire genomes of HCV and polio virus for structure elements, RNA-DECODER's results indicate a markedly higher specificity than MFOLD, PFOLD and RNAALIFOLD.
spellingShingle Pedersen, J
Meyer, I
Forsberg, R
Simmonds, P
Hein, J
A comparative method for finding and folding RNA secondary structures within protein-coding regions.
title A comparative method for finding and folding RNA secondary structures within protein-coding regions.
title_full A comparative method for finding and folding RNA secondary structures within protein-coding regions.
title_fullStr A comparative method for finding and folding RNA secondary structures within protein-coding regions.
title_full_unstemmed A comparative method for finding and folding RNA secondary structures within protein-coding regions.
title_short A comparative method for finding and folding RNA secondary structures within protein-coding regions.
title_sort comparative method for finding and folding rna secondary structures within protein coding regions
work_keys_str_mv AT pedersenj acomparativemethodforfindingandfoldingrnasecondarystructureswithinproteincodingregions
AT meyeri acomparativemethodforfindingandfoldingrnasecondarystructureswithinproteincodingregions
AT forsbergr acomparativemethodforfindingandfoldingrnasecondarystructureswithinproteincodingregions
AT simmondsp acomparativemethodforfindingandfoldingrnasecondarystructureswithinproteincodingregions
AT heinj acomparativemethodforfindingandfoldingrnasecondarystructureswithinproteincodingregions
AT pedersenj comparativemethodforfindingandfoldingrnasecondarystructureswithinproteincodingregions
AT meyeri comparativemethodforfindingandfoldingrnasecondarystructureswithinproteincodingregions
AT forsbergr comparativemethodforfindingandfoldingrnasecondarystructureswithinproteincodingregions
AT simmondsp comparativemethodforfindingandfoldingrnasecondarystructureswithinproteincodingregions
AT heinj comparativemethodforfindingandfoldingrnasecondarystructureswithinproteincodingregions