cRegions—a tool for detecting conserved cis-elements in multiple sequence alignment of diverged coding sequences

Identifying cis-acting elements and understanding regulatory mechanisms of a gene is crucial to fully understand the molecular biology of an organism. In general, it is difficult to identify previously uncharacterised cis-acting elements with an unknown consensus sequence. The task is especially pro...

Full description

Bibliographic Details
Main Authors: Mikk Puustusmaa, Aare Abroi
Format: Article
Language:English
Published: PeerJ Inc. 2019-01-01
Series:PeerJ
Subjects:
Online Access:https://peerj.com/articles/6176.pdf
_version_ 1797420409685016576
author Mikk Puustusmaa
Aare Abroi
author_facet Mikk Puustusmaa
Aare Abroi
author_sort Mikk Puustusmaa
collection DOAJ
description Identifying cis-acting elements and understanding regulatory mechanisms of a gene is crucial to fully understand the molecular biology of an organism. In general, it is difficult to identify previously uncharacterised cis-acting elements with an unknown consensus sequence. The task is especially problematic with viruses containing regions of limited or no similarity to other previously characterised sequences. Fortunately, the fast increase in the number of sequenced genomes allows us to detect some of these elusive cis-elements. In this work, we introduce a web-based tool called cRegions. It was developed to identify regions within a protein-coding sequence where the conservation in the amino acid sequence is caused by the conservation in the nucleotide sequence. The cRegion can be the first step in discovering novel cis-acting sequences from diverged protein-coding genes. The results can be used as a basis for future experimental analysis. We applied cRegions on the non-structural and structural polyproteins of alphaviruses as an example and successfully detected all known cis-acting elements. In this publication and in previous work, we have shown that cRegions is able to detect a wide variety of functional elements in DNA and RNA viruses. These functional elements include splice sites, stem-loops, overlapping reading frames, internal promoters, ribosome frameshifting signals and other embedded elements with yet unknown function. The cRegions web tool is available at http://bioinfo.ut.ee/cRegions/.
first_indexed 2024-03-09T07:02:07Z
format Article
id doaj.art-505b850b304040adbddf47537a5d58e1
institution Directory Open Access Journal
issn 2167-8359
language English
last_indexed 2024-03-09T07:02:07Z
publishDate 2019-01-01
publisher PeerJ Inc.
record_format Article
series PeerJ
spelling doaj.art-505b850b304040adbddf47537a5d58e12023-12-03T09:50:10ZengPeerJ Inc.PeerJ2167-83592019-01-016e617610.7717/peerj.6176cRegions—a tool for detecting conserved cis-elements in multiple sequence alignment of diverged coding sequencesMikk Puustusmaa0Aare Abroi1Department of Bioinformatics, Institute of Molecular and Cell Biology, University of Tartu, Tartu, EstoniaInstitute of Technology, University of Tartu, Tartu, EstoniaIdentifying cis-acting elements and understanding regulatory mechanisms of a gene is crucial to fully understand the molecular biology of an organism. In general, it is difficult to identify previously uncharacterised cis-acting elements with an unknown consensus sequence. The task is especially problematic with viruses containing regions of limited or no similarity to other previously characterised sequences. Fortunately, the fast increase in the number of sequenced genomes allows us to detect some of these elusive cis-elements. In this work, we introduce a web-based tool called cRegions. It was developed to identify regions within a protein-coding sequence where the conservation in the amino acid sequence is caused by the conservation in the nucleotide sequence. The cRegion can be the first step in discovering novel cis-acting sequences from diverged protein-coding genes. The results can be used as a basis for future experimental analysis. We applied cRegions on the non-structural and structural polyproteins of alphaviruses as an example and successfully detected all known cis-acting elements. In this publication and in previous work, we have shown that cRegions is able to detect a wide variety of functional elements in DNA and RNA viruses. These functional elements include splice sites, stem-loops, overlapping reading frames, internal promoters, ribosome frameshifting signals and other embedded elements with yet unknown function. The cRegions web tool is available at http://bioinfo.ut.ee/cRegions/.https://peerj.com/articles/6176.pdfEmbedded functional elementCis-elementCodon usage biasAlphavirusCis-acting sequenceViruses
spellingShingle Mikk Puustusmaa
Aare Abroi
cRegions—a tool for detecting conserved cis-elements in multiple sequence alignment of diverged coding sequences
PeerJ
Embedded functional element
Cis-element
Codon usage bias
Alphavirus
Cis-acting sequence
Viruses
title cRegions—a tool for detecting conserved cis-elements in multiple sequence alignment of diverged coding sequences
title_full cRegions—a tool for detecting conserved cis-elements in multiple sequence alignment of diverged coding sequences
title_fullStr cRegions—a tool for detecting conserved cis-elements in multiple sequence alignment of diverged coding sequences
title_full_unstemmed cRegions—a tool for detecting conserved cis-elements in multiple sequence alignment of diverged coding sequences
title_short cRegions—a tool for detecting conserved cis-elements in multiple sequence alignment of diverged coding sequences
title_sort cregions a tool for detecting conserved cis elements in multiple sequence alignment of diverged coding sequences
topic Embedded functional element
Cis-element
Codon usage bias
Alphavirus
Cis-acting sequence
Viruses
url https://peerj.com/articles/6176.pdf
work_keys_str_mv AT mikkpuustusmaa cregionsatoolfordetectingconservedciselementsinmultiplesequencealignmentofdivergedcodingsequences
AT aareabroi cregionsatoolfordetectingconservedciselementsinmultiplesequencealignmentofdivergedcodingsequences