cRegions—a tool for detecting conserved cis-elements in multiple sequence alignment of diverged coding sequences
Identifying cis-acting elements and understanding regulatory mechanisms of a gene is crucial to fully understand the molecular biology of an organism. In general, it is difficult to identify previously uncharacterised cis-acting elements with an unknown consensus sequence. The task is especially pro...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
PeerJ Inc.
2019-01-01
|
Series: | PeerJ |
Subjects: | |
Online Access: | https://peerj.com/articles/6176.pdf |
_version_ | 1797420409685016576 |
---|---|
author | Mikk Puustusmaa Aare Abroi |
author_facet | Mikk Puustusmaa Aare Abroi |
author_sort | Mikk Puustusmaa |
collection | DOAJ |
description | Identifying cis-acting elements and understanding regulatory mechanisms of a gene is crucial to fully understand the molecular biology of an organism. In general, it is difficult to identify previously uncharacterised cis-acting elements with an unknown consensus sequence. The task is especially problematic with viruses containing regions of limited or no similarity to other previously characterised sequences. Fortunately, the fast increase in the number of sequenced genomes allows us to detect some of these elusive cis-elements. In this work, we introduce a web-based tool called cRegions. It was developed to identify regions within a protein-coding sequence where the conservation in the amino acid sequence is caused by the conservation in the nucleotide sequence. The cRegion can be the first step in discovering novel cis-acting sequences from diverged protein-coding genes. The results can be used as a basis for future experimental analysis. We applied cRegions on the non-structural and structural polyproteins of alphaviruses as an example and successfully detected all known cis-acting elements. In this publication and in previous work, we have shown that cRegions is able to detect a wide variety of functional elements in DNA and RNA viruses. These functional elements include splice sites, stem-loops, overlapping reading frames, internal promoters, ribosome frameshifting signals and other embedded elements with yet unknown function. The cRegions web tool is available at http://bioinfo.ut.ee/cRegions/. |
first_indexed | 2024-03-09T07:02:07Z |
format | Article |
id | doaj.art-505b850b304040adbddf47537a5d58e1 |
institution | Directory Open Access Journal |
issn | 2167-8359 |
language | English |
last_indexed | 2024-03-09T07:02:07Z |
publishDate | 2019-01-01 |
publisher | PeerJ Inc. |
record_format | Article |
series | PeerJ |
spelling | doaj.art-505b850b304040adbddf47537a5d58e12023-12-03T09:50:10ZengPeerJ Inc.PeerJ2167-83592019-01-016e617610.7717/peerj.6176cRegions—a tool for detecting conserved cis-elements in multiple sequence alignment of diverged coding sequencesMikk Puustusmaa0Aare Abroi1Department of Bioinformatics, Institute of Molecular and Cell Biology, University of Tartu, Tartu, EstoniaInstitute of Technology, University of Tartu, Tartu, EstoniaIdentifying cis-acting elements and understanding regulatory mechanisms of a gene is crucial to fully understand the molecular biology of an organism. In general, it is difficult to identify previously uncharacterised cis-acting elements with an unknown consensus sequence. The task is especially problematic with viruses containing regions of limited or no similarity to other previously characterised sequences. Fortunately, the fast increase in the number of sequenced genomes allows us to detect some of these elusive cis-elements. In this work, we introduce a web-based tool called cRegions. It was developed to identify regions within a protein-coding sequence where the conservation in the amino acid sequence is caused by the conservation in the nucleotide sequence. The cRegion can be the first step in discovering novel cis-acting sequences from diverged protein-coding genes. The results can be used as a basis for future experimental analysis. We applied cRegions on the non-structural and structural polyproteins of alphaviruses as an example and successfully detected all known cis-acting elements. In this publication and in previous work, we have shown that cRegions is able to detect a wide variety of functional elements in DNA and RNA viruses. These functional elements include splice sites, stem-loops, overlapping reading frames, internal promoters, ribosome frameshifting signals and other embedded elements with yet unknown function. The cRegions web tool is available at http://bioinfo.ut.ee/cRegions/.https://peerj.com/articles/6176.pdfEmbedded functional elementCis-elementCodon usage biasAlphavirusCis-acting sequenceViruses |
spellingShingle | Mikk Puustusmaa Aare Abroi cRegions—a tool for detecting conserved cis-elements in multiple sequence alignment of diverged coding sequences PeerJ Embedded functional element Cis-element Codon usage bias Alphavirus Cis-acting sequence Viruses |
title | cRegions—a tool for detecting conserved cis-elements in multiple sequence alignment of diverged coding sequences |
title_full | cRegions—a tool for detecting conserved cis-elements in multiple sequence alignment of diverged coding sequences |
title_fullStr | cRegions—a tool for detecting conserved cis-elements in multiple sequence alignment of diverged coding sequences |
title_full_unstemmed | cRegions—a tool for detecting conserved cis-elements in multiple sequence alignment of diverged coding sequences |
title_short | cRegions—a tool for detecting conserved cis-elements in multiple sequence alignment of diverged coding sequences |
title_sort | cregions a tool for detecting conserved cis elements in multiple sequence alignment of diverged coding sequences |
topic | Embedded functional element Cis-element Codon usage bias Alphavirus Cis-acting sequence Viruses |
url | https://peerj.com/articles/6176.pdf |
work_keys_str_mv | AT mikkpuustusmaa cregionsatoolfordetectingconservedciselementsinmultiplesequencealignmentofdivergedcodingsequences AT aareabroi cregionsatoolfordetectingconservedciselementsinmultiplesequencealignmentofdivergedcodingsequences |