Evolutionary analyses of base-pairing interactions in DNA and RNA secondary structures
Pairs of nucleotides within functional nucleic acid secondary structures often display evidence of coevolution that is consistent with the maintenance of base-pairing. Here we introduce a sequence evolution model, MESSI, that infers coevolution associated with base-paired sites in DNA or RNA sequenc...
Main Authors: | , , , , |
---|---|
Format: | Working paper |
Published: |
2019
|
_version_ | 1797059061248688128 |
---|---|
author | Golden, M Murrell, B Pybus, O Martin, D Hein, J |
author_facet | Golden, M Murrell, B Pybus, O Martin, D Hein, J |
author_sort | Golden, M |
collection | OXFORD |
description | Pairs of nucleotides within functional nucleic acid secondary structures often display evidence of coevolution that is consistent with the maintenance of base-pairing. Here we introduce a sequence evolution model, MESSI, that infers coevolution associated with base-paired sites in DNA or RNA sequence alignments. MESSI can estimate coevolution whilst accounting for an unknown secondary structure. MESSI can also use GPU parallelism to increase computational speed. We used MESSI to infer coevolution associated with GC, AU (AT in DNA), GU (GT in DNA) pairs in non-coding RNA alignments, and in single-stranded RNA and DNA virus alignments. Estimates of GU pair coevolution were found to be higher at base-paired sites in single-stranded RNA viruses and non-coding RNAs than estimates of GT pair coevolution in single-stranded DNA viruses, suggesting that GT pairs do not stabilise DNA secondary structures to the same extent that GU pairs do in RNA. Additionally, MESSI estimates the degrees of coevolution at individual base-paired sites in an alignment. These estimates were computed for a SHAPE-MaP-determined HIV-1 NL4-3 RNA secondary structure and two corresponding alignments. We found that estimates of coevolution were more strongly correlated with experimentally-determined SHAPE-MaP pairing scores than three non-evolutionary measures of base-pairing covariation. To assist researchers in prioritising substructures with potential functionality, MESSI automatically ranks substructures by degrees of coevolution at base-paired sites within them. Such a ranking was created for an HIV-1 subtype B alignment, revealing an excess of top-ranking substructures that have been previously identified as having structure-related functional importance, amongst several uncharacterised top-ranking substructures. |
first_indexed | 2024-03-06T19:58:55Z |
format | Working paper |
id | oxford-uuid:26994364-c364-414f-a14f-9318a7534698 |
institution | University of Oxford |
last_indexed | 2024-03-06T19:58:55Z |
publishDate | 2019 |
record_format | dspace |
spelling | oxford-uuid:26994364-c364-414f-a14f-9318a75346982022-03-26T12:01:59ZEvolutionary analyses of base-pairing interactions in DNA and RNA secondary structuresWorking paperhttp://purl.org/coar/resource_type/c_8042uuid:26994364-c364-414f-a14f-9318a7534698Symplectic Elements at Oxford2019Golden, MMurrell, BPybus, OMartin, DHein, JPairs of nucleotides within functional nucleic acid secondary structures often display evidence of coevolution that is consistent with the maintenance of base-pairing. Here we introduce a sequence evolution model, MESSI, that infers coevolution associated with base-paired sites in DNA or RNA sequence alignments. MESSI can estimate coevolution whilst accounting for an unknown secondary structure. MESSI can also use GPU parallelism to increase computational speed. We used MESSI to infer coevolution associated with GC, AU (AT in DNA), GU (GT in DNA) pairs in non-coding RNA alignments, and in single-stranded RNA and DNA virus alignments. Estimates of GU pair coevolution were found to be higher at base-paired sites in single-stranded RNA viruses and non-coding RNAs than estimates of GT pair coevolution in single-stranded DNA viruses, suggesting that GT pairs do not stabilise DNA secondary structures to the same extent that GU pairs do in RNA. Additionally, MESSI estimates the degrees of coevolution at individual base-paired sites in an alignment. These estimates were computed for a SHAPE-MaP-determined HIV-1 NL4-3 RNA secondary structure and two corresponding alignments. We found that estimates of coevolution were more strongly correlated with experimentally-determined SHAPE-MaP pairing scores than three non-evolutionary measures of base-pairing covariation. To assist researchers in prioritising substructures with potential functionality, MESSI automatically ranks substructures by degrees of coevolution at base-paired sites within them. Such a ranking was created for an HIV-1 subtype B alignment, revealing an excess of top-ranking substructures that have been previously identified as having structure-related functional importance, amongst several uncharacterised top-ranking substructures. |
spellingShingle | Golden, M Murrell, B Pybus, O Martin, D Hein, J Evolutionary analyses of base-pairing interactions in DNA and RNA secondary structures |
title | Evolutionary analyses of base-pairing interactions in DNA and RNA secondary structures |
title_full | Evolutionary analyses of base-pairing interactions in DNA and RNA secondary structures |
title_fullStr | Evolutionary analyses of base-pairing interactions in DNA and RNA secondary structures |
title_full_unstemmed | Evolutionary analyses of base-pairing interactions in DNA and RNA secondary structures |
title_short | Evolutionary analyses of base-pairing interactions in DNA and RNA secondary structures |
title_sort | evolutionary analyses of base pairing interactions in dna and rna secondary structures |
work_keys_str_mv | AT goldenm evolutionaryanalysesofbasepairinginteractionsindnaandrnasecondarystructures AT murrellb evolutionaryanalysesofbasepairinginteractionsindnaandrnasecondarystructures AT pybuso evolutionaryanalysesofbasepairinginteractionsindnaandrnasecondarystructures AT martind evolutionaryanalysesofbasepairinginteractionsindnaandrnasecondarystructures AT heinj evolutionaryanalysesofbasepairinginteractionsindnaandrnasecondarystructures |