Evolutionary analyses of base-pairing interactions in DNA and RNA secondary structures

Pairs of nucleotides within functional nucleic acid secondary structures often display evidence of coevolution that is consistent with the maintenance of base-pairing. Here we introduce a sequence evolution model, MESSI, that infers coevolution associated with base-paired sites in DNA or RNA sequenc...

Full description

Bibliographic Details
Main Authors: Golden, M, Murrell, B, Pybus, O, Martin, D, Hein, J
Format: Working paper
Published: 2019
_version_ 1797059061248688128
author Golden, M
Murrell, B
Pybus, O
Martin, D
Hein, J
author_facet Golden, M
Murrell, B
Pybus, O
Martin, D
Hein, J
author_sort Golden, M
collection OXFORD
description Pairs of nucleotides within functional nucleic acid secondary structures often display evidence of coevolution that is consistent with the maintenance of base-pairing. Here we introduce a sequence evolution model, MESSI, that infers coevolution associated with base-paired sites in DNA or RNA sequence alignments. MESSI can estimate coevolution whilst accounting for an unknown secondary structure. MESSI can also use GPU parallelism to increase computational speed. We used MESSI to infer coevolution associated with GC, AU (AT in DNA), GU (GT in DNA) pairs in non-coding RNA alignments, and in single-stranded RNA and DNA virus alignments. Estimates of GU pair coevolution were found to be higher at base-paired sites in single-stranded RNA viruses and non-coding RNAs than estimates of GT pair coevolution in single-stranded DNA viruses, suggesting that GT pairs do not stabilise DNA secondary structures to the same extent that GU pairs do in RNA. Additionally, MESSI estimates the degrees of coevolution at individual base-paired sites in an alignment. These estimates were computed for a SHAPE-MaP-determined HIV-1 NL4-3 RNA secondary structure and two corresponding alignments. We found that estimates of coevolution were more strongly correlated with experimentally-determined SHAPE-MaP pairing scores than three non-evolutionary measures of base-pairing covariation. To assist researchers in prioritising substructures with potential functionality, MESSI automatically ranks substructures by degrees of coevolution at base-paired sites within them. Such a ranking was created for an HIV-1 subtype B alignment, revealing an excess of top-ranking substructures that have been previously identified as having structure-related functional importance, amongst several uncharacterised top-ranking substructures.
first_indexed 2024-03-06T19:58:55Z
format Working paper
id oxford-uuid:26994364-c364-414f-a14f-9318a7534698
institution University of Oxford
last_indexed 2024-03-06T19:58:55Z
publishDate 2019
record_format dspace
spelling oxford-uuid:26994364-c364-414f-a14f-9318a75346982022-03-26T12:01:59ZEvolutionary analyses of base-pairing interactions in DNA and RNA secondary structuresWorking paperhttp://purl.org/coar/resource_type/c_8042uuid:26994364-c364-414f-a14f-9318a7534698Symplectic Elements at Oxford2019Golden, MMurrell, BPybus, OMartin, DHein, JPairs of nucleotides within functional nucleic acid secondary structures often display evidence of coevolution that is consistent with the maintenance of base-pairing. Here we introduce a sequence evolution model, MESSI, that infers coevolution associated with base-paired sites in DNA or RNA sequence alignments. MESSI can estimate coevolution whilst accounting for an unknown secondary structure. MESSI can also use GPU parallelism to increase computational speed. We used MESSI to infer coevolution associated with GC, AU (AT in DNA), GU (GT in DNA) pairs in non-coding RNA alignments, and in single-stranded RNA and DNA virus alignments. Estimates of GU pair coevolution were found to be higher at base-paired sites in single-stranded RNA viruses and non-coding RNAs than estimates of GT pair coevolution in single-stranded DNA viruses, suggesting that GT pairs do not stabilise DNA secondary structures to the same extent that GU pairs do in RNA. Additionally, MESSI estimates the degrees of coevolution at individual base-paired sites in an alignment. These estimates were computed for a SHAPE-MaP-determined HIV-1 NL4-3 RNA secondary structure and two corresponding alignments. We found that estimates of coevolution were more strongly correlated with experimentally-determined SHAPE-MaP pairing scores than three non-evolutionary measures of base-pairing covariation. To assist researchers in prioritising substructures with potential functionality, MESSI automatically ranks substructures by degrees of coevolution at base-paired sites within them. Such a ranking was created for an HIV-1 subtype B alignment, revealing an excess of top-ranking substructures that have been previously identified as having structure-related functional importance, amongst several uncharacterised top-ranking substructures.
spellingShingle Golden, M
Murrell, B
Pybus, O
Martin, D
Hein, J
Evolutionary analyses of base-pairing interactions in DNA and RNA secondary structures
title Evolutionary analyses of base-pairing interactions in DNA and RNA secondary structures
title_full Evolutionary analyses of base-pairing interactions in DNA and RNA secondary structures
title_fullStr Evolutionary analyses of base-pairing interactions in DNA and RNA secondary structures
title_full_unstemmed Evolutionary analyses of base-pairing interactions in DNA and RNA secondary structures
title_short Evolutionary analyses of base-pairing interactions in DNA and RNA secondary structures
title_sort evolutionary analyses of base pairing interactions in dna and rna secondary structures
work_keys_str_mv AT goldenm evolutionaryanalysesofbasepairinginteractionsindnaandrnasecondarystructures
AT murrellb evolutionaryanalysesofbasepairinginteractionsindnaandrnasecondarystructures
AT pybuso evolutionaryanalysesofbasepairinginteractionsindnaandrnasecondarystructures
AT martind evolutionaryanalysesofbasepairinginteractionsindnaandrnasecondarystructures
AT heinj evolutionaryanalysesofbasepairinginteractionsindnaandrnasecondarystructures