SparseRNAfolD: optimized sparse RNA pseudoknot-free folding with dangle consideration

Abstract Motivation Computational RNA secondary structure prediction by free energy minimization is indispensable for analyzing structural RNAs and their interactions. These methods find the structure with the minimum free energy (MFE) among exponentially many possible structures and have a restrict...

Full description

Bibliographic Details
Main Authors: Mateo Gray, Sebastian Will, Hosna Jabbari
Format: Article
Language:English
Published: BMC 2024-03-01
Series:Algorithms for Molecular Biology
Subjects:
Online Access:https://doi.org/10.1186/s13015-024-00256-4
_version_ 1797275872850345984
author Mateo Gray
Sebastian Will
Hosna Jabbari
author_facet Mateo Gray
Sebastian Will
Hosna Jabbari
author_sort Mateo Gray
collection DOAJ
description Abstract Motivation Computational RNA secondary structure prediction by free energy minimization is indispensable for analyzing structural RNAs and their interactions. These methods find the structure with the minimum free energy (MFE) among exponentially many possible structures and have a restrictive time and space complexity ( $$O(n^3)$$ O ( n 3 ) time and $$O(n^2)$$ O ( n 2 ) space for pseudoknot-free structures) for longer RNA sequences. Furthermore, accurate free energy calculations, including dangle contributions can be difficult and costly to implement, particularly when optimizing for time and space requirements. Results Here we introduce a fast and efficient sparsified MFE pseudoknot-free structure prediction algorithm, SparseRNAFolD, that utilizes an accurate energy model that accounts for dangle contributions. While the sparsification technique was previously employed to improve the time and space complexity of a pseudoknot-free structure prediction method with a realistic energy model, SparseMFEFold, it was not extended to include dangle contributions due to the complexity of computation. This may come at the cost of prediction accuracy. In this work, we compare three different sparsified implementations for dangle contributions and provide pros and cons of each method. As well, we compare our algorithm to LinearFold, a linear time and space algorithm, where we find that in practice, SparseRNAFolD has lower memory consumption across all lengths of sequence and a faster time for lengths up to 1000 bases. Conclusion Our SparseRNAFolD algorithm is an MFE-based algorithm that guarantees optimality of result and employs the most general energy model, including dangle contributions. We provide a basis for applying dangles to sparsified recursion in a pseudoknot-free model that has the potential to be extended to pseudoknots.
first_indexed 2024-03-07T15:20:15Z
format Article
id doaj.art-b64d56fc4ef845ecb03400afccd55427
institution Directory Open Access Journal
issn 1748-7188
language English
last_indexed 2024-03-07T15:20:15Z
publishDate 2024-03-01
publisher BMC
record_format Article
series Algorithms for Molecular Biology
spelling doaj.art-b64d56fc4ef845ecb03400afccd554272024-03-05T17:41:31ZengBMCAlgorithms for Molecular Biology1748-71882024-03-0119111610.1186/s13015-024-00256-4SparseRNAfolD: optimized sparse RNA pseudoknot-free folding with dangle considerationMateo Gray0Sebastian Will1Hosna Jabbari2Department of Biomedical Engineering, University of AlbertaDepartment of Computer Science CNRS/LIX (UMR 7161), Institut Polytechnique de ParisDepartment of Biomedical Engineering, University of AlbertaAbstract Motivation Computational RNA secondary structure prediction by free energy minimization is indispensable for analyzing structural RNAs and their interactions. These methods find the structure with the minimum free energy (MFE) among exponentially many possible structures and have a restrictive time and space complexity ( $$O(n^3)$$ O ( n 3 ) time and $$O(n^2)$$ O ( n 2 ) space for pseudoknot-free structures) for longer RNA sequences. Furthermore, accurate free energy calculations, including dangle contributions can be difficult and costly to implement, particularly when optimizing for time and space requirements. Results Here we introduce a fast and efficient sparsified MFE pseudoknot-free structure prediction algorithm, SparseRNAFolD, that utilizes an accurate energy model that accounts for dangle contributions. While the sparsification technique was previously employed to improve the time and space complexity of a pseudoknot-free structure prediction method with a realistic energy model, SparseMFEFold, it was not extended to include dangle contributions due to the complexity of computation. This may come at the cost of prediction accuracy. In this work, we compare three different sparsified implementations for dangle contributions and provide pros and cons of each method. As well, we compare our algorithm to LinearFold, a linear time and space algorithm, where we find that in practice, SparseRNAFolD has lower memory consumption across all lengths of sequence and a faster time for lengths up to 1000 bases. Conclusion Our SparseRNAFolD algorithm is an MFE-based algorithm that guarantees optimality of result and employs the most general energy model, including dangle contributions. We provide a basis for applying dangles to sparsified recursion in a pseudoknot-free model that has the potential to be extended to pseudoknots.https://doi.org/10.1186/s13015-024-00256-4RNAMFESecondary structure predictionDangleSparsificationSpace complexity
spellingShingle Mateo Gray
Sebastian Will
Hosna Jabbari
SparseRNAfolD: optimized sparse RNA pseudoknot-free folding with dangle consideration
Algorithms for Molecular Biology
RNA
MFE
Secondary structure prediction
Dangle
Sparsification
Space complexity
title SparseRNAfolD: optimized sparse RNA pseudoknot-free folding with dangle consideration
title_full SparseRNAfolD: optimized sparse RNA pseudoknot-free folding with dangle consideration
title_fullStr SparseRNAfolD: optimized sparse RNA pseudoknot-free folding with dangle consideration
title_full_unstemmed SparseRNAfolD: optimized sparse RNA pseudoknot-free folding with dangle consideration
title_short SparseRNAfolD: optimized sparse RNA pseudoknot-free folding with dangle consideration
title_sort sparsernafold optimized sparse rna pseudoknot free folding with dangle consideration
topic RNA
MFE
Secondary structure prediction
Dangle
Sparsification
Space complexity
url https://doi.org/10.1186/s13015-024-00256-4
work_keys_str_mv AT mateogray sparsernafoldoptimizedsparsernapseudoknotfreefoldingwithdangleconsideration
AT sebastianwill sparsernafoldoptimizedsparsernapseudoknotfreefoldingwithdangleconsideration
AT hosnajabbari sparsernafoldoptimizedsparsernapseudoknotfreefoldingwithdangleconsideration