【24h】

RNA Secondary Structure Prediction Via Energy Density Minimization

机译:通过能量密度最小化的RNA二级结构预测

获取原文
获取原文并翻译 | 示例

摘要

There is a resurgence of interest in RNA secondary structure prediction problem (a.k.a. the RNA folding problem) due to the discovery of many new families of non-coding RNAs with a variety of functions. The vast majority of the computational tools for RNA secondary structure prediction are based on free energy minimization. Here the goal is to compute a non-conflicting collection of structural elements such as hairpins, bulges and loops, whose total free energy is as small as possible. Perhaps the most commonly used tool for structure prediction, mf old/RNAfold, is designed to fold a single RNA sequence. More recent methods, such as RNAscf and alifoldare developed to improve the prediction quality of this tool by aiming to minimize the free energy of a number of functionally similar RNA sequences simultaneously. Typically, the (stack) prediction quality of the latter approach improves as the number of sequences to be folded and/or the similarity between the sequences increase. If the number of available RNA sequences to be folded is small then the predictive power of multiple sequence folding methods can deteriorate to that of the single sequence folding methods or worse. In this paper we show that delocalizing the thermodynamic cost of forming an RNA substructure by considering the energy density of the substructure can significantly improve on secondary structure prediction via free energy minimization. We describe a new algorithm and a software tool that we call Densityf old, which aims to predict the secondary structure of an RNA sequence by minimizing the sum of energy densities of individual substructures. We show that when only one or a small number of input sequences are available, Densityf old can outperform all available alternatives. It is our hope that this approach will help to better understand the process of nucleation that leads to the formation of biologically relevant RNA substructures.
机译:由于发现了许多新的具有多种功能的非编码RNA家族,人们对RNA二级结构预测问题(又称RNA折叠问题)的兴趣再次兴起。 RNA二级结构预测的绝大多数计算工具都基于自由能最小化。此处的目标是计算结构元素(例如发夹,凸起和环)的无冲突集合,其总自由能尽可能小。可能最常用的结构预测工具是mf old / RNAfold,旨在折叠单个RNA序列。通过旨在同时使多个功能相似的RNA序列的自由能最小化,开发了诸如RNAscf和alifold之类的最新方法来提高该工具的预测质量。通常,后一种方法的(堆栈)预测质量随着要折叠的序列数和/或序列之间的相似度增加而提高。如果要折叠的可用RNA序列的数量很少,则多种序列折叠方法的预测能力可能会恶化到单序列折叠方法的预测能力,甚至更差。在本文中,我们表明,通过考虑子结构的能量密度来分散形成RNA子结构的热力学成本,可以通过自由能最小化显着改善二级结构预测。我们描述了一种称为Densityf old的新算法和软件工具,旨在通过最小化单个子结构的能量密度之和来预测RNA序列的二级结构。我们表明,当只有一个或少数几个输入序列可用时,Densityf old的性能会优于所有可用的替代方法。我们希望这种方法将有助于更好地理解导致形成生物学相关的RNA亚结构的成核过程。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号