首页> 外文期刊>RNA >Improved free energy parameters for RNA pseudoknotted secondary structure prediction
【24h】

Improved free energy parameters for RNA pseudoknotted secondary structure prediction

机译:改进的自由能参数用于RNA假结二级结构预测

获取原文
获取原文并翻译 | 示例
           

摘要

Accurate prediction of RNA pseudoknotted secondary structures from the base sequence is a challenging computational problem. Since prediction algorithms rely on thermodynamic energy models to identify low-energy structures, prediction accuracy relies in large part on the quality of free energy change parameters. In this work, we use our earlier constraint generation and Boltzmann likelihood parameter estimation methods to obtain new energy parameters for two energy models for secondary structures with pseudoknots, namely, the Dirks–Pierce (DP) and the Cao–Chen (CC) models. To train our parameters, and also to test their accuracy, we create a large data set of both pseudoknotted and pseudoknot-free secondary structures. In addition to structural data our training data set also includes thermodynamic data, for which experimentally determined free energy changes are available for sequences and their reference structures. When incorporated into the HotKnots prediction algorithm, our new parameters result in significantly improved secondary structure prediction on our test data set. Specifically, the prediction accuracy when using our new parameters improves from 68% to 79% for the DP model, and from 70% to 77% for the CC model.
机译:从碱基序列准确预测RNA假结二级结构是一个具有挑战性的计算问题。由于预测算法依赖于热力学能量模型来识别低能结构,因此预测精度在很大程度上取决于自由能变化参数的质量。在这项工作中,我们使用较早的约束生成和Boltzmann似然参数估计方法来获得带有假结的二级结构的两个能量模型的新能量参数,即Dirks-Pierce(DP)和Cao-Chen(CC)模型。为了训练我们的参数并测试其准确性,我们创建了一个包含假结和无假结二级结构的大型数据集。除了结构数据,我们的训练数据集还包括热力学数据,对于这些数据,实验确定的自由能变化可用于序列及其参考结构。当将这些参数合并到HotKnots预测算法中后,我们的新参数将大大改善测试数据集的二级结构预测。具体而言,使用我们的新参数时,DP模型的预测精度从68%提高到79%,CC模型从70%提高到77%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号