...
【24h】

RFRCDB-siRNA: Improved design of siRNAs by random forest regression model coupled with database searching.

机译:RFRCDB-siRNA:通过随机森林回归模型和数据库搜索改进了siRNA的设计。

获取原文
获取原文并翻译 | 示例
           

摘要

Although the observations concerning the factors which influence the siRNA efficacy give clues to the mechanism of RNAi, the quantitative prediction of the siRNA efficacy is still a challenge task. In this paper, we introduced a novel non-linear regression method: random forest regression (RFR), to quantitatively estimate siRNAs efficacy values. Compared with an alternative machine learning regression algorithm, support vector machine regression (SVR) and four other score-based algorithms [A. Reynolds, D. Leake, Q. Boese, S. Scaringe, W.S. Marshall, A. Khvorova, Rational siRNA design for RNA interference, Nat. Biotechnol. 22 (2004) 326-330; K. Ui-Tei, Y. Naito, F. Takahashi, T. Haraguchi, H. Ohki-Hamazaki, A. Juni, R. Ueda, K. Saigo, Guidelines for the selection of highly effective siRNA sequences for mammalian and chick RNA interference, Nucleic Acids Res. 32 (2004) 936-948; A.C. Hsieh, R. Bo, J. Manola, F. Vazquez, O. Bare, A. Khvorova, S. Scaringe, W.R. Sellers, A library of siRNA duplexes targetingthe phosphoinositide 3-kinase pathway: determinants of gene silencing for use in cell-based screens, Nucleic Acids Res. 32 (2004) 893-901; M. Amarzguioui, H. Prydz, An algorithm for selection of functional siRNA sequences, Biochem. Biophys. Res. Commun. 316 (2004) 1050-1058) our RFR model achieved the best performance of all. A web-server, RFRCDB-siRNA (http://www.bioinf.seu.edu.cn/siRNA/index.htm), has been developed. RFRCDB-siRNA consists of two modules: a siRNA-centric database and a RFR prediction system. RFRCDB-siRNA works as follows: (1) Instead of directly predicting the gene silencing activity of siRNAs, the service takes these siRNAs as queries to search against the siRNA-centric database. The matched sequences with the exceeding the user defined functionality value threshold are kept. (2) The mismatched sequences are then processed into the RFR prediction system for further analysis.
机译:尽管有关影响siRNA功效的因素的观察为RNAi的机制提供了线索,但是对siRNA功效的定量预测仍是一项艰巨的任务。在本文中,我们介绍了一种新颖的非线性回归方法:随机森林回归(RFR),以定量估计siRNA的功效值。与替代的机器学习回归算法相比,支持向量机回归(SVR)和其他四个基于分数的算法[A.雷诺(Reynolds),里克(D.Leake),凯斯(Boese),斯卡汀(S.Scaringe),华盛顿州Marshall,A. Khvorova,针对RNA干扰的Rational siRNA设计,Nat。生物技术。 22(2004)326-330; Ui-Tei,Y.Naito,F.Takahashi,T.Haraguchi,H.Ohki-Hamazaki,A.Juni,R.Ueda,K.Saigo,为哺乳动物和雏鸡RNA选择高效siRNA序列的指南干扰,核酸研究。 32(2004)936-948; AC Hsieh,R。Bo,J。Manola,F。Vazquez,O。Bare,A。Khvorova,S。Scaringe,WR Sellers,针对磷酸肌醇3-激酶途径的siRNA双链体文库:用于细胞中的基因沉默决定因素的屏幕,Nucleic Acids Res。 32(2004)893-901; M. Amarzguioui,H。Prydz,一种选择功能性siRNA序列的算法,生物化学。生物物理学。 Res。公社316(2004)1050-1058),我们的RFR模型取得了最佳性能。已经开发了一个网络服务器RFRCDB-siRNA(http://www.bioinf.seu.edu.cn/siRNA/index.htm)。 RFRCDB-siRNA由两个模块组成:以siRNA为中心的数据库和RFR预测系统。 RFRCDB-siRNA的工作方式如下:(1)该服务不是直接预测siRNA的基因沉默活性,而是将这些siRNA作为查询来搜索以siRNA为中心的数据库。保留超过用户定义的功能值阈值的匹配序列。 (2)然后将不匹配的序列处理到RFR预测系统中进行进一步分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号