【24h】

Protein Motif Discovery with Linear Genetic Programming

机译:用线性遗传编程发现蛋白质主题发现

获取原文

摘要

There have been published some studies of genetic programming as a way to discover motifs in proteins and other biological data. These studies have been small, and often used domain knowledge to improve search. In this paper we present a genetic programming algorithm, that does not use domain knowledge, with results on 44 different protein families. We demonstrate that our list-based representation, given a fixed amount of processing resources/is able to discover meaningful motifs with good classification performance. Sometimes comparable to or even surpassing that of motifs found in a database of manually created motifs. We also investigate introduction of gaps in our algorithm, and it seems that this give a small increase in classification accuracy and recall, but with reduced precision.
机译:已经出版了一些关于遗传编程的研究,以探测蛋白质和其他生物数据中的基序的方式。这些研究很小,通常使用域名知识来改善搜索。在本文中,我们提出了一种遗传编程算法,不使用域知识,结果44种不同的蛋白质。我们展示了我们基于列表的表示,给定固定数量的处理资源/能够以良好的分类性能发现有意义的主题。有时可与手动创建的图案数据库中发现的图案相当或甚至超越。我们还调查了算法中的差距的引入,似乎这似乎可以缩短分类准确性和召回,但精度降低。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号