【24h】

Protein Motif Discovery with Linear Genetic Programming

机译:线性遗传编程的蛋白质基序发现

获取原文
获取原文并翻译 | 示例

摘要

There have been published some studies of genetic programming as a way to discover motifs in proteins and other biological data. These studies have been small, and often used domain knowledge to improve search. In this paper we present a genetic programming algorithm, that does not use domain knowledge, with results on 44 different protein families. We demonstrate that our list-based representation, given a fixed amount of processing resources/is able to discover meaningful motifs with good classification performance. Sometimes comparable to or even surpassing that of motifs found in a database of manually created motifs. We also investigate introduction of gaps in our algorithm, and it seems that this give a small increase in classification accuracy and recall, but with reduced precision.
机译:已经发表了一些关于遗传程序设计的研究,作为发现蛋白质和其他生物学数据中基序的一种方法。这些研究规模很小,经常使用领域知识来改善搜索。在本文中,我们提出了一种遗传编程算法,该算法不使用领域知识,其结果涉及44个不同的蛋白质家族。我们证明,在给定固定数量的处理资源的情况下,基于列表的表示形式/能够发现具有良好分类性能的有意义的图案。有时可与甚至超过手动创建的图案数据库中找到的图案进行比较。我们还研究了在算法中引入间隙的情况,看来这会稍微提高分类精度和召回率,但降低精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号