首页> 外文会议>Asia-Pacific Bioinformatics Conference >SOMEA: self-organizing map based extraction algorithm for DNA motif identification with heterogeneous model
【24h】

SOMEA: self-organizing map based extraction algorithm for DNA motif identification with heterogeneous model

机译:SOMEA:基于自组织的基于地图的DNA基序与异构模型的识别算法

获取原文

摘要

Background: Discrimination of transcription factor binding sites (TFBS) from background sequences plays a key role in computational motif discovery. Current clustering based algorithms employ homogeneous model for problem solving, which assumes that motifs and background signals can be equivalently characterized. This assumption has some limitations because both sequence signals have distinct properties. Results: This paper aims to develop a Self-Organizing Map (SOM) based clustering algorithm for extracting binding sites in DNA sequences. Our framework is based on a novel intra-node soft competitive procedure to achieve maximum discrimination of motifs from background signals in datasets. The intra-node competition is based on an adaptive weightingtechnique on two different signal models to better represent these two classes of signals. Using several real and artificial datasets, we compared our proposed method with several motif discovery tools. Compared to SOMBRERO, a state-of-the-art SOM basedmotif discovery tool, it is found that our algorithm can achieve significant improvements in the average precision rates (i.e., about 27%) on the real datasets without compromising its sensitivity. Our method also performed favourably comparing againstother motif discovery tools. Conclusions: Motif discovery with model based clustering framework should consider the use of heterogeneous model to represent the two classes of signals in DNA sequences. Such heterogeneous model can achieve better signal discrimination compared to the homogeneous model.
机译:背景:来自背景序列的转录因子结合位点(TFB)的判断在计算主题发现中起着关键作用。基于基于集群的算法采用问题解决的均匀模型,这假设可以等效地表征图案和背景信号。此假设具有一些限制,因为两个序列信号都具有不同的属性。结果:本文旨在开发基于自组织地图(SOM)的聚类算法,用于提取DNA序列中的结合位点。我们的框架是基于新颖的节点内部软竞争程序,以实现来自数据集中的背景信号的最大判断图案。节点内竞争基于两个不同的信号模型上的自适应加权技术,以更好地代表这两类信号。使用几个真实和人工数据集,我们将我们提出的方法与多个主题发现工具进行了比较。与SoMbrero相比,基于最先进的SOM的基于Mot Discovery工具,发现我们的算法可以在实际数据集上的平均精度速率(即,约27%)的平均精度率(即,约27%)的显着改进,而不会影响其灵敏度。我们的方法还对其他主题发现工具进行了有利的比较。结论:基于模型的聚类框架的图案发现应考虑使用异质模型来代表DNA序列中的两类信号。与均匀模型相比,这种异构模型可以实现更好的信号辨别。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号