International Conference on Bioinformatics Models, Methods and Algorithms

Fast Alignment-free Comparison for Regulatory Sequences using Multiple Resolution Entropic Profiles


Abstract

Enhancers are stretches of DNA (100-1000 bp) that play a major role in developmental gene expression, evolution and disease. It has recently been shown that in higher eukaryotes enhancers rarely work alone; instead, they collaborate by forming clusters of cis-regulatory modules (CRMs). Even though the binding of transcription factors is sequence-specific, identifying functionally similar enhancers is very difficult and cannot be carried out with traditional alignment-based techniques. In this paper we study the use of alignment-free measures for the classification of CRMs. However, alignment-free measures are generally tied to a fixed resolution k. Here we propose an alignment-free statistic based on multiple-resolution patterns derived from Entropic Profiles. An Entropic Profile is a function of the genomic location that captures the importance of that region with respect to the whole genome. We evaluate several alignment-free statistics on simulated data and on real mouse ChIP-seq sequences. The new statistic is highly successful in discriminating functionally related enhancers and, in almost all experiments, it outperforms fixed-resolution methods.
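To make the phrase "tied to a fixed resolution k" concrete, here is a minimal sketch of one fixed-resolution alignment-free score. The abstract does not say which statistics the paper evaluates, so using the D2 statistic (the inner product of the k-mer count vectors of two sequences) is an assumption made for illustration. It is not the Entropic Profile statistic the authors propose, and the function names `kmer_counts` and `d2` are hypothetical.

```python
from collections import Counter


def kmer_counts(seq, k):
    """Count every overlapping k-mer (word of length k) in seq."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def d2(seq_a, seq_b, k):
    """Illustrative D2 score: sum over shared k-mers of count_a * count_b.

    Assumption: D2 stands in for a generic fixed-k measure; the source
    abstract does not name the statistics it compares.
    """
    counts_a = kmer_counts(seq_a, k)
    counts_b = kmer_counts(seq_b, k)
    # Counter returns 0 for k-mers that are absent from seq_b.
    return sum(n * counts_b[word] for word, n in counts_a.items())


if __name__ == "__main__":
    a, b = "ACGTACGT", "ACGTTGCA"
    # The same pair of sequences scores differently at each resolution k.
    for k in (2, 3, 4):
        print(k, d2(a, b, k))
```

Because the score changes with k, any single choice of k fixes the scale at which two sequences are compared. That dependence is the limitation a multiple-resolution statistic is meant to address.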
