A Clustering-Based Algorithm for De Novo Motif Discovery in DNA Sequences

机译：一种基于聚类的DNA序列De Novo Motif发现的算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Motif discovery is a challenging problem in molecular biology and has been attracting researcher's attention for years. Different kind of data and computational methods have been used to unravel this problem, but there is still room for improvement. In this study, our goal was to develop a method with the ability to identify all the TFBS signals, including known and unknown, inside the input set of sequences. We developed a clustering method specialized as part of our algorithm which outperforms other existing clustering methods such as DNACLUST and CD-HIT-EST in clustering short sequences. A scoring system was needed to determine how much a cluster is close to being a real motif. Multiple features are calculated based on the contents of each cluster to determine the score of the cluster. These features contain a set of divergence measures, positional, and occurrence information. These scores are combined in a way that a trade-off between them determines the clusters situation. There is an option to compare the final results with the motif databases such as Jolma2013, and UniProbe using Tomtom motif comparison tool. Algorithm Evaluation has been performed on three datasets from ABS database.

机译：MOTIF发现是分子生物学的一个具有挑战性的问题，多年来一直吸引研究员的注意。不同类型的数据和计算方法已被用于解开此问题，但仍有改进的余地。在这项研究中，我们的目标是开发一种能够识别输入组的输入组中的所有TFB信号的方法，包括已知和未知。我们开发了一种专门为我们算法的一部分的聚类方法，其特殊地优于其他现有的聚类方法，例如DNAClust和CD-HIT-EST中的聚类短序列。需要评分系统来确定群集靠近成为真正的主题。基于每个群集的内容来计算多个功能以确定群集的分数。这些功能包含一组分歧测量，位置和发生信息。这些分数在某种程度上组合在某种程度上，它们之间的权衡决定了集群状况。有一个选项可以使用TomTom Motif比较工具将最终结果与Jolma2013等主语数据库进行比较。从ABS数据库的三个数据集执行了算法评估。

著录项

来源
《National Conference on Biomedical Engineering》|2017年|347p|共6页
会议地点
作者
Mohammad Haghir Ebrahim-Abadi; Emad Fatemizadeh;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类生物医学工程;
关键词
Clustering algorithms; Anomaly detection; Biomedical measurement; Pulse width modulation; Clustering methods; Biomedical engineering; Position measurement;

机译：聚类算法;异常检测;生物医学测量;脉冲宽度调制;聚类方法;生物医学工程;位置测量;

相似文献

外文文献
中文文献
专利

1. PairMotif+: A fast and effective algorithm for de novo motif discovery in DNA sequences [J] . YuQ., HuoH., ZhangY., International journal of biological sciences . 2013,第3a4期

机译：PairMotif +：快速有效的从头序列发现DNA序列的算法
2. PairMotif+: A Fast and Effective Algorithm for De Novo Motif Discovery in DNA sequences [J] . Qiang Yu, Hongwei Huo, Yipu Zhang, International journal of biological sciences . 2013,第4期

机译：PairMotif +：一种快速有效的从头发现DNA序列的算法
3. PairMotif+: A fast and effective algorithm for de novo motif discovery in DNA sequences [J] . YuQ., HuoH., ZhangY., International journal of biological sciences . 2013,第3a4期

机译：Pairmotif +：DNA序列中De Novo Motif发现的快速有效算法
4. A Clustering-Based Algorithm for De Novo Motif Discovery in DNA Sequences [C] . Mohammad Haghir Ebrahim-Abadi, Emad Fatemizadeh 2017 24th National and 2nd International Iranian Conference on Biomedical Engineering . 2017

机译：DNA序列中从头发现的基于聚类的算法
5. Novel algorithms for motif discovery in bio-sequence datasets. [D] . Balla, Sudha. 2007

机译：用于生物序列数据集中的基序发现的新算法。
6. PairMotif+: A Fast and Effective Algorithm for De Novo Motif Discovery in DNA sequences [O] . Qiang Yu, Hongwei Huo, Yipu Zhang, 2013

机译：PairMotif +：一种快速有效的从头发现DNA序列的算法
7. A De Novo Shape Motif Discovery Algorithm Reveals Preferences of Transcription Factors for DNA Shape Beyond Sequence Motifs [O] . Md. Abul Hassan Samee, Benoit G. Bruneau, Katherine S. Pollard 2019

机译：De Novo形状基序算法揭示了在序列图中的DNA形状转录因子的偏好

A Clustering-Based Algorithm for De Novo Motif Discovery in DNA Sequences

摘要

著录项

相似文献

相关主题

期刊订阅