首页> 外文会议>International Symposium on Current Progress in Mathematics and Sciences >Protein Sequences Clustering of Herpes Virus by Using Tribe Markov Clustering (Tribe-MCL)
【24h】

Protein Sequences Clustering of Herpes Virus by Using Tribe Markov Clustering (Tribe-MCL)

机译:使用部落马克罗夫聚类(TRIBE-MCL)蛋白序列疱疹病毒的聚类

获取原文

摘要

The herpes virus can be found anywhere and one of the important characteristics is its ability to cause acute and chronic infection at certain times so as a result of the infection allows severe complications occurred. The herpes virus is composed of DNA containing protein and wrapped by glycoproteins. In this work, the Herpes viruses family is classified and analyzed by clustering their protein-sequence using Tribe Markov Clustering (Tribe-MCL) algorithm. Tribe-MCL is an efficient clustering method based on the theory of Markov chains, to classify protein families from protein sequences using pre-computed sequence similarity information. We implement the Tribe-MCL algorithm using an open source program of R. We select 24 protein sequences of Herpes virus obtained from NCBI database. The dataset consists of three types of glycoprotein B, F, and H. Each type has eight herpes virus that infected humans. Based on our simulation using different inflation factor r=1 .5, 2, 3 we find a various number of the clusters results. The greater the inflation factor the greater the number of their clusters. Each protein will grouped together in the same type of protein.
机译:疱疹病毒可以在任何地方找到,其中一个重要的特征是其在某些时候导致急性和慢性感染的能力,因此由于感染而导致严重的并发症发生了严重的并发症。疱疹病毒由含有含有蛋白质的DNA和用糖蛋白包裹。在这项工作中,通过使用部落Markov聚类(TRIBE-MCL)算法进行蛋白质序列来分类和分析疱疹病毒家族。部落-MCL是基于马尔可夫链理论的有效聚类方法,使用预先计算的序列相似信息对来自蛋白质序列的蛋白质家族进行分类。我们使用R的开源程序实施TRIBE-MCL算法。我们选择从NCBI数据库获得的24个疱疹病毒的蛋白序列。数据集由三种类型的糖蛋白B,F和H组成。每种类型有八种感染人类的​​病毒。基于我们的模拟使用不同的充气因子r = 1 .5,2,3,我们发现各种群集结果。膨胀因子越大,其集群的数量越大。每种蛋白质将以相同类型的蛋白质分组。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号