Conference: International Symposium on Bioinformatics Research and Applications

Iterative Spaced Seed Hashing: Closing the Gap Between Spaced Seed Hashing and k-mer Hashing


Abstract

Alignment-free classification of sequences has enabled high-throughput processing of sequencing data in many bioinformatics pipelines. Much work has been done to speed up the indexing of k-mers through hash tables and other data structures. These efforts have led to very fast indexes, but because they are k-mer based, they often lack sensitivity in the presence of sequencing errors or polymorphisms. Spaced seeds are a special type of pattern that accounts for errors or mutations. They improve sensitivity and are now routinely used instead of k-mers in many applications. The major drawback of spaced seeds is that they cannot be hashed efficiently, so their use substantially increases computation time. In this paper we address the problem of efficient spaced seed hashing. We propose an iterative algorithm that combines multiple spaced seed hashes, exploiting the similarity of adjacent hash values to compute the next hash efficiently. We report a series of experiments on hashing HTS reads with several spaced seeds. Our algorithm can compute the hash values of spaced seeds with a speedup of 6.2x, outperforming previous methods. Software and datasets are available at ISSH
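To make the contrast in the abstract concrete, the following is a minimal sketch of why contiguous k-mers hash cheaply while spaced seeds do not. It is not the paper's iterative algorithm: the 2-bit encoding, the function names `kmer_hashes` and `spaced_seed_hashes`, and the representation of a seed as a string of `1` (match) and `0` (don't care) are all assumptions made for illustration.

```python
# Illustrative sketch only; NOT the ISSH algorithm from the paper.
# Assumed 2-bit nucleotide encoding.
ENCODE = {"A": 0, "C": 1, "G": 2, "T": 3}


def kmer_hashes(seq, k):
    """Rolling hash of every contiguous k-mer: one shift and mask per base."""
    mask = (1 << (2 * k)) - 1
    h = 0
    out = []
    for i, base in enumerate(seq):
        # Reuse the previous hash: drop the oldest base, append the new one.
        h = ((h << 2) | ENCODE[base]) & mask
        if i >= k - 1:
            out.append(h)
    return out


def spaced_seed_hashes(seq, seed):
    """Naive spaced seed hash: recomputed from scratch at each position.

    `seed` is a string such as "1101", where "1" marks a position that
    contributes to the hash and "0" marks a position that is ignored.
    """
    care = [j for j, c in enumerate(seed) if c == "1"]
    out = []
    for i in range(len(seq) - len(seed) + 1):
        h = 0
        for j in care:  # cost grows with the seed weight at every position
            h = (h << 2) | ENCODE[seq[i + j]]
        out.append(h)
    return out
```

In this sketch, a seed consisting only of `1`s reproduces the k-mer hashes, while a seed such as `101` assigns the same hash to `ACG` and `ATG`, which is the tolerance to a single mismatch that the abstract attributes to spaced seeds. The naive version pays for that tolerance by rebuilding each hash from scratch, which is the overhead that motivates reusing adjacent hash values.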
