Source: BioMed Research International (U.S. National Institutes of Health literature collection)

Pattern Matching for DNA Sequencing Data Using Multiple Bloom Filters



Abstract

Storing and processing large DNA sequences has long been a major problem because the volume of DNA sequence data keeps growing. A number of solutions have been proposed, but they require significant computation and memory. An efficient storage and pattern matching solution is therefore required for DNA sequencing data. Bloom filters (BFs) are an efficient data structure, used in bioinformatics mostly for the classification of DNA sequences. In this paper, we explore uses of BFs beyond classification. The proposed solution is based on Multiple Bloom Filters (MBFs) and finds all the locations and the number of repetitions of a specified pattern inside a DNA sequence. Both of these factors are extremely important in determining the type and intensity of a disease. This paper serves as a first effort towards optimizing the search for the location and frequency of substrings in DNA sequences using MBFs. It presents a proof-of-concept implementation for a given set of data using the proposed MBF technique, and we expect that further optimization of the proposed solution can bring remarkable results. Performance evaluation shows improved accuracy and time efficiency of the proposed approach.
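The abstract does not spell out how the MBFs are organized, so the following is only an illustrative sketch of one way several Bloom filters could help locate and count a pattern, not the authors' implementation. It assumes the sequence is split into fixed-size blocks, that each block gets its own Bloom filter over the k-mers starting in it, and that the query pattern has exactly length k. The names `BloomFilter`, `build_filters`, `find_pattern`, `block_size`, and the filter sizes are all hypothetical choices made for this example.

```python
import hashlib


class BloomFilter:
    """A small Bloom filter over strings, backed by a bytearray of bits."""

    def __init__(self, num_bits: int = 1 << 16, num_hashes: int = 3):
        # num_bits and num_hashes are arbitrary illustrative defaults.
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _indexes(self, item: str):
        # Double hashing: derive num_hashes bit indexes from one SHA-256 digest.
        digest = hashlib.sha256(item.encode("ascii")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1
        return [(h1 + i * h2) % self.num_bits for i in range(self.num_hashes)]

    def add(self, item: str) -> None:
        for idx in self._indexes(item):
            self.bits[idx // 8] |= 1 << (idx % 8)

    def __contains__(self, item: str) -> bool:
        return all(
            self.bits[idx // 8] & (1 << (idx % 8)) for idx in self._indexes(item)
        )


def build_filters(sequence: str, k: int, block_size: int):
    """Build one Bloom filter per block, holding every k-mer that starts in it."""
    filters = []
    last_start = len(sequence) - k + 1  # one past the last valid k-mer start
    for start in range(0, len(sequence), block_size):
        bf = BloomFilter()
        for i in range(start, min(start + block_size, last_start)):
            bf.add(sequence[i:i + k])
        filters.append((start, bf))
    return filters


def find_pattern(sequence: str, filters, pattern: str, block_size: int):
    """Return (positions, count); assumes len(pattern) equals the indexed k."""
    k = len(pattern)
    last_start = len(sequence) - k + 1
    positions = []
    for start, bf in filters:
        if pattern not in bf:
            continue  # a Bloom filter has no false negatives: skip this block
        # A hit may be a false positive, so confirm by exact comparison.
        for i in range(start, min(start + block_size, last_start)):
            if sequence[i:i + k] == pattern:
                positions.append(i)
    return positions, len(positions)
```

In this sketch the filters only prune blocks that cannot contain the pattern; the reported locations and repetition count always come from the exact comparison, so a false positive costs time but never produces a wrong position.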
