首页> 外文会议>IEEE International Conference on Semantic Computing >FBF: Bloom Filter for Fuzzy Membership Queries on Strings
【24h】

FBF: Bloom Filter for Fuzzy Membership Queries on Strings

机译:FBF:Bloom Filter for fuzzy会员资格查询字符串

获取原文

摘要

Bloom filters are space efficient, probabilistic data structures for membership queries with low false positives and zero false negatives. They are widely used in the field of big data, networking, bio-informatics, cloud computing and IoT. However, a standard bloom filter does not support fuzzy membership queries. To overcome this limitation, we propose a novel algorithm using locality sensitive hashing on strings which extends the capability of a bloom filter to support fuzzy queries and calculate a score indicating its degree of similarity with the dataset. Experiments are performed to analyze false positive and false negative rates along with space requirements, time taken to serve a query and time taken to build the data structure. The proposed algorithm is benchmarked on two real world datasets where it outperforms all the baselines while maintaining all the advantages of a standard bloom filter.
机译:Bloom过滤器是具有低误报和零假底片的成员查询的空间高效,概率数据结构。它们广泛应用于大数据,网络,生物信息学,云计算和物联网领域。但是,标准盛开过滤器不支持模糊成员查询。为了克服这种限制,我们提出了一种新颖的算法,使用局部敏感散列串联,其扩展了盛开过滤器的能力来支持模糊查询并计算与数据集相似度的分数。进行实验以分析假阳性和假负速率以及空间要求,以满足建立数据结构的查询和时间所花费的时间。所提出的算法在两个真实世界数据集上是基准测试,其中它优于所有基线,同时保持标准绽放过滤器的所有优点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号