首页> 外文会议>Database systems for advanced applications >A Two-Tire Index Structure for Approximate String Matching with Block Moves
【24h】

A Two-Tire Index Structure for Approximate String Matching with Block Moves

机译:两轮胎索引结构,用于通过块移动近似匹配字符串

获取原文
获取原文并翻译 | 示例

摘要

Many applications need to solve the problem of approximate string matching with block moves. It is an NP-Complete problem to compute block edit distance between two strings. Our goal is to filter non-candidate strings as much as possible. Based on the two matured filter strategies, frequency distance and positional q-gram, we propose a two-tire index structure to make the use of the two fiiters more efficiently. We give a full specification of the index structure, including how to choose character order to achieve a better filterability and how to balance number of strings in different clusters. We present our experiments on real data sets to evaluate our technique and show the proposed index structure can provide a good performance.
机译:许多应用程序需要解决与块移动近似的字符串匹配的问题。计算两个字符串之间的块编辑距离是一个NP-Complete问题。我们的目标是尽可能地过滤非候选字符串。基于频率距离和位置q-gram这两种成熟的滤波策略,我们提出了一种两轮胎的索引结构,以更有效地利用两个滤波器。我们给出了索引结构的完整说明,包括如何选择字符顺序以实现更好的可过滤性,以及如何平衡不同簇中字符串的数量。我们在真实数据集上展示我们的实验以评估我们的技术,并表明所提出的索引结构可以提供良好的性能。

著录项

  • 来源
  • 会议地点 Brisbane(AU);Brisbane(AU);Brisbane(AU);Brisbane(AU);Brisbane(AU);Brisbane(AU);Brisbane(AU);Brisbane(AU);Brisbane(AU);Brisbane(AU);Brisbane(AU);Brisbane(AU);Brisbane(AU)
  • 作者

    Bin Wang; Long Xie; Guoren Wang;

  • 作者单位

    Key Laboratory of Medical Image Computing (Northeastern University),Ministry of Education School of Information Science and Engineering,Northeastern University, Shenyang, China;

    Information School, Liaoning University, Shenyang, China;

    Key Laboratory of Medical Image Computing (Northeastern University),Ministry of Education School of Information Science and Engineering,Northeastern University, Shenyang, China;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 TP311.13;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号