首页> 外国专利> POSITION TOP-K KEYWORD QUERY-BASED FAST INDEXING METHOD AND SYSTEM UNDER SLIDING WINDOW

POSITION TOP-K KEYWORD QUERY-BASED FAST INDEXING METHOD AND SYSTEM UNDER SLIDING WINDOW

机译:滑动窗口下基于位置TOP-K关键词查询的快速索引方法及系统

摘要

Disclosed are a position top-k keyword query-based fast indexing method and system under a sliding window. The fast indexing method comprises constructing a data indexing model and query. The construction of the data indexing model comprises the following: determining a geographical range covered by a quadtree and a node splitting rule; accepting a data stream, and inserting data into a node; for node splitting which satisfies the node splitting rule, inserting data to generate a complete quadtree; for a leaf node, storing an inverted index; for a non-leaf node, storing an MG aggregation abstract of sub-nodes thereof; and adjusting the structure of the quadtree. The query comprises the following: initializing a set of results; carrying out a branch trim operation to obtain a set of candidate results; and taking a word having the maximum score from a priority queue to start computation, traversing starting from a root node until an accurate score thereof is found in a leaf node and placing same into a queue, and repeating until the first k words in the priority queue no longer change. The present invention can effectively reduce costs and improve the speed of querying, can also effectively trim a search space according to word frequency and position proximity, and can process geographical text data streams with a high arrival rate.
机译:公开了一种滑动窗口下基于位置前k个关键字查询的快速索引方法和系统。快速索引方法包括构造数据索引模型和查询。数据索引模型的构建包括以下步骤:确定四叉树和节点划分规则所覆盖的地理范围;以及接受数据流,并将数据插入节点;对于满足节点拆分规则的节点拆分,插入数据以生成完整的四叉树;对于叶节点,存储倒排索引;对于非叶子节点,存储其子节点的MG聚合摘要;并调整四叉树的结构。该查询包括以下内容:初始化一组结果;进行分支修剪操作以获得一组候选结果;并从优先级队列中获取具有最高分数的单词以开始计算,从根节点开始遍历,直到在叶节点中找到其准确分数,然后将其放入队列中,然后重复执行,直到优先级中的前k个单词为止队列不再更改。本发明可以有效地降低成本,提高查询速度,还可以根据词频和位置接近度有效地修剪搜索空间,并且可以以较高的到达率处理地理文本数据流。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号