首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Analysis of range queries and self-spatial join queries on real region datasets stored using an R-tree
【24h】

Analysis of range queries and self-spatial join queries on real region datasets stored using an R-tree

机译:使用R树存储的真实区域数据集的范围查询和自空间联接查询分析

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we study the node distribution of an R-tree storing region data, like, for instance, islands, lakes, or human-inhabited areas. We will show that real region datasets are packed in an R-tree into minimum bounding rectangles (MBRs) whose area distribution follows the same power law, named REGAL (REGion Area Law), as that for the regions themselves. Moreover, these MBRs are packed in their turn into MBRs following the same law, and so on iteratively, up to the root of the R-tree. Based on this observation, we are able to accurately estimate the search effort for range queries, using a small number of easy-to-retrieve parameters. Furthermore, since our analysis exploits, through a realistic mathematical model, the proximity relations existing among the regions in the dataset, we show how to use our model to predict the selectivity of a self-spatial join query posed on the dataset. Experiments on a variety of real datasets (islands, lakes, human-inhabited areas) show that our estimations are accurate, enjoying a geometric average relative error ranging from 22 percent to 32 percent for the search effort of a range query, and from 14 percent to 34 percent for the selectivity of a self-spatial join query. This is significantly better than using a naive model based on uniformity assumption, which gives rise to a geometric average relative error up to 270 percent and up to 85 percent for the two problems, respectively.
机译:在本文中,我们研究了存储区域数据(例如,岛屿,湖泊或人类居住区)的R树的节点分布。我们将显示真实区域数据集在R树中打包成最小边界矩形(MBR),其区域分布遵循与区域本身相同的幂定律,称为REGAL(区域面积定律)。而且,这些MBR按照相同的规律依次打包为MBR,依此类推,直到R树的根。基于此观察,我们能够使用少量易于检索的参数来准确估算范围查询的搜索量。此外,由于我们的分析是通过现实的数学模型利用数据集中各区域之间存在的邻近关系,因此我们展示了如何使用我们的模型来预测对数据集进行的自空间联接查询的选择性。在各种真实数据集(岛屿,湖泊,人类居住区)上进行的实验表明,我们的估算是准确的,对于范围查询的搜索工作,其几何平均相对误差范围从22%到32%,从14%自空间连接查询的选择性提高到34%。这比使用基于均匀性假设的朴素模型要好得多,后者可以使两个问题的几何平均相对误差分别高达270%和85%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号