【24h】

Inverted Voronoi-Based kNN Query Processing with MapReduce

机译:使用MapReduce的基于Voronoi的反向kNN查询处理

获取原文
获取原文并翻译 | 示例

摘要

In mobile cloud computing environments distributed k Nearest Neighbor (kNN) query is an important issue. We consider the problem of processing kNN query over large data sets where the index is jointly maintained by a set of machines in a computing cluster. The kNN query is a primitive operator that is widely used in many fields ranging from knowledge discovery, data mining and spatial databases etc. A scalable and distributed spatial data index plays an important role in conducting kNN query effectively. We can use different ways to conduct distributed indexes and kNN query processing by using MapReduce, i.e. R-tree and Grid-based index, etc. Nevertheless, R-tree is not compatible with parallelization, and Grid is a many-to-many index, which could potentially lead to content redundancy. In the paper, a distributed method of kNN queries applying MapReduce program model will be introduced. In the very beginning, I propose distributed methods which set up a novel distributed spatial data index: Inverted Voronoi Index that combines both inverted index and Voronoi diagram. Next, I propose a kNN queries processing algorithm, it is very efficient because it is based on Voronoi and uses MapReduce. Last but not least, I present the outcomes of extensive experiment that are gained by both real and simulated data sets which indicate efficiency and scalability of the proposed approach.
机译:在移动云计算环境中,分布式k最近邻(kNN)查询是一个重要问题。我们考虑在大型数据集上处理kNN查询的问题,其中索引是由计算集群中的一组计算机共同维护的。 kNN查询是一种原始运算符,已广泛应用于知识发现,数据挖掘和空间数据库等许多领域。可伸缩且分布式的空间数据索引在有效进行kNN查询中起着重要作用。我们可以使用不同的方式通过MapReduce进行分布式索引和kNN查询处理,即R-tree和基于Grid的索引等。尽管如此,R-tree与并行化不兼容,并且Grid是多对多索引,这可能会导致内容冗余。本文介绍了一种应用MapReduce程序模型的分布式kNN查询方法。从一开始,我就提出了建立新的分布式空间数据索引的分布式方法:结合了反向索引和Voronoi图的反向Voronoi索引。接下来,我提出了一种kNN查询处理算法,该算法非常有效,因为它基于Voronoi并使用MapReduce。最后但并非最不重要的一点是,我介绍了通过实际数据集和模拟数据集获得的广泛实验的结果,这些结果表明了所提出方法的效率和可扩展性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号