首页> 外国专利> Transformation-based method for indexing high-dimensional data for nearest neighbour queries

Transformation-based method for indexing high-dimensional data for nearest neighbour queries

机译:基于变换的索引高维数据用于最近邻居查询的方法

摘要

We disclose a transformation-based method for indexing high-dimensional data to support similarity search. The method, iDistance, partitions the data into clusters either based on some clustering strategies or simple data space partitioning strategies. The data in each cluster can be described based on their similarity with respect to a reference point, and hence they can be transformed into a single dimensional space based on such relative similarity. This allows us to index the data points using a B+-tree structure and perform similarity search using range search strategy. As such, the method is well suited for integration into existing DBMSs. We also study two data partitioning strategies, and several methods on selection of reference points. We conducted extensive experiments to evaluate iDistance, and our results demonstrate its effectiveness.
机译:我们公开了一种基于索引的高维数据索引方法,以支持相似性搜索。 iDistance方法根据某些群集策略或简单的数据空间分区策略将数据划分为群集。可以基于它们相对于参考点的相似性来描述每个群集中的数据,因此可以基于这样的相对相似性将它们转换为一维空间。这使我们可以使用B + -树结构索引数据点,并使用范围搜索策略执行相似性搜索。因此,该方法非常适合集成到现有DBMS中。我们还研究了两种数据分区策略以及几种参考点选择方法。我们进行了广泛的实验来评估iDistance,我们的结果证明了它的有效性。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号