【24h】

Symbolic Distance Measurements Based on Characteristic Subspaces

机译:基于特征子空间的符号距离测量

获取原文
获取原文并翻译 | 示例

摘要

We introduce the subspace difference metric, a novel heterogeneous distance metric for calculating distances between points with both continuous and (unordered) categorical attributes. Our approach is based on the computation and comparison of characteristic subspaces (i.e. contexts) for each of the symbols and can be viewed as a generalization of the well-known value difference metric. Subsequently, as one possible extension, we propose a linearization of the computed symbolic distances by multidimensional scaling, thereby mapping a set of symbols onto the interval. Thus, even algorithms, which have originally been designed for usage with continuous attributes (e.g. clustering algorithms like k-means), may be applied to datasets containing discrete attributes, without having to adapt the algorithm itself. Finally, we evaluate the proposed metric and the linearization in quantitative and qualitative settings and exemplify the applicability in clustering domains.
机译:我们介绍了子空间差异度量,这是一种新颖的异构距离度量,用于计算具有连续和(无序)分类属性的点之间的距离。我们的方法基于每个符号的特征子空间(即上下文)的计算和比较,可以看作是众所周知的值差度量的概括。随后,作为一种可能的扩展,我们建议通过多维缩放对计算出的符号距离进行线性化,从而将一组符号映射到间隔上。因此,甚至最初设计用于连续属性的算法(例如,诸如k均值的聚类算法)也可以应用于包含离散属性的数据集,而不必适应算法本身。最后,我们在定量和定性设置中评估所提出的度量和线性化,并举例说明在聚类领域中的适用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号