首页> 外文会议>Pacific Rim international conference on artificial intelligence >A New Context-Based Clustering Framework for Categorical Data
【24h】

A New Context-Based Clustering Framework for Categorical Data

机译:一种新的基于上下文的分类数据聚类框架

获取原文

摘要

Clustering is a fundamental task that has been utilized in many scientific fields, especially in machine learning and data mining. In clustering, dissimilarity measures play a key role in formulating clusters. For handling categorical values, the simple matching method is usually used for quantifying their dissimilarity. However, this method cannot capture the hidden semantic information that can be inferred from relationships among categories. In this paper, we propose a new clustering framework for categorical data that is capable of integrating not only the distributions of categories but also their mutual relationship information into the pattern proximity evaluation process of the clustering task. The effectiveness of the proposed clustering algorithm is proven by a comparative study conducted on existing clustering methods for categorical data.
机译:集群是一项基本任务,已在许多科学领域中得到利用,尤其是在机器学习和数据挖掘中。在聚类中,相异性度量在形成聚类中起关键作用。为了处理分类值,通常使用简单的匹配方法来量化它们的不相似性。但是,此方法无法捕获可以从类别之间的关系推断出的隐藏语义信息。在本文中,我们提出了一个新的分类数据聚类框架,该框架不仅能够将类别的分布及其相互关系信息整合到聚类任务的模式接近度评估过程中。通过对现有分类数据的聚类方法进行的比较研究证明了所提出的聚类算法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号