首页> 外文期刊>Information Sciences: An International Journal >Clustering algorithm selection by meta-learning systems: A new distance-based problem characterization and ranking combination methods
【24h】

Clustering algorithm selection by meta-learning systems: A new distance-based problem characterization and ranking combination methods

机译:元学习系统的聚类算法选择:一种基于距离的新问题表征和排序组合方法

获取原文
获取原文并翻译 | 示例
           

摘要

Data clustering aims to segment a database into groups of objects based on the similarity among these objects. Due to its unsupervised nature, the search for a good-quality solution can become a complex process. There is currently a wide range of clustering algorithms, and selecting the best one for a given problem can be a slow and costly process. In 1976, Rice formulated the Algorithm Selection Problem (ASP), which postulates that the algorithm performance can be predicted based on the structural characteristics of the problem. Meta-learning brings the concept of learning about learning; that is, the meta-knowledge obtained from the algorithm learning process allows the improvement of the algorithm performance. Meta-learning has a major intersection with data mining in classification problems, in which it is normally used to recommend algorithms. The present paper proposes new ways to obtain meta-knowledge for clustering tasks. Specifically, two contributions are explored here: (1) a new approach to characterize clustering problems based on the similarity among objects; and (2) new methods to combine internal indices for ranking algorithms based on their performance on the problems. Experiments were conducted to evaluate the recommendation quality. The results show that the new meta-knowledge provides high-quality algorithm selection for clustering tasks. (C) 2015 Elsevier Inc. All rights reserved.
机译:数据聚类的目的是根据这些对象之间的相似性将数据库划分为对象组。由于其不受监督的性质,寻求优质解决方案的过程可能会变得很复杂。当前有各种各样的聚类算法,针对给定的问题选择最佳的聚类算法可能是一个缓慢而昂贵的过程。 1976年,赖斯制定了算法选择问题(ASP),它假定可以根据问题的结构特征来预测算法性能。元学习带来了学习的概念。也就是说,从算法学习过程获得的元知识可以提高算法性能。元学习在分类问题中与数据挖掘有很大的交集,通常用于推荐算法。本文提出了获取聚类任务元知识的新方法。具体来说,这里探讨了两个方面:(1)一种基于对象之间相似性来表征聚类问题的新方法; (2)根据内部算法对问题的表现,对内部索引进行组合的新方法。进行实验以评估推荐质量。结果表明,新的元知识为聚类任务提供了高质量的算法选择。 (C)2015 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号