Clustering algorithm selection by meta-learning systems: A new distance-based problem characterization and ranking combination methods

Ferrari Daniel Gomes; de Castro Leandro Nunes


Abstract

Data clustering aims to segment a database into groups of objects based on the similarity among these objects. Due to its unsupervised nature, the search for a good-quality solution can become a complex process. There is currently a wide range of clustering algorithms, and selecting the best one for a given problem can be a slow and costly process. In 1976, Rice formulated the Algorithm Selection Problem (ASP), which postulates that the algorithm performance can be predicted based on the structural characteristics of the problem. Meta-learning brings the concept of learning about learning; that is, the meta-knowledge obtained from the algorithm learning process allows the improvement of the algorithm performance. Meta-learning has a major intersection with data mining in classification problems, in which it is normally used to recommend algorithms. The present paper proposes new ways to obtain meta-knowledge for clustering tasks. Specifically, two contributions are explored here: (1) a new approach to characterize clustering problems based on the similarity among objects; and (2) new methods to combine internal indices for ranking algorithms based on their performance on the problems. Experiments were conducted to evaluate the recommendation quality. The results show that the new meta-knowledge provides high-quality algorithm selection for clustering tasks. (C) 2015 Elsevier Inc. All rights reserved.
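
The abstract names its two contributions without giving their details, so the following is only a minimal sketch of the general ideas, not the authors' method. It assumes that a clustering problem can be summarised by statistics of its normalised pairwise Euclidean distances, and that rankings from several internal indices can be combined by averaging ranks. The function names, the chosen statistics, the index names, and all scores are hypothetical, made-up illustrations.

```python
import math
import statistics
from itertools import combinations


def distance_meta_features(points):
    """Summarise a dataset by statistics of its normalised pairwise distances.

    Assumption: these four statistics are illustrative, not the paper's set.
    """
    dists = [math.dist(p, q) for p, q in combinations(points, 2)]
    d_max = max(dists)
    # Scale to [0, 1] so datasets measured in different units are comparable.
    norm = [d / d_max for d in dists] if d_max > 0 else [0.0] * len(dists)
    return {
        "mean": statistics.fmean(norm),
        "stdev": statistics.pstdev(norm),
        "median": statistics.median(norm),
        "frac_below_0.25": sum(d < 0.25 for d in norm) / len(norm),
    }


def rank_scores(scores, higher_is_better=True):
    """Rank algorithms by one index (1 = best); ties share the average rank."""
    ordered = sorted(scores, key=scores.get, reverse=higher_is_better)
    ranks = {}
    i = 0
    while i < len(ordered):
        j = i
        # Extend j over the run of algorithms tied with position i.
        while j + 1 < len(ordered) and scores[ordered[j + 1]] == scores[ordered[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for name in ordered[i:j + 1]:
            ranks[name] = avg
        i = j + 1
    return ranks


def combine_rankings(index_scores, higher_is_better):
    """Average each algorithm's rank over all indices (an assumed combiner)."""
    per_index = [
        rank_scores(scores, higher_is_better[index])
        for index, scores in index_scores.items()
    ]
    algorithms = list(per_index[0])
    mean_rank = {a: statistics.fmean(r[a] for r in per_index) for a in algorithms}
    return sorted(mean_rank, key=mean_rank.get), mean_rank


# Hypothetical toy data: two tight pairs of points far apart from each other.
points = [(0.0, 0.0), (0.0, 1.0), (5.0, 5.0), (5.0, 6.0)]
features = distance_meta_features(points)

# Made-up index values; silhouette is better when higher,
# Davies-Bouldin is better when lower.
index_scores = {
    "silhouette": {"k-means": 0.62, "single-link": 0.55, "average-link": 0.60},
    "davies_bouldin": {"k-means": 0.48, "single-link": 0.70, "average-link": 0.52},
}
higher_is_better = {"silhouette": True, "davies_bouldin": False}
order, mean_rank = combine_rankings(index_scores, higher_is_better)
```

In a meta-learning setting, feature dictionaries like `features` would describe each past problem, and combined rankings like `order` would be the targets a recommender learns to predict for a new problem. Rank averaging is only one possible combiner; it is used here because it needs no assumptions about the scale of each index.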

Bibliographic details

Source
Information Sciences: An International Journal | 2015 | 14 pages
Authors
Ferrari Daniel Gomes; de Castro Leandro Nunes
Original format: PDF
Language: English
Chinese Library Classification: automatic information theory
Keywords
Clustering; Problem characterization; Algorithm ranking; Algorithm selection; Meta-knowledge; Meta-learning systems

