首页> 外文期刊>Expert Systems with Application >A novel clustering algorithm based on data transformation approaches
【24h】

A novel clustering algorithm based on data transformation approaches

机译:一种基于数据转换方法的新型聚类算法

获取原文
获取原文并翻译 | 示例
           

摘要

Clustering provides a knowledge acquisition method for intelligent systems. This paper proposes a novel data-clustering algorithm, by combining a new initialization technique, K-means algorithm and a new gradual data transformation approach to provide more accurate clustering results than the K-means algorithm and its variants by increasing the clusters' coherence. The proposed data transformation approach solves the problem of generating empty clusters, which frequently occurs for other clustering algorithms. An efficient method based on the principal component transformation and a modified silhouette algorithm is also proposed in this paper to determine the number of clusters. Several different data sets are used to evaluate the efficacy of the proposed method to deal with the empty cluster generation problem and its accuracy and computational performance in comparison with other K-means based initialization techniques and clustering methods. The developed estimation method for determining the number of clusters is also evaluated and compared with other estimation algorithms. Significances of the proposed method include addressing the limitations of the K-means based clustering and improving the accuracy of clustering as an important method in the field of data mining and expert systems. Application of the proposed method for the knowledge acquisition in time series data such as wind, solar, electric load and stock market provides a pre-processing tool to select the most appropriate data to feed in neural networks or other estimators in use for forecasting such time series. In addition, utilization of the knowledge discovered by the proposed K-means clustering to develop rule based expert systems is one of the main impacts of the proposed method. (C) 2017 Elsevier Ltd. All rights reserved.
机译:聚类为智能系统提供了一种知识获取方法。本文提出了一种新颖的数据聚类算法,通过结合一种新的初始化技术,K-means算法和一种新的渐进数据转换方法,通过增加聚类的一致性,提供了比K-means算法及其变体更准确的聚类结果。提出的数据转换方法解决了生成空聚类的问题,该问题在其他聚类算法中经常发生。本文还提出了一种基于主成分变换和改进的轮廓算法的有效方法来确定聚类数目。与其他基于K均值的初始化技术和聚类方法相比,使用了几种不同的数据集来评估该方法处理空聚类生成问题的有效性及其准确性和计算性能。还评估了用于确定簇数的已开发估计方法,并将其与其他估计算法进行了比较。提出的方法的意义包括解决基于K均值的聚类的局限性和提高聚类的准确性,这是数据挖掘和专家系统领域中的一种重要方法。所提出的方法在时间序列数据(例如风能,太阳能,电力负荷和股票市场)中的知识获取的应用提供了一种预处理工具,可以选择最合适的数据以馈入神经网络或其他用于估算此类时间的估计器系列。另外,利用通过提出的K-means聚类发现的知识来开发基于规则的专家系统是提出的方法的主要影响之一。 (C)2017 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号