首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >M-N scatter plots technique for evaluating varying-size clusters and setting the parameters of Bi-CoPaM and Uncles methods
【24h】

M-N scatter plots technique for evaluating varying-size clusters and setting the parameters of Bi-CoPaM and Uncles methods

机译:M-N散点图技术,用于评估大小可变的簇并设置Bi-CoPaM和Uncles方法的参数

获取原文

摘要

The recently proposed UNCLES method has the ability to unify clustering results from multiple datasets under different types of external specifications. It can also tunably tighten the results such that many objects are unassigned from all of the clusters to obtain few tight clusters. Despite the success of this method, setting its parameters, such as the number of clusters (K) and the tuning parameters δ and (δ+, δ−), has never been automated. As its clusters vary in size, they cannot be validated by the existing validation indices. In this study we present a technique of validation based on our proposed M-N scatter plots. This technique has the ability to provide better fitness values for the clusters which include more objects while preserving their tightness. This well suits the nature of the results of UNCLES. We have applied this technique to a set of bacterial microarray datasets as well as a set of English vowels datasets. Our results demonstrate the success of the M-N plots in selecting the best few clusters out of a pool of clusters generated under varying K, δ, and (δ+, δ−) values. Our results also show that the best few clusters can be originated from different partitions, which shows the power of our technique in evaluating individual clusters rather than whole partitions. Finally, despite proposing this technique within the context of the UNCLES framework, it is readily applicable to other clustering results, especially when the parameters are not confidently predefined.
机译:最近提出的UNCLES方法具有统一来自不同类型外部规范的多个数据集的聚类结果的能力。它还可以调整结果,以使从所有群集中取消分配许多对象以获得很少的密集群集。尽管此方法取得了成功,但从未自动设置其参数(例如,簇数(K)以及调整参数δ和(δ+,δ-))。由于其集群大小不同,因此无法通过现有的验证索引进行验证。在这项研究中,我们提出了一种基于提出的M-N散点图的验证技术。该技术能够为包含更多对象的群集提供更好的适应度值,同时保持其紧密度。这非常适合UNCLES结果的性质。我们已将此技术应用于一组细菌微阵列数据集以及一组英语元音数据集。我们的结果证明了M-N图在从变化的K,δ和(δ+,δ-)值生成的簇池中选择最佳的几个簇的成功。我们的结果还表明,最好的几个群集可以源自不同的分区,这表明了我们的技术在评估单个群集而不是整个分区方面的强大功能。最后,尽管在UNCLES框架的上下文中提出了此技术,但它很容易应用于其他聚类结果,尤其是在参数没有确定地预定义的情况下。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号