首页> 外文会议>Machine learning and data mining in pattern recognition >SOStream: Self Organizing Density-Based Clustering over Data Stream
【24h】

SOStream: Self Organizing Density-Based Clustering over Data Stream

机译:SOStream:在数据流上自组织基于密度的群集

获取原文
获取原文并翻译 | 示例

摘要

In this paper we propose a data stream clustering algorithm, called Self Organizing density based clustering over data Stream (SOStream). This algorithm has several novel features. Instead of using a fixed, user defined similarity threshold or a static grid, SOStream detects structure within fast evolving data streams by automatically adapting the threshold for density-based clustering. It also employs a novel cluster updating strategy which is inspired by competitive learning techniques developed for Self Organizing Maps (SOMs). In addition, SOStream has built-in online functionality to support advanced stream clustering operations including merging and fading. This makes SOStream completely online with no separate offline components. Experiments performed on KDD Cup'99 and artificial datasets indicate that SOStream is an effective and superior algorithm in creating clusters of higher purity while having lower space and time requirements compared to previous stream clustering algorithms.
机译:在本文中,我们提出了一种数据流聚类算法,称为基于自组织密度的数据流聚类(SOStream)。该算法具有几个新颖的特征。 SOStream无需使用固定的,用户定义的相似性阈值或静态网格,而是通过自动调整阈值以基于密度的群集来检测快速发展的数据流中的结构。它还采用了一种新颖的集群更新策略,该策略受到为自组织地图(SOM)开发的竞争性学习技术的启发。此外,SOStream具有内置的联机功能,以支持高级流群集操作,包括合并和衰落。这使SOStream完全在线,没有单独的脱机组件。在KDD Cup'99和人工数据集上进行的实验表明,与以前的流聚类算法相比,SOStream是创建高纯度聚类的有效且优越的算法,同时具有较低的空间和时间要求。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号