首页> 外文期刊>Procedia Computer Science >Clustering Text Data Streams – A Tree based Approach with Ternary Function and Ternary Feature Vector
【24h】

Clustering Text Data Streams – A Tree based Approach with Ternary Function and Ternary Feature Vector

机译:聚类文本数据流–具有三元函数和三元特征向量的基于树的方法

获取原文
           

摘要

Data is the primary concern in data mining. Data Stream Mining is gaining a lot of practical significance with the huge online data generated from Sensors, Internet Relay Chats, Twitter, Facebook, Online Bank or ATM Transactions. The primary constraint in finding the frequent patterns in data streams is to perform only one time scan of the data with limited memory and requires less processing time. The concept of dynamically changing data is becoming a key challenge, what we call as data streams. In our present work, the algorithm is based on finding frequent patterns in the data streams using a tree based approach and to continuously cluster the text data streams being generated using a new ternary similarity measure defined.
机译:数据是数据挖掘中的主要关注点。通过传感器,Internet中继聊天,Twitter,Facebook,在线银行或ATM交易产生的大量在线数据,数据流挖掘正获得许多实际意义。查找数据流中频繁模式的主要限制是仅对内存有限的数据执行一次扫描,并且需要较少的处理时间。动态更改数据的概念正成为关键挑战,我们称之为数据流。在我们目前的工作中,该算法基于使用基于树的方法在数据流中查找频繁的模式,并使用定义的新三元相似性度量对正在生成的文本数据流进行连续聚类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号