首页> 外文会议>International Conference on Computer Communication and Informatics >Attribute Pattern Weights (APW): A Scale to Detect Concept Drift in Data Stream Mining Models
【24h】

Attribute Pattern Weights (APW): A Scale to Detect Concept Drift in Data Stream Mining Models

机译:属性模式权重(APW):一种检测数据流挖掘模型中概念漂移的比例

获取原文

摘要

Extracting data structures from dynamic real-time data records is gaining prominence across industries. The need for massive mining of data sequences is increasingly observed in a wide range of user applications including social network platforms, banking sector, genomics, telecom sector, e-commerce and other sectors. To analyse multiple streams of data that is, for understanding rapid sequences of data flowing at continuous intervals, researchers are focusing on continuous improvements in data stream mining. Application of data mining models (like classifiers) in data streaming scenario mandates accurate detection of data distribution. Further, the model should adapt quickly to any variations in the distribution patterns to ensure the sustained performance of model predictability. Referred to as drift detection, the process can be gradual or abrupt. Extensive research has been made, designing several algorithms to accurately detect the type of drift and to adapt to shifts drift approaches. However, even the most reputed concept drift models have limited ability to adapt to both types of drift. The relationship between the adaptability and predictor variables is based on data distribution features and its sensitivity to in-built parameters. In this context, concept drift detection using attribute pattern weight (APW) is proposed here in this manuscript. Unlike the many of existing models, the proposed model is not dependent of any of the process targeted to apply on streaming data. The other significance of the proposed model is to detect the both types of concept drift that is gradual or abrupt. The experimental study that carried is evincing the scalability and robustness, and significance of the proposed model.
机译:从动态实时数据记录中提取数据结构正在跨行业突出。在广泛的用户应用程序中,在包括社交网络平台,银行业,基因组学,电信部门,电子商务等部门的广泛的用户应用中越来越多地观察到数据序列的大量挖掘。为了了解多个数据流,即用于了解以连续间隔流动的快速数据序列,研究人员专注于数据流挖掘的持续改进。数据挖掘模型(如分类器)在数据流方案中的应用要求准确地检测数据分布。此外,该模型应该快速适应分布模式的任何变化,以确保模型可预测性的持续性能。称为漂移检测,过程可以逐渐或突然。已经进行了广泛的研究,设计了几种算法,可以精确地检测漂移类型并适应移位漂移方法。然而,即使是最知名的概念漂移模型也具有有限的适应两种类型的漂移能力。适应性和预测变量之间的关系基于数据分布特征及其对内置参数的敏感性。在此上下文中,在此稿件中提出了使用属性模式权重(APW)的概念漂移检测。与许多现有模型不同,所提出的模型不依赖于针对应用于流数据的任何过程。拟议模型的其他重要性是检测逐渐或突然的两种类型的概念漂移。携带的实验研究表明了所提出的模型的可扩展性和鲁棒性和意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号