首页> 外文会议>3rd International Conference on Data Mining and Intelligent Information Technology Applications >OVFDT with functional tree leaf — Majority class, naive Bayes and adaptive hybrid integrations
【24h】

OVFDT with functional tree leaf — Majority class, naive Bayes and adaptive hybrid integrations

机译:具有功能性叶子的OVFDT-多数类,朴素贝叶斯和自适应混合集成

获取原文

摘要

Very Fast Decision Tree (VFDT) is an exemplar of classification techniques in data stream mining where models are built by incremental learning from continuously arriving data instead of batches. Many variations and modifications were made upon VFDT since it was first introduced in year 2000. Novel contributions were mainly made in two aspects of VFDT, tree induction process and prediction process, for the sake of improving its prediction accuracy. The basic concept of inducing a VFDT is to use fresh instances from data stream for recursively replacing leaves with decision nodes. This standard version of VFDT therefore simply predicts or classifies new instance by the distribution counts of the past samples at the leaves. Gama et al, extended VFDT to VFDTNB, by installing a naive Bayes classifier at each leaf in the prediction process, so that prior probabilities as referenced from attribute-values at the leaves can be used to refine the prediction accuracy. They called this technique in general, Functional Tree Leaf. Recently a new version of VFDT called Optimized-VFDT or OVFDT has been proposed by the authors that achieve relatively good prediction accuracy and compact tree sizes by controlling the node-splitting and pruning in the tree induction process. Naturally these two types of enhanced algorithms, OVFDT and Functional Tree Leaf, which are both based on incremental learning, can be integrated together in each respective process, like two sides of a hand, for further performance improvement. Our paper reports about this integration and the experimental results.
机译:超快速决策树(VFDT)是数据流挖掘中分类技术的典范,其中通过从连续到达的数据而不是批处理中进行增量学习来构建模型。自从2000年首次引入VFDT以来,对VFDT进行了许多变型和修改。为提高其预测精度,主要在VFDT的两个方面做出了新的贡献,即树木归纳过程和预测过程。引入VFDT的基本概念是使用数据流中的新实例来用决策节点递归替换叶子。因此,此VFDT的标准版本仅通过叶子上过去样本的分布计数来简单预测或分类新实例。 Gama等人通过在预测过程中在每个叶子上安装朴素贝叶斯分类器,将VFDT扩展为VFDT NB ,以便可以使用从叶子的属性值引用的先验概率来细化预测准确性。他们通常将此功能称为功能树叶子。最近,作者提出了一种新版本的VFDT,称为Optimized-VFDT或OVFDT,它通过控制树的诱导过程中的节点拆分和修剪来实现相对较好的预测精度和紧凑的树大小。自然地,这两种基于增量学习的增强算法OVFDT和功能树叶子可以像手的两侧一样在每个相应的过程中集成在一起,以进一步提高性能。我们的论文报道了这种整合和实验结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号