首页> 外文会议>International Conference on Data Mining >A computational environment for extracting rules from databases
【24h】

A computational environment for extracting rules from databases

机译:从数据库中提取规则的计算环境

获取原文

摘要

Classification for very large databases has many practical applications in Data Mining. Thus, Machine Learning algorithms should be able to operate in massive datasets. When a dataset is too large for a particular learning algorithm to be applied, there are other ways to make learning feasible; preprocessing techniques and dataset sampling can be used to scale up classifiers to large datasets. In this work we propose a computational environment based on two architectures, one for data pre-processing and one for post-processing which allow evaluation of induced knowledge. The two architecture share a set of learning systems, which can be enhanced to support new ones. The environment is designed as a test-bed for Data Mining research, as well as a generic knowledge discovery tool for varied database domains. Flexibility is achieved by an open-ended design for extensibility, enabling integration of existing Machine Learning algorithms, support functions for pre-processing as well as new locally developed algorithm and functions.
机译:非常大型数据库的分类在数据挖掘中具有许多实际应用。因此,机器学习算法应该能够在大规模数据集中运行。当数据集太大时对于要应用的特定学习算法,还有其他方法可以学习可行;预处理技术和数据集采样可用于将分类器扩展到大型数据集。在这项工作中,我们提出了一种基于两个架构的计算环境,一个用于数据预处理,一个用于评估诱导知识的后处理。这两个架构共享一组学习系统,可以增强以支持新的系统。环境设计为数据挖掘研究的测试床,以及用于各种数据库域的通用知识发现工具。灵活性是通过开放式设计实现的可扩展性,实现现有机器学习算法的集成,支持预处理以及新的本地开发的算法和功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号