首页> 外文会议>Second International Conference on Data Mining, 2nd >A computational environment for extracting rules from databases
【24h】

A computational environment for extracting rules from databases

机译:从数据库中提取规则的计算环境

获取原文
获取原文并翻译 | 示例

摘要

Classification for very large databases has many practical applications in Data Mining. Thus, Machine Learning algorithms should be able to operate in massive datasets. When a dataset is too large for a particular learning algorithm to be applied, there are other ways to make learning feasible; preprocessing techniques and dataset sampling can be used to scale up classifiers to large datasets. In this work we propose a computational environment based on two architectures, one for data pre-processing and one for post-processing which allow evaluation of induced knowledge. The two architecture share a set of learning systems, which can be enhanced to support new ones. The environment is designed as a test-bed for Data Mining research, as well as a generic knowledge discovery tool for varied database domains. Flexibility is achieved by an open-ended design for extensibility, enabling integration of existing Machine Learning algorithms, support functions for pre-processing as well as new locally developed algorithm and functions.
机译:大型数据库的分类在数据挖掘中有许多实际应用。因此,机器学习算法应该能够在海量数据集中进行操作。当数据集太大而无法应用特定的学习算法时,还有其他方法可以使学习变得可行。预处理技术和数据集采样可用于将分类器扩展到大型数据集。在这项工作中,我们提出了一种基于两种体系结构的计算环境,一种用于数据预处理,另一种用于后处理,可以评估归纳知识。两种体系结构共享一组学习系统,可以对其进行增强以支持新的学习系统。该环境被设计为数据挖掘研究的试验台,以及用于各种数据库域的通用知识发现工具。通过可扩展性的开放式设计来实现灵活性,从而可以集成现有的机器学习算法,用于预处理的支持功能以及本地开发的新算法和功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号