首页> 外文期刊>Future generation computer systems >A grid-based approach for enterprise-scale data mining
【24h】

A grid-based approach for enterprise-scale data mining

机译:基于网格的企业规模数据挖掘方法

获取原文
获取原文并翻译 | 示例
           

摘要

We describe a grid-based approach for enterprise-scale data mining, which is based on leveraging parallel database technology for data storage, and on-demand compute servers for parallelism in the statistical computations. This approach is targeted towards the use of data mining in highly-automated vertical business applications, where the data is stored on one or more relational database systems, and an independent set of high-performance compute servers or a network of low-cost, commodity processors is used to improve the application performance and overall workload management. The goal of this paper is to describe an algorithmic decomposition of data mining kernels between the data storage and compute grids, which makes it possible to exploit the parallelism on the respective grids in a simple way, while minimizing the data transfer between these grids. This approach is compatible with existing standards for data mining task specification and results reporting, so that larger applications using these data mining algorithms do not have to be modified to benefit from this grid-based approach. (C) 2006 Elsevier B.V. All rights reserved.
机译:我们描述了一种基于网格的企业规模数据挖掘方法,该方法基于利用并行数据库技术进行数据存储,并利用按需计算服务器来实现统计计算中的并行性。这种方法的目标是在高度自动化的垂直业务应用程序中使用数据挖掘,在该应用程序中,数据存储在一个或多个关系数据库系统以及一组独立的高性能计算服务器或低成本商品网络中处理器用于提高应用程序性能和整体工作负载管理。本文的目的是描述数据存储和计算网格之间的数据挖掘内核的算法分解,这使得有可能以一种简单的方式利用各个网格上的并行性,同时最大程度地减少这些网格之间的数据传输。这种方法与数据挖掘任务规范和结果报告的现有标准兼容,因此不必修改使用这些数据挖掘算法的大型应用程序,即可从基于网格的方法中受益。 (C)2006 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号