首页> 外文期刊>Future generation computer systems >Distributed feature extraction in a p2p setting - a case study
【24h】

Distributed feature extraction in a p2p setting - a case study

机译:p2p设置中的分布式特征提取-案例研究

获取原文
获取原文并翻译 | 示例
           

摘要

Finding the right data representation is essential for virtually every data mining application. In this work we describe an approach to collaborative feature extraction, selection and aggregation in distributed, loosely coupled domains. In contrast to other work in the field of distributed data mining, we focus on scenarios in which a large number of loosely coupled nodes apply data mining to different, usually very small and overlapping, subsets of the entire data space. The aim is not to find a global concept to cover all data, but to learn a set of local concepts. Our prototypical application is a distributed media organization platform, called Nemoz, that assists users in maintaining their media collections. We propose two models for collaborative feature extraction, selection and aggregation for supervised data mining. One is based on a centralized p2p architecture, and the other on a fully distributed p2p architecture. We compare both models on a real world data set and discuss their advantages and problems. (C) 2006 Elsevier B.V. All rights reserved.
机译:对于几乎每个数据挖掘应用程序而言,找到正确的数据表示形式都是至关重要的。在这项工作中,我们描述了一种在分布式,松散耦合域中的协作特征提取,选择和聚合的方法。与分布式数据挖掘领域的其他工作相比,我们关注的场景是大量松散耦合的节点将数据挖掘应用于整个数据空间的不同子集,通常是非常小的和重叠的子集。目的不是要找到一个涵盖所有数据的全局概念,而是要学习一组本地概念。我们的原型应用程序是一个名为Nemoz的分布式媒体组织平台,可帮助用户维护其媒体收藏。我们提出了两种用于协作特征提取,选择和聚合的模型,用于监督数据挖掘。一种基于集中式p2p架构,另一种基于完全分布式的p2p架构。我们在现实世界的数据集上比较这两种模型,并讨论它们的优势和问题。 (C)2006 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号