...
首页> 外文期刊>Future generation computer systems >A novel parallel distance metric-based approach for diversified ranking on large graphs
【24h】

A novel parallel distance metric-based approach for diversified ranking on large graphs

机译:一种基于并行距离度量的新颖方法,用于在大图上进行多样化排名

获取原文
获取原文并翻译 | 示例
           

摘要

Diversified ranking on graphs (DRG) is an important and challenging issue in researching graph data mining. Traditionally, this problem is modeled by a submodular optimization objective, and solved by applying a cardinality constrained monotone submodular maximization. However, the existing submodular objectives do not directly capture the dis-similarity over pairs of nodes, while most of algorithms cannot easily take full advantage of the power of a distributed cluster computing platform, such as Spark, to significantly promote the efficiency of algorithms. To overcome the deficiencies of existing approaches, in this paper, a generalized distance metric based on a subadditive set function over the symmetry difference of neighbors of pairs of nodes is introduced to capture the pairwise dis-similarity over pairs of nodes. In our approach,DRGis formulated as a Max-Sumk-dispersion problem with metrical edge weights, which is NP-hard, in association with the proposed distance metric, a centralized linear time 2-approximation algorithm GA is then developed to significantly solve the problem ofDRG. Moreover, we develop a highly parallelizable algorithm forDRG, which can be easily implemented in MapReduce style parallel computation models using GA as a basic reducer. Finally, extensive experiments are conducted on real network datasets to verify the effectiveness and efficiency of our proposed approaches.
机译:图的多样化排序(DRG)是研究图数据挖掘的重要且具有挑战性的问题。传统上,此问题是通过亚模优化目标建模的,并通过应用基数约束的单调亚模最大化来解决。但是,现有的次模块目标无法直接捕获节点对之间的不相似性,而大多数算法无法轻松利用分布式集群计算平台(例如Spark)的功能来显着提高算法效率。为了克服现有方法的不足,本文提出了一种基于次可加集合函数的节点对邻居对称性差异的广义距离度量,以捕获节点对之间的成对不相似性。在我们的方法中,DRG被公式化为具有度量边缘权重的Max-Sumk色散问题,该问题是NP难的,结合提出的距离度量,然后开发了集中式线性时间2近似算法GA以显着解决该问题DRG。此外,我们为DRG开发了高度可并行化的算法,可以使用GA作为基本的约简工具在MapReduce风格的并行计算模型中轻松实现。最后,在真实的网络数据集上进行了广泛的实验,以验证我们提出的方法的有效性和效率。

著录项

  • 来源
    《Future generation computer systems》 |2018年第11期|79-91|共13页
  • 作者单位

    School of Software, Yunnan University,Key Laboratory of Software Engineering of Yunnan Province,Kunming Key Laboratory of Data Science and Intelligent Computing;

    School of Software, Yunnan University,Key Laboratory of Software Engineering of Yunnan Province,Kunming Key Laboratory of Data Science and Intelligent Computing;

    Shanghai Key Laboratory of Trustworthy Computing, East China Normal University;

    University of Amsterdam;

    Key Laboratory of Software Engineering of Yunnan Province,Kunming Key Laboratory of Data Science and Intelligent Computing;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Graph algorithms; Diversified ranking; Distance metric; Parallel computing; MapReduce;

    机译:图算法;多样化排名;距离度量;并行计算;MapReduce;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号