首页> 外文期刊>Physical Review X >Network Structure, Metadata, and the Prediction of Missing Nodes and Annotations
【24h】

Network Structure, Metadata, and the Prediction of Missing Nodes and Annotations

机译:网络结构,元数据和缺少节点和注释的预测

获取原文
           

摘要

The empirical validation of community detection methods is often based on available annotations on the nodes that serve as putative indicators of the large-scale network structure. Most often, the suitability of the annotations as topological descriptors itself is not assessed, and without this it is not possible to ultimately distinguish between actual shortcomings of the community detection algorithms, on one hand, and the incompleteness, inaccuracy, or structured nature of the data annotations themselves, on the other. In this work, we present a principled method to access both aspects simultaneously. We construct a joint generative model for the data and metadata, and a nonparametric Bayesian framework to infer its parameters from annotated data sets. We assess the quality of the metadata not according to their direct alignment with the network communities, but rather in their capacity to predict the placement of edges in the network. We also show how this feature can be used to predict the connections to missing nodes when only the metadata are available, as well as predicting missing metadata. By investigating a wide range of data sets, we show that while there are seldom exact agreements between metadata tokens and the inferred data groups, the metadata are often informative of the network structure nevertheless, and can improve the prediction of missing nodes. This shows that the method uncovers meaningful patterns in both the data and metadata, without requiring or expecting a perfect agreement between the two.
机译:社区检测方法的经验验证通常基于节点上的可用注释,其作为大规模网络结构的推定指标。最常,注释作为拓扑描述符本身的适用性不会被评估,而且没有这种情况,不可能将社区检测算法的实际缺点一方面区分开,以及不完整,不准确或结构性数据注释本身,另一方面。在这项工作中,我们提出了一个主要的方法来同时访问两个方面。我们为数据和元数据构建联合生成模型,以及非参数贝叶斯框架,用于从注释数据集推断其参数。我们不根据与网络社区的直接对齐方式评估元数据的质量,而是以其预测网络中边缘放置的能力。我们还展示了如何使用该功能,以预测只有元数据时与缺少节点的连接,以及预测丢失的元数据。通过调查各种数据集,我们表明,虽然元数据令牌和推断的数据组之间很少有精确的协议,但是元数据往往是无线信息的网络结构,并且可以改善缺失节点的预测。这表明该方法在数据和元数据中发现有意义的模式,而无需需要或期望两者之间的完美协议。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号