首页> 外文会议>IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining >Multi-label collective classification in multi-attribute multi-relational network data
【24h】

Multi-label collective classification in multi-attribute multi-relational network data

机译:多属性多关系网络数据中的多标签集体分类

获取原文

摘要

Classical machine learning techniques assume the data to be i.i.d., but the real world data is inherently relational and can generally be represented using graphs or some variants of a graph representation. The importance of modeling relational data is evident from its increasing presence in many domains: Telecom networks, WWW, social networks, organizational networks, images, protein sequences, etc. This field has recently been receiving a lot of attention in various communities under different themes depending on the problem addressed and the nature of solution proposed. Collective classification is one such popular approach which involves the use of a local classifier that embeds the node's own attributes and neighbors' information in a feature vector, and classifies the nodes in an iterative procedure. Despite the increasing popularity, there is not much attention paid towards datasets with multiple attributes and multi-relational (MAMR) networks under multi-label scenarios. In MAMR data, nodes can be represented using multiple types of attributes (attribute views) and there are multiple link types between the nodes. For example, in Twitter, users can be represented using their tweets, urls shared, hashtags and list memberships. And different Twitter users can be connected using follower, followed by and re-tweet links. Secondly, in many networks, nodes are associated with more than one label. For instance, Twitter users can be tagged with one or more labels from a set L, where L contains various movie genres that a user might like. Motivated by this, we propose a learning technique for multi-label collective classification using multiple attribute views on multi-relational network data which captures complex label correlations within and across attribute/relationship types. We empirically evaluate our proposed approach on Twitter and MovieLens datasets, and we show that it performs better than the state-of-art approaches.
机译:经典的机器学习技术假定数据为i.d.,但现实世界的数据本质上是关系型的,通常可以使用图形或图形表示的某些变体来表示。关系数据建模在许多领域(例如电信网络,WWW,社交网络,组织网络,图像,蛋白质序列等)中的应用日益广泛,这一点显而易见,它的重要性显而易见。最近,该领域在不同主题下的各个社区中受到了广泛关注。取决于解决的问题和建议的解决方案的性质。集体分类就是这样一种流行的方法,它涉及使用局部分类器,该局部分类器将节点自身​​的属性和邻居的信息嵌入特征向量中,并以迭代过程对节点进行分类。尽管越来越受欢迎,但是在多标签方案下,具有多个属性和多关系(MAMR)网络的数据集却没有引起太多关注。在MAMR数据中,可以使用多种类型的属性(属性视图)来表示节点,并且节点之间存在多种链接类型。例如,在Twitter中,可以使用其推文,共享的URL,标签和列表成员身份来表示用户。可以使用关注者,后面跟着并重新发送链接的方式来连接不同的Twitter用户。其次,在许多网络中,节点与多个标签相关联。例如,可以为Twitter用户标记来自集合L的一个或多个标签,其中L包含用户可能喜欢的各种电影流派。因此,我们提出了一种使用多关系网络数据上的多个属性视图进行多标签集体分类的学习技术,该视图可捕获属性/关系类型之内和之间的复杂标签相关性。我们对Twitter和MovieLens数据集上的拟议方法进行了经验评估,结果表明,该方法的性能优于最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号