首页> 外文会议>International Conference on Complex, Intelligent and Software Intensive Systems >Clusters of Trends Detection in Microblogging: Simple Natural Language Processing vs Hashtags – Which is More Informative?
【24h】

Clusters of Trends Detection in Microblogging: Simple Natural Language Processing vs Hashtags – Which is More Informative?

机译:微博中的趋势检测集群:简单的自然语言处理与主题标签-哪个更具信息意义?

获取原文

摘要

In this paper we introduce the initial proposition and evaluation of the method that enables detection of clusters of trends among microblogging posts gathered from a given social graph. By the cluster of trends we mean the trending words that are popular among same group of people and which describes their common interests. The information about shared interests of group of people in the social network is very important for business. Knowing it we can for example perform directed advertising campaign aimed at single community of people. We validate our approach on large datasets that contains 22 030 252 tweets posted by 20 130 followers of the world-known actress. We found that clusters of trends detection in microblogging with simple natural language processing (namely lemmatization) did not give any valuable information for business. For the other side hashtags frequency filtering and probability conditional probabilities graph clustering resulted in valuable informative about structure of interest in social network.
机译:在本文中,我们介绍了该方法的初始命题和评估,该方法能够检测从给定社交图收集的微博帖子之间的趋势簇。趋势集指的是在同一群人中流行并描述他们共同利益的趋势词。社交网络中有关一群人的共同利益的信息对于业务非常重要。知道了这一点,我们就可以开展针对单个人群的定向广告活动。我们对包含22 030 252条推文的大型数据集验证了我们的方法,该推文由世界著名女演员的20 130位关注者发布。我们发现,通过简单的自然语言处理(即词形化)在微博中进行趋势检测的集群并没有为企业提供任何有价值的信息。另一方面,频率标签和概率条件概率图聚类可为社交网络中的兴趣结构提供有价值的信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号