首页> 外文会议>International Conference on Computer and Knowledge engineering >Enhancing trust accuracy among online social network users utilizing data text mining techniques in apache spark
【24h】

Enhancing trust accuracy among online social network users utilizing data text mining techniques in apache spark

机译:利用Apache Spark中的数据文本挖掘技术提高在线社交网络用户之间的信任准确性

获取原文

摘要

The number of users and amount of data transfer are increasing per each minute with the rapid growth of social network platforms on the web while the users have no certain knowledge of each other. Thus, with the overwhelming spread of the internet and such bulk of data, people find it arduous to identify valid comments. Establishing a genuine and more accurate trust becomes harder if classical processing is used especially with the presence of profitable, oriented, devious and narrow-minded comments. Various methods have been employed so far to evaluate reliable users most of which combine trust algorithms, subject classification, and comment mining methods. Researches reveal that the majority of social network users firstly take into account an overall number of public trust standards such as the number of friends, followers, followings, and likes of individuals in order to trust them. However, a malicious user could manipulate this trust by building virtual qualities. Accordingly, this study supplies a dictionary of malicious words and weighs them by combining trust standards and text mining users' tweets. It is intended to identify malicious users and analyze their behavior to proceed a more accurate trust within distributed execution in Spark environment for providing a quicker call. The results of this study show that the suggested method benefits from a high diagnostic accuracy.
机译:随着社交网络平台在网络上的快速发展,每分钟用户的数量和数据传输量都在增加,而用户之间却没有一定的了解。因此,随着互联网的压倒性普及和如此大量的数据,人们发现识别有效评论很困难。如果使用经典处理方法,尤其是在存在有利可图,定向,oriented回和狭and评论的情况下,建立真正,更准确的信任将变得更加困难。到目前为止,已经采用了各种方法来评估可靠的用户,其中大多数方法结合了信任算法,主题分类和评论挖掘方法。研究表明,大多数社交网络用户首先考虑了公共信任标准的总数,例如朋友,追随者,追随者和个人的喜好数,以信任他们。但是,恶意用户可以通过建立虚拟质量来操纵这种信任。因此,本研究提供了恶意单词字典,并通过结合信任标准和文本挖掘用户的推文对它们进行加权。它旨在识别恶意用户并分析其行为,以在Spark环境中的分布式执行中进行更准确的信任,以提供更快的调用。这项研究的结果表明,建议的方法受益于较高的诊断准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号