首页> 外文期刊>Computers & Security >Improving cyberbullying detection using Twitter users' psychological features and machine learning
【24h】

Improving cyberbullying detection using Twitter users' psychological features and machine learning

机译:利用Twitter用户的心理特征和机器学习改善网络欺凌检测

获取原文
获取原文并翻译 | 示例
           

摘要

Empirical evidences linking users' psychological features such as personality traits and cybercrimes such as cyberbullying are many. This study deals with automatic cyberbullying detection mechanism tapping into Twitter users' psychological features including personalities, sentiment and emotion. User personalities were determined using Big Five and Dark Triad models, whereas machine learning classifiers namely, Naive Bayes, Random Forest and J48 were used to classify the tweets into one of four categories: bully, aggressor, spammer and normal. The Twitter dataset contained 5453 tweets gathered using the hashtag #Gamergate, and manually annotated by human experts. Selected Twitter-based features namely text, user and network-based features were used as the baseline algorithm. Results show that cyberbullying detection improved when personalities and sentiments were used, however, a similar effect was not observed for emotion. A further analysis on the personalities revealed extraversion, agreeableness, neuroti-cism and psychopathy to have greater impacts in detecting online bullying compared to other traits. Key features were identified using the dimension reduction technique, and integrated into a single model, which produced the best detection accuracy. The paper describes suggestions and recommendations as to how the findings can be applied to mitigate cyberbullying.
机译:将用户的心理特征(如人格特征)与网络犯罪(如网络欺凌)联系起来的经验证据很多。这项研究涉及自动网络欺凌检测机制,该机制利用了Twitter用户的心理特征,包括个性,情感和情绪。用户个性是使用“大五人”和“暗黑社会”模型确定的,而机器学习分类器(朴素贝叶斯,随机森林和J48)则将推文分为以下四类之一:欺负,攻击者,垃圾邮件发送者和正常。 Twitter数据集包含使用标签#Gamergate收集的5453条推文,并由人类专家手动注释。选定的基于Twitter的功能(即基于文本,用户和基于网络的功能)被用作基准算法。结果表明,当使用个性和情感时,网络欺凌检测得到改善,但是,对于情绪,没有观察到类似的效果。对人格的进一步分析显示,与其他特征相比,外向性,顺从性,神经质和精神病对检测在线欺凌的影响更大。使用降维技术确定了关键特征,并将其集成到单个模型中,从而产生了最佳的检测精度。本文描述了有关如何将发现应用于减轻网络欺凌的建议。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号