【24h】

Aggressive language in an online hacking forum

机译:在线黑客论坛中的激进语言

获取原文

摘要

We probe the heterogeneity in levels of abusive language in different sections of the Internet, using an annotated corpus of Wikipedia page edit comments to train a binary classifier for abuse detection. Our test data come from the CrimeBB Corpus of hacking-related forum posts and we find that (a) forum interactions are rarely abusive, (b) the abusive language which does exist tends to be relatively mild compared to that found in the Wikipedia comments domain, and tends to involve aggressive posturing rather than hate speech or threats of violence. We observe that the purpose of conversations in online forums tend to be more constructive and informative than those in Wikipedia page edit comments which are geared more towards adversarial interactions, and that this may explain the lower levels of abuse found in our forum data than in Wikipedia comments. Further work remains to be done to compare these results with other inter-domain classification experiments, and to understand the impact of aggressive language in forum conversations.
机译:我们探讨了互联网不同部分中的滥用语言水平的异质性,使用Wikipedia页面的注释语料编辑评论,以培训二进制分类器进行滥用检测。我们的测试数据来自黑客相关的论坛帖子的犯罪计划,我们发现(a)论坛互动很少滥用,(b)与维基百科评论域中发现的辱骂语言往往相对温和,并倾向于涉及激进的姿势而不是仇恨言论或暴力威胁。我们遵守在线论坛的对话的目的往往比维基百科页面编辑评论更具建设性和信息性,这些评论更加朝着对抗性互动,这可以解释我们论坛数据中发现的较低滥用水平而不是维基百科。注释。进一步的工作仍有待完成与其他域间分类实验进行比较,并了解攻击性语言在论坛对话中的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号