首页> 外文会议>Conference of the European Chapter of the Association for Computational Linguistics >An Expert Annotated Dataset for the Detection of Online Misogyny
【24h】

An Expert Annotated Dataset for the Detection of Online Misogyny

机译:用于检测在线Misogyny的专家注释数据集

获取原文

摘要

Online misogyny is a pernicious social problem that risks making online platforms toxic and unwelcoming to women. We present a new hierarchical taxonomy for online misogyny, as well as an expert labelled dataset to enable automatic classification of misogynistic content. The dataset consists of 6,567 labels for Reddit posts and comments. As previous research has found untrained crowdsourced an-notators struggle with identifying misogyny, we hired and trained annotators and provided them with robust annotation guidelines. We report baseline classification performance on the binary classification task, achieving accuracy of 0.93 and F1 of 0.43. The codebook and datasets are made freely available for future researchers.
机译:在线Misogyny是一个有害的社会问题,使在线平台对女性有毒和不受欢迎的风险。 我们为在线Misogyny提出了新的分类分类系统,以及标记的数据集的专家,以启用厌恶女性内容的自动分类。 DataSet由Reddit帖子和评论的6,567个标签组成。 随着以前的研究发现未受训练的众群陷入困境,我们聘请了Misogyny,我们聘请并培训了注释器,并提供了坚固的注释指南。 我们在二进制分类任务上报告基线分类性能,实现0.93和F1的精度为0.43。 码本和数据集可免费为未来的研究人员提供。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号