...
首页> 外文期刊>Information Processing & Management >Context-sensitive gender inference of named entities in text
【24h】

Context-sensitive gender inference of named entities in text

机译:文本中命名实体的上下文敏感的性别推理

获取原文
获取原文并翻译 | 示例
           

摘要

The gender information of named entities is an important prerequisite for many text analysis tasks such as gender bias detection and targeted advertising. Despite its valuable use cases, gender tagging of named entities has traditionally been database-reliant. The lack of open-source benchmarks is a major impediment to exploring the effectiveness of machine learning-based methods for this task. Towards this goal, the article serves two main purposes. Firstly, we create four open-source datasets from well-known NER corpora and make them publicly available. Secondly, we propose a novel supervised learning approach based on the transformer network to identify the gender of named entities. We evaluate the proposed approach on four gender identification datasets. The proposed method outperforms two commercial database-reliant approaches and five deep sequence models, including BERT.
机译:命名实体的性别信息是许多文本分析任务等重要先决条件,例如性别偏见检测和目标广告。尽管它有价值的用例,但名为CONTITITE的性别标记传统上是依赖数据库依赖性。缺乏开源基准是探索基于机器学习方法的有效性的主要障碍。对此目标来说,文章有两种主要目的。首先,我们从知名的Ner Cotora创建四个开源数据集,并将其公开可用。其次,我们提出了一种基于变压器网络的新型监督学习方法,以确定命名实体的性别。我们评估了四个性别识别数据集的建议方法。所提出的方法优于两种商业数据库依赖性方法和五种深序模型,包括伯特。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号