首页> 外文会议>IEEE/WIC/ACM International Conference on Web Intelligence >OnPerDis: Ontology-Based Personal Name Disambiguation on the Web
【24h】

OnPerDis: Ontology-Based Personal Name Disambiguation on the Web

机译:OnPerDis:基于本体的Web上的个人名称歧义消除

获取原文

摘要

With the growth of web documents, the ambiguity of personal name becomes more common and brings poor performance of web search. Identifying a correct personal entity from the a piece of or the whole document is still a very challenging problem, especially for Chinese websites. In this paper, we propose a novel Ontology-based approach for Personal Name Disambiguation (named "OnPerDis"). This approach has two main steps: first, we construct person ontology (PO) with rich conceptual modeling as well as a large set of supporting instances, second, for a given personal name on the web, we create a temporary instance and extract features from the web documents, calculate the similarity between this temporary instance and the instances in the PO. The one with the highest similarity score is chosen as the appropriate personal name. Our extensive evaluations with two rich real-life datasets (CIPS-SIGHAN 2012 NERD and Chinese web documents) shows OnPerDis' efficacy on personal name disambiguation on the Web.
机译:随着网络文档的增长,个人名称的歧义变得越来越普遍,并且带来了网络搜索的不良性能。从一个或整个文档中识别正确的个人实体仍然是一个非常具有挑战性的问题,尤其是对于中文网站而言。在本文中,我们提出了一种新颖的基于本体的消除人名歧义的方法(名为“ OnPerDis”)。这种方法有两个主要步骤:首先,我们使用丰富的概念模型以及大量支持实例来构建人本体(PO),其次,对于网络上的给定个人名称,我们创建了一个临时实例并从中提取特征Web文档,计算此临时实例与PO中的实例之间的相似度。选择相似度得分最高的人作为适当的个人名字。我们对两个丰富的现实生活数据集(CIPS-SIGHAN 2012 NERD和中文网络文档)进行了广泛评估,显示了OnPerDis消除网络上人名歧义的功效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号