首页> 外文会议>International Conference on Big Data Analytics >Entity Markup for Knowledge Base Population
【24h】

Entity Markup for Knowledge Base Population

机译:知识库人口的实体标记

获取原文

摘要

Entities (e.g. people, places, products) exist in various heterogeneous sources, such as Wikipedia, web page, and social media. Entity markup, like entity extraction, coreference resolution, and entity disambiguation, is the essential means for adding semantic value to unstructured web contents and this way enabling the linkage between unstructured and structured data and knowledge collections. A major challenge in this endeavor lies in the ambiguity of the digital contents, with context-dependent semantic and dynamic. In this paper, I introduce the main challenges of coreference resolution and named entity disambiguation. Especially, I propose practical strategies to improve entity markup. Furthermore, experimental studies are conducted to fulfill named entity disambiguation in combination with the optimized entity extraction and coreference resolution. The main goal of this paper is to analyze the significant challenges of entity markup and present insights on the proposed entity markup framework for knowledge base population. The preliminary experimental results prove the significance of improving entity markup.
机译:各种异构来源,如维基百科,网页和社交媒体存在实体(例如人员,地点,产品)。实体标记,如实体提取,coreference分辨率和实体歧义,是向非结构化Web内容添加语义值的基本手段,并以这种方式实现非结构化和结构化数据和知识集合之间的链接。这一努力的主要挑战在于数字内容的模糊性,具有上下文依赖性语义和动态。在本文中,我介绍了Coreference解决方案的主要挑战和命名实体歧义。特别是,我提出了改进实体标记的实际策略。此外,进行实验研究以结合优化的实体提取和Coreference分辨率来实现命名实体消歧。本文的主要目的是分析实体标记的重大挑战,并对知识库人口拟议的实体标记框架进行了展望。初步实验结果证明了改善实体标记的重要性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号