首页> 外文会议>IEEE International Conference on Semantic Computing >Constructing and Maintaining Corpus-Driven Annotations
【24h】

Constructing and Maintaining Corpus-Driven Annotations

机译:构造和维护语料库驱动的注释

获取原文

摘要

A reference library can be described as a corpus of an individual composition of documents containing related work of research, documents of favorite authors, or proceedings of a conference. The documents in the corpus may change over time; new documents extend the corpus while other documents are sorted out. A subset of documents may contain meaningful annotations describing their content while other documents contain only weakly annotations. Enriching documents with meaningful annotations is beneficial for the performance of applications like semantic search, content aggregation, automated relationship discovery, query answering and information retrieval. However, enriching and maintaining a document with meaningful annotations is non-trivial. Available (semi-) automatic annotation tools ignore the individual composition of documents in corpora by annotating documents with generic named-entity related data. In this paper, we present techniques for enriching and maintaining annotations for document-specific databases considering changes in the composition of documents.
机译:参考图书馆可以描述为包含相关研究工作,喜爱的作者的文件或会议记录的单个文件组成的语料库。语料库中的文件可能会随着时间而改变;新文档扩展了语料库,同时整理了其他文档。文档的子集可能包含描述其内容的有意义的注释,而其他文档仅包含弱注释。使用有意义的注释来丰富文档对于诸如语义搜索,内容聚合,自动关系发现,查询回答和信息检索之类的应用程序的性能是有益的。但是,使用有意义的注释来丰富和维护文档并非易事。可用的(半)自动注释工具通过使用通用的命名实体相关数据对文档进行注释,从而忽略了语料库中文档的单独组成。在本文中,我们介绍了考虑文档组成的变化来丰富和维护特定于文档的数据库的注释的技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号