Constructing and Maintaining Corpus-Driven Annotations

机译：构造和维护语料库驱动的注释

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A reference library can be described as a corpus of an individual composition of documents containing related work of research, documents of favorite authors, or proceedings of a conference. The documents in the corpus may change over time; new documents extend the corpus while other documents are sorted out. A subset of documents may contain meaningful annotations describing their content while other documents contain only weakly annotations. Enriching documents with meaningful annotations is beneficial for the performance of applications like semantic search, content aggregation, automated relationship discovery, query answering and information retrieval. However, enriching and maintaining a document with meaningful annotations is non-trivial. Available (semi-) automatic annotation tools ignore the individual composition of documents in corpora by annotating documents with generic named-entity related data. In this paper, we present techniques for enriching and maintaining annotations for document-specific databases considering changes in the composition of documents.

机译：参考图书馆可以描述为包含相关研究工作，喜爱的作者的文件或会议记录的单个文件组成的语料库。语料库中的文件可能会随着时间而改变;新文档扩展了语料库，同时整理了其他文档。文档的子集可能包含描述其内容的有意义的注释，而其他文档仅包含弱注释。使用有意义的注释来丰富文档对于诸如语义搜索，内容聚合，自动关系发现，查询回答和信息检索之类的应用程序的性能是有益的。但是，使用有意义的注释来丰富和维护文档并非易事。可用的（半）自动注释工具通过使用通用的命名实体相关数据对文档进行注释，从而忽略了语料库中文档的单独组成。在本文中，我们介绍了考虑文档组成的变化来丰富和维护特定于文档的数据库的注释的技术。

著录项

来源
《IEEE International Conference on Semantic Computing》|2019年|462-467|共6页
会议地点
作者
Felix Kuhr; Ralf Möller;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Semantics; Iterative methods; Databases; Data mining; Ontologies; Information retrieval; Resource description framework;

机译：语义;迭代方法;数据库;数据挖掘;本体;信息检索;资源描述框架;

相似文献

外文文献
中文文献
专利

1. A computational platform to maintain and migrate manual functional annotations for BioCyc databases [J] . Jesse R Walsh, Taner Z Sen, Julie A Dickerson BMC Systems Biology . 2014,第1期

机译：一个用于维护和迁移BioCyc数据库的手动功能注释的计算平台
2. Constructing a WordNet for Turkish Using Manual and Automatic Annotation [J] . Ehsani Razieh, Solak Ercan, Yildiz Olcay Taner ACM transactions on Asian language information processing . 2018,第3期

机译：使用手动和自动注释为土耳其语构建WordNet
3. Constructing Differentiated Educational Materials Using Semantic Annotation for Sustainable Education in IoT Environments [J] . Yongsung Kim, Jihoon Moon, Eenjun Hwang Sustainability . 2018,第4期

机译：物联网环境中使用语义标注构建可持续发展教育的差异化教学材料
4. Constructing and Maintaining Corpus-Driven Annotations [C] . Felix Kuhr, Ralf M?ller IEEE International Conference on Semantic Computing . 2019

机译：构建和维护语料库驱动的注释
5. Improving Modeling of Human Experience and Behavior: Methodologies for Enhancing the Quality of Human-Produced Data and Annotations of Subjective Constructs [D] . Booth, Brandon M. 2020

机译：提高人类经验和行为的建模：提高人为数据质量的方法和主观构建的注释
6. Improving the Utility of the Tox21 Dataset by Deep Metadata Annotations and Constructing Reusable Benchmarked Chemical Reference Signatures [O] . Daniel J. Cooper, Stephan Schürer 2019

机译：通过深层元数据注释提高Tox21数据集的实用性并构建可重复使用的基准化学参考签名
7. Constructing immigrants in UK legislation and Administration informative texts: A corpus-driven study (2007–2011) [O] . Pérez-Paredes Pascual Francisco, Jiménez Pilar Aguado, Hernández Purificación Sánchez 2016

机译：在英国立法和行政管理性参考文献中建设移民：语料库驱动的研究（2007年至2011年）

Constructing and Maintaining Corpus-Driven Annotations

摘要

著录项

相似文献

相关主题

期刊订阅