首页> 外文会议>Pacific Asia Conference on Language, Information and Computation >Integrating Word Embedding Offsets into the Espresso System for Part-Whole Relation Extraction
【24h】

Integrating Word Embedding Offsets into the Espresso System for Part-Whole Relation Extraction

机译:将嵌入偏移的单词集成到浓缩咖啡系统中,以实现部分整体关系提取

获取原文

摘要

Part-whole relation, or meronymy plays an important role in many domains. Among approaches to addressing the part-whole relation extraction task, the Espresso bootstrapping algorithm has proved to be effective by significantly improving recall while keeping high precision. In this paper, we first investigate the effect of using fine-grained subtypes and careful seed selection step on the performance of extracting part-whole relation. Our multitask learning and careful seed selection were major factors for achieving higher precision. Then, we improve the Espresso bootstrapping algorithm for part-whole relation extraction task by integrating word embedding approach into its iterations. The key idea of our approach is utilizing an additional ranker component, namely Similarity Ranker in the Instances Extraction phase of the Espresso system. This ranker component uses embedding offset information between instance pairs of part-whole relation. The experiments show that our proposed system achieved a precision of 84.9% for harvesting instances of the part-whole relation, and outperformed the original Espresso system.
机译:零件整体关系,或孟喻在许多领域发挥着重要作用。在解决零件整体关系提取任务的方法中,通过显着改善召回的同时保持高精度的同时,已经证明了浓缩咖啡释放算法。在本文中,我们首先研究了使用细粒亚型和仔细种子选择步骤对提取部分整体关系的性能的影响。我们的多任务学习和仔细的种子选择是实现更高精度的主要因素。然后,通过将Word嵌入方法集成到其迭代来改进eSPRESSO引导算法,以实现部分整体关系提取任务。我们方法的关键思想是利用额外的Ranker组件,即浓缩咖啡系统的提取阶段中的相似性等级。此排名组件使用嵌入偏移信息与零件整体关系对之间的嵌入偏移信息。实验表明,我们的建议系统实现了84.9%的精度,用于收获部分整体关系的情况,并且优于原始浓缩咖啡系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号