...
首页> 外文期刊>Information Processing & Management >Incremental cue phrase learning and bootstrapping method for causality extraction using cue phrase and word pair probabilities
【24h】

Incremental cue phrase learning and bootstrapping method for causality extraction using cue phrase and word pair probabilities

机译:基于提示短语和单词对概率的因果关系增量式提示短语学习和自举方法

获取原文
获取原文并翻译 | 示例
           

摘要

This work aims to extract possible causal relations that exist between noun phrases. Some causal relations are manifested by lexical patterns like causal verbs and their sub-categorization. We use lexical patterns as a filter to find causality candidates and we transfer the causality extraction problem to the binary classification. To solve the problem, we introduce probabilities for word pair and concept pair that could be part of causal noun phrase pairs. We also use the cue phrase probability that could be a causality pattern. These probabilities are learned from the raw corpus in an unsu-pervised manner. With this probabilistic model, we increase both precision and recall. Our causality extraction shows an F-score of 77.37%, which is an improvement of 21.14 percentage points over the baseline model. The long distance causal relation is extracted with the binary tree-styled cue phrase. We propose an incremental cue phrase learning method based on the cue phrase confidence score that was measured after each causal classifier learning step. A better recall of 15.37 percentage points is acquired after the cue phrase learning.
机译:这项工作旨在提取名词短语之间存在的可能因果关系。某些因果关系通过因果动词及其子类别等词汇模式来体现。我们使用词法模式作为筛选器来查找因果关系候选者,并将因果关系提取问题转移到二进制分类中。为了解决该问题,我们介绍了可能成为因果名词短语对的一部分的词对和概念对的概率。我们还使用可能是因果关系模式的提示短语概率。从原始语料库以未经监督的方式学习这些概率。使用这种概率模型,我们可以提高准确性和召回率。我们的因果关系提取显示F得分为77.37%,比基准模型提高21.14个百分点。用二叉树样式提示短语提取远距离因果关系。我们提出了一种基于提示短语置信度得分的增量提示短语学习方法,该方法是在每个因果分类器学习步骤之后进行测量的。提示短语学习后,可以更好地回忆起15.37个百分点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号