首页> 外文会议>International Conference on Information Fusion;FUSION 2012 >Combining local and non-local information with dual decomposition for named entity recognition from text
【24h】

Combining local and non-local information with dual decomposition for named entity recognition from text

机译:将本地和非本地信息与双重分解相结合,以从文本中识别命名实体

获取原文
获取原文并翻译 | 示例

摘要

Named entity recognition (NER) is the task of segmenting and classifying occurrences of names in text. In NER, local contextual cues provide important evidence, but non-local information from the whole document could also prove useful: for example, it is useful to know that “Mary Kay Inc.” has been mentioned in a document to classify subsequent mentions of “Mary Kay” as an organization and not as a person. Previous works for NER typically model the problem as a sequence labeling problem, coupling the predictions of neighboring words with a Markov model such as conditional random fields. We propose applying the dual decomposition approach to combine a local sentential model and a non-local label consistency model for NER. The dual decomposition approach is a fusion approach which combines two models by constraining them to agree on their predictions on the test data. Empirically, we show that this approach outperforms the local sentential models on four out of five data sets.
机译:命名实体识别(NER)是对文本中出现的名称进行分段和分类的任务。在NER中,本地上下文线索提供了重要的证据,但是整个文档中的非本地信息也可能有用:例如,知道“ Mary Kay Inc.”很有用。在文档中已提及“ Mary Kay”,以将其后提及的“ Mary Kay”归为组织而非个人。 NER的先前工作通常将问题建模为序列标记问题,将相邻单词的预测与Markov模型(例如条件随机场)耦合在一起。我们建议应用双重分解方法将NER的局部句型模型和非局部标签一致性模型相结合。对偶分解方法是一种融合方法,通过约束两个模型以就其对测试数据的预测达成共识,从而将它们组合在一起。从经验上讲,我们表明该方法在五个数据集中的四个数据集上都优于局部句法模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号