首页> 外文会议>International conference on computational linguistics >Coreference Clustering Using Column Generation
【24h】

Coreference Clustering Using Column Generation

机译:使用列生成的共指聚类

获取原文

摘要

In this paper we describe a novel way of generating an optimal clustering for coreference resolution. Where usually heuristics are used to generate a document-level clustering, based on the output of local pairwise classifiers, we propose a method that calculates an exact solution. We cast the clustering problem as an Integer Linear Programming (ILP) problem, and solve this by using a column generation approach. Column generation is very suitable for ILP problems with a large amount of variables and few constraints, by exploiting structural information. Building on a state of the art framework for coreference resolution, we implement several strategies for clustering. We demonstrate a significant speedup in time compared to state-of-the-art approaches of solving the clustering problem with ILP, while maintaining transitivity of the coreference relation. Empirical evidence suggests a linear time complexity, compared to a cubic complexity of other methods.
机译:在本文中,我们描述了一种生成用于共参考分辨率的最佳聚类的新颖方法。在通常使用启发式方法来生成文档级聚类的情况下,根据本地成对分类器的输出,我们提出了一种计算精确解的方法。我们将聚类问题转换为整数线性规划(ILP)问题,并通过使用列生成方法来解决。通过利用结构信息,列生成非常适合于具有大量变量且几乎没有约束的ILP问题。基于最先进的共指解析框架,我们实施了几种聚类策略。与使用ILP解决聚类问题的最新方法相比,我们展示了显着的时间加速,同时保持了共指关系的可传递性。经验证据表明,与其他方法的立方复杂度相比,线性时间复杂度高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号