首页> 外文会议>Text Analysis Conference >JU_CSE_TAC: Textual Entailment Recognition System at TAC RTE-6
【24h】

JU_CSE_TAC: Textual Entailment Recognition System at TAC RTE-6

机译:ju_cse_tac:TAC RTE-6的文本意外识别系统

获取原文

摘要

The note describes the Recognizing Textual Entailment (RTE) system developed at the Computer Science and Engineering Department, Jadavpur University, India. In this competition, we participated and submitted the results in the RTE-6 Main Task (3 runs), Novelty Task (3 runs) and RTE-6 KBP task (3 runs for generic task and 3 runs for tailored task). For the Main and the Novelty Tasks, the corpus was a collection of news wire documents from various sources and arranged into particular topics, a hypothesis H and a set of "candidate" sentences retrieved by Lucene from that corpus for the hypothesis H. Each sentence in the set of documents associated with a given topic was involved in an entailment relationship with each hypothesis for the topic. RTE systems are required to identify all the sentences that entail H among the candidate sentences. For the Main and the Novelty Tasks, the system is based on the composition of lexical entailment module, lexical distance module, Chunk module, Named Entity module and syntactic text entailment (TE) module. Our TE system is based on the Support Vector Machine (SVM) that uses twenty five features for lexical similarity, the output tag from a rule based syntactic two-way TE system as a feature and the outputs from a rule based Chunk Module and Named Entity Module as the other features. For the Main task test set, the following micro-average results were obtained for Run 1: F-Score 34.79, Run 2: F-Score 26.78 and Run 3 : F-score 31.19. For the novelty task test set, the following micro-average results were obtained for Run 1: Novelty Evaluation F-Score 81.77 and Justification Evaluation F-Score 34.35, Run 2: Novelty Evaluation F-Score 78.18 and Justification Evaluation 26.87 and Run 3: Novelty Evaluation F-score 78.69 and Justification Evaluation 24.57 were obtained. The KBP Slot Filling task is focused on the searching a collection of news wire and Web documents and extracting values for a predefined set of attributes ("slots") for the target entities. The RTE KBP Validation Pilot is based on the assumption that extracted slot filler is correct if and only if the supporting document entails an hypothesis created on the basis of the slot filler. In RTE KBP, we participated for generic task and tailored task. For the RTE-6 KBP test set for Generic Task, micro-average results for Run 1: F-Score 0.1403, Run 2: F-Score 0.172 and Run 3: F-score 0.1531 were obtained. For RTE-6 KBP test set for Tailored Task, micro-average results for Run 1: F-Score 0.3, Run 2: F-Score 0.3307 and Run 3: F-score 0.3288 were obtained.
机译:该备注描述了在印度的Jadavpur University计算机科学和工程系中开发的识别文本征征(RTE)系统。在本次竞争中,我们参加并提交了RTE-6主要任务(3次运行)的结果,新颖的任务(3运行)和RTE-6 KBP任务(3 kBP任务(3用于通用任务,3个运行)为量身定制的任务。对于主要和新颖的任务来说,语料库是来自各种来源的新闻线文档的集合,并安排了特定主题,假设H和一组“候选人”句子由Lucene从该语料库中检索的假设H.每个句子在与给定主题关联的一组文档中,涉及与主题的每个假设的蕴涵关系。 RTE系统需要识别候选句子中需要h的所有句子。对于主要和新颖的任务,该系统基于词汇意外模块,词汇距离模块,块模块,命名实体模块和语法文本诱惑(TE)模块的组成。我们的TE系统基于支持向量机(SVM),它使用二十五个特征进行词法相似性,从规则的语法双向TE系统的输出标签作为一个特征和来自基于规则的块模块和命名实体的输出模块作为其他功能。对于主要的任务测试集,获得了以下微平均结果,用于运行1:F分数34.79,运行2:F分26.78,并运行3:F分数31.19。对于新颖性任务测试集,获得以下微平均结果为RIN 1:新颖性评估F分数为81.77和理由评估F-Score 34.35,运行2:新奇评估F分数78.18和理由评估26.87并运行3:新颖的评价F分数为78.69和理由评估24.57。 KBP插槽填充任务专注于搜索新闻线和Web文档的集合,并提取目标实体的预定义属性集合(“插槽”集的值。 RTE KBP验证导频基于如果支持文档在Slot Filler的基础上创建的假设,则RTE KBP验证导频基于提取的插槽填充器是正确的。在RTE KBP中,我们参与了通用任务和量身定制的任务。对于用于通用任务的RTE-6 KBP测试,MICRE-CARES 0.1403运行的微平均结果,运行2:F分数0.172并获得3:F分数0.1531。对于RTE-6 KBP测试设置为定制任务,运行的微平均结果为1:F分数0.3,运行2:F分数0.3307并运行3:F分数0.3288。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号