首页> 外文会议>DAGM Symposium on Pattern Recognition >Natural Language Understanding by Combining Statistical Methods and Extended Context-FreeGrammars
【24h】

Natural Language Understanding by Combining Statistical Methods and Extended Context-FreeGrammars

机译:通过结合统计方法和扩展的上下文无关语法来理解自然语言

获取原文

摘要

This paper introduces an novel framework for speech understanding using extended context-free grammars (ECFGs) by combining statistical methods and rule based knowledge. By only using 1 st level labels a considerable lower expense of annotation effort can be achieved. In this paper we derive hierarchical non-deterministic automata from the ECFGs, which are transformed into transition networks (TNs) representing all kinds of labels. A sequence of recognized words is hierarchically decoded by using a Viterbi algorithm. In experiments the difference between a hand-labeled tree bank annotation and our approach is evaluated. The conducted experiments show the superiority of our proposed framework. Comparing to a hand-labeled baseline system (=100%) we achieve 95,4 % acceptance rate for complete sentences and 97.8 % for words. This induces an accuray rate of 95.1 % and error rate of 4.9 %, respectively F1-measure 95.6 % in a corpus of 1 300 sentences.
机译:本文通过结合统计方法和基于规则的知识,介绍了一种使用扩展的无上下文语法(ECFG)进行语音理解的新颖框架。通过仅使用第一级标签,可以实现相当低的注释工作费用。在本文中,我们从ECFG派生出分层的不确定性自动机,并将其转换为代表各种标签的过渡网络(TN)。通过使用维特比算法对识别出的单词序列进行分层解码。在实验中,评估了手工标记的树库注释与我们的方法之间的差异。进行的实验表明了我们提出的框架的优越性。与手动标记的基准系统(= 100%)相比,完整句子的接受率为95.4%,单词的接受率为97.8%。在1300个句子的语料库中,这导致95.1%的累积准确率和4.9%的错误准确率,分别为F1度量95.6%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号