【24h】

Neural Constituency Parsing of Speech Transcripts

机译:语音笔录的神经选区解析

获取原文

摘要

This paper studies the performance of a neural self-attentive parser on transcribed speech. Speech presents parsing challenges that do not appear in written text, such as the lack of punctuation and the presence of speech dis-fluencies (including filled pauses, repetitions, corrections, etc.). Disfluencies are especially problematic for conventional syntactic parsers, which typically fail to find any EDITED dis-fluency nodes at all. This motivated the development of special disfluency detection systems, and special mechanisms added to parsers specifically to handle disfluencies. However, we show here that neural parsers can find EDITED disfluency nodes, and the best neural parsers find them with an accuracy surpassing that of specialized disfluency detection systems, thus making these specialized mechanisms unnecessary. This paper also investigates a modified loss function that puts more weight on EDITED nodes. It also describes tree-transformations that simplify the disfluency detection task by providing alternative encodings of disfluencies and syntactic information.
机译:本文研究了转录语音的神经自注意解析器的性能。语音提出了在书面文本中没有出现的解析挑战,例如缺少标点符号和语音不流畅(包括填充的停顿,重复,更正等)。对于传统的语法解析器而言,差异性尤其成问题,传统的语法解析器通常根本找不到任何已编辑的差异性节点。这促使开发特殊的水流检测系统,并为解析器添加了专门用于处理水流的特殊机制。但是,我们在这里表明,神经解析器可以找到已编辑的flufluency节点,而最佳的神经解析器可以以超过专门的flufluency检测系统的精度来找到它们,因此不需要这些专门的机制。本文还研究了修改后的损失函数,该函数对EDITED节点施加了更大的权重。它还描述了树变换,该树变换通过提供流水和语法信息的替代编码来简化流水检测任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号