首页> 外文会议>European conference on speech communication and technology >Multipass algorithm for acquisition of salient acoustic morphemes
【24h】

Multipass algorithm for acquisition of salient acoustic morphemes

机译:用于获取突出声学语素的多级数量算法

获取原文

摘要

We are interested in spoken language understanding within the domain of automated telecommunication services. Our current methodology involves training statistical language models from large annotated corpora for recognition and understanding. Since the transcribing of large speech corpora is a resource consuming task, we are motivated to exploit speech without transcriptions. In particular, we learn the semantic associations for a task exploiting only phone-based sequences from the output of a task-independent ASR-system. In this paper we present a new multipass algorithm for acquiring salient phone sequences from untranscribed speech corpora and evaluate their utility for the HMIHY task. Compared to our previous strategy, this algorithm is shown to produce improved call-classification results while reducing up to 7-fold the number of salient phone-sequences selected for training.
机译:我们对自动电信服务领域的口语理解感兴趣。我们目前的方法涉及从大型注释的语料库训练统计语言模型,以获得认可和理解。由于大型语音语料库的转录是一种资源消耗任务,因此我们有动力在没有转录的情况下开发语音。特别是,我们学习用于仅从任务独立于任务的ASR系统的输出的基于电话的序列的任务的语义关联。在本文中,我们提出了一种新的多级游戏算法,用于获取来自未筛选的语音语料库的突出电话序列,并评估他们为HMihy任务的实用性。与我们以前的策略相比,该算法显示出现改进的呼叫分类结果,同时减少了最多7倍的突出电话序列的次数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号