首页> 外文会议>IEEE Workshop on Spoken Language Technology >EM-based phoneme confusion matrix generation for low-resource spoken term detection
【24h】

EM-based phoneme confusion matrix generation for low-resource spoken term detection

机译:基于EM的音素混淆矩阵生成,用于低资源口语检测

获取原文

摘要

The idea of using a data-driven phoneme confusion matrix (PCM) to enhance speech recognition and retrieval performance is not new to the speech community. Although empirical results show various degrees of improvements brought by introducing a PCM, the underlying data-driven processes introduced in most papers are rather ad-hoc and lack rigorous statistical justifications. In this paper we will focus on the statistical aspects of PCM generation, propose and justify a novel expectation-maximization based algorithm for data-driven PCM generation. We will evaluate the performance of the generated PCMs under the context of low-resource spoken term detection, with primary focus on out-of-vocabulary keywords.
机译:使用数据驱动音素混淆矩阵(PCM)来增强语音识别和检索性能的想法在语音社区中并不陌生。尽管经验结果表明引入PCM可以带来不同程度的改进,但是大多数论文中介绍的底层数据驱动过程都是临时的,缺乏严格的统计依据。在本文中,我们将专注于PCM生成的统计方面,提出并证明一种新颖的基于期望最大化的数据驱动PCM生成算法。我们将在低资源口语检测的背景下评估生成的PCM的性能,主要关注词汇外的关键字。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号