Phonetic Question Generation Using Misrecognition


Abstract

Most automatic speech recognition systems are currently based on tied-state triphones. The tied states are usually determined by a decision tree, which automatically clusters triphone states into classes according to the available data so that each class can be trained efficiently. To achieve higher accuracy, this clustering is constrained by manually generated phonetic questions. The tree built from these questions can also be used to synthesize unseen triphones. The quality of a decision tree therefore depends on the quality of its phonetic questions. Unfortunately, creating phonetic questions by hand requires considerable time and resources. To overcome this problem, this paper presents an alternative method that generates the phonetic questions automatically from misrecognition items. The resulting questions are tested on the standard TIMIT phone recognition task.
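The abstract does not say how questions are derived from misrecognitions, so the following is only an illustrative sketch and not the paper's method. It assumes that a phonetic question is simply a set of phones, and that phones which a recognizer frequently confuses with each other are good candidates for sharing a question. Under that assumption, a greedy agglomerative clustering over a phone confusion table yields a nested set of candidate questions. The function names and the toy confusion counts are hypothetical.

```python
from itertools import combinations


def symmetric_confusion(confusions):
    """Fold directed (reference, hypothesis) counts into unordered-pair counts."""
    sym = {}
    for (ref, hyp), count in confusions.items():
        if ref != hyp:  # correct recognitions carry no confusion information
            key = frozenset((ref, hyp))
            sym[key] = sym.get(key, 0) + count
    return sym


def cluster_similarity(a, b, sym):
    """Average confusion count between the members of two phone clusters."""
    total = sum(sym.get(frozenset((p, q)), 0) for p in a for q in b)
    return total / (len(a) * len(b))


def generate_questions(phones, confusions):
    """Greedily merge the most-confused clusters; each merge is one candidate question.

    Assumption: a question is a set of phones. A cluster that covers the whole
    inventory is skipped because it cannot split any tree node.
    """
    inventory = sorted(set(phones))
    sym = symmetric_confusion(confusions)
    clusters = [frozenset([p]) for p in inventory]
    questions = []
    while len(clusters) > 1:
        a, b = max(
            combinations(clusters, 2),
            key=lambda pair: cluster_similarity(pair[0], pair[1], sym),
        )
        if cluster_similarity(a, b, sym) == 0:
            break  # the remaining clusters are never confused with each other
        merged = a | b
        clusters = [c for c in clusters if c not in (a, b)] + [merged]
        if len(merged) < len(inventory):
            questions.append(sorted(merged))
    return questions


# Toy confusion counts (invented for illustration, not taken from TIMIT).
toy_confusions = {
    ("m", "n"): 30, ("n", "m"): 25,
    ("p", "b"): 12, ("b", "p"): 10,
    ("t", "d"): 8, ("d", "t"): 9,
    ("p", "t"): 4, ("b", "d"): 3, ("m", "b"): 1,
}
toy_phones = ["b", "d", "m", "n", "p", "t"]
toy_questions = generate_questions(toy_phones, toy_confusions)
```

On the toy table, the nasals are merged first, then the two stop pairs, and finally the four stops together. In a real system the candidate sets would still have to be written in the question format expected by the tree-building tool and evaluated by the resulting recognition accuracy.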
