Phonetic Question Generation Using Misrecognition

机译：使用错误识别的语音问题生成

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Most automatic speech recognition systems are currently based on tied state triphones. These tied states are usually determined by a decision tree. Decision trees can automatically cluster triphone states into many classes according to data available allowing each class to be trained efficiently. In order to achieve higher accuracy, this clustering is constrained by manually generated phonetic questions. Moreover, the tree generated from these phonetic questions can be used to synthesize unseen triphones. The quality of decision trees therefore depends on the quality of the phonetic questions. Unfortunately, manual creation of phonetic questions requires a lot of time and resources. To overcome this problem, this paper is concerned with an alternative method for generating these phonetic questions automatically from misrecognition items. These questions are tested using the standard TIMIT phone recognition task.

机译：当前，大多数自动语音识别系统都基于捆绑状态三音器。这些联系状态通常由决策树确定。决策树可以根据可用数据自动将三音机状态分为许多类，从而可以有效地训练每个类。为了获得更高的准确性，这种聚类受到手动生成的语音问题的限制。而且，从这些语音问题中生成的树可用于合成看不见的三音。因此，决策树的质量取决于语音问题的质量。不幸的是，手动创建语音问题需要大量时间和资源。为了克服这个问题，本文涉及一种从错误识别项自动生成这些语音问题的替代方法。使用标准TIMIT电话识别任务测试了这些问题。

著录项

来源
《International Conference on Text, Speech and Dialogue(TSD 2006); 20060911-15; Brno(CZ)》|2006年|P.407-414|共8页
会议地点 Brno(CZ)
作者
Supphanat Kanokphara; Julie Carson-Berndsen;
展开▼
作者单位

School of Computer Science and Informatics University College Dublin, Ireland;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. Phonetic effects of focus and "tonal crowding" in intonation: Evidence from Greek polar questions [J] . Arvaniti A, Ladd DR, Mennen I Speech Communication . 2006,第6期

机译：语调中的焦点和“声调拥挤”的语音效果：来自希腊极地问题的证据
2. Improved Arabic speech recognition system through the automatic generation of fine-grained phonetic transcriptions [J] . Alsharhan Eiman, Ramsay Allan Information Processing & Management . 2019,第2期

机译：通过自动生成细粒度的语音转录来改进阿拉伯语语音识别系统
3. Generation of a phonetic transcription for modern standard Arabic: A knowledge-based model [J] . Allan Ramsay, Iman Alsharhan, Hanady Ahmed Computer speech and language . 2014,第4期

机译：现代标准阿拉伯语语音记录的生成：基于知识的模型
4. Phonetic Question Generation Using Misrecognition [C] . Supphanat Kanokphara, Julie Carson-Berndsen International Conference on Text, Speech and Dialogue(TSD 2006); 20060911-15; Brno(CZ) . 2006

机译：使用错误识别的语音问题生成
5. Automatic Neural Question Generation Using Community-Based Question Answering Systems [D] . Baghaee, Tina. 2018

机译：使用基于社区的问题应答系统的自动神经问题
6. Biological data questions the support of the self inhibition required for pattern generation in the half center model [O] . Matthias Kohler, Philipp Stratmann, Florian Röhrbein, 2020

机译：生物数据质疑半中心模型中图案生成所需的自我抑制的支持
7. Phonetic differences between uptalk and question rises in two Antipodean English varieties [O] . Paul Warren, Janet Fletcher 2016

机译：两种抗双翼英语品种的上行与问题之间的语音差异

Phonetic Question Generation Using Misrecognition

摘要

著录项

相似文献

相关主题

期刊订阅