Effective data-driven feature learning for detecting name errors in automatic speech recognition

机译：有效的数据驱动特征学习，可在自动语音识别中检测名称错误

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper addresses the problem of detecting name errors in automatic speech recognition (ASR) output. The highly skewed label distributions (i.e. name errors are infrequent), sparse training data, and large number of potential lexical features pose significant challenges for training name error classification systems. Data-driven feature learning is needed for handling multiple languages but is sensitive to over fitting. We address the problem by designing aggregate features using a related (sentence-level name detection) task, and reduce dimensionality of the lexical features using word classes. Experiments on conversational domain data in both English and Iraqi Arabic show that best results are obtained using all feature mapping methods plus feature selection using L1 regularization.

机译：本文解决了在自动语音识别（ASR）输出中检测名称错误的问题。高度倾斜的标签分布（即名称错误很少见），稀疏的训练数据以及大量潜在的词汇特征对训练名称错误分类系统提出了重大挑战。需要数据驱动的特征学习来处理多种语言，但对过度拟合很敏感。我们通过使用相关的（句子级名称检测）任务设计聚合特征来解决该问题，并使用单词类降低词汇特征的维数。使用英语和伊拉克阿拉伯语进行的会话域数据实验表明，使用所有特征映射方法以及使用L1正则化进行特征选择都可以获得最佳结果。

著录项

来源
《IEEE Workshop on Spoken Language Technology》|2014年|230-235|共6页
会议地点
作者
Ji He; Marin Alex; Ostendorf Mari;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
auxiliary tasks; feature learning; name error detection; word classes;

机译：辅助任务;特征学习;名称错误检测;词类;

相似文献

外文文献
中文文献
专利

1. Robust phoneme classification for automatic speech recognition using hybrid features and an amalgamated learning model [J] . Mohammed Kamal Khwaja, Peddakota Vikash, P. Arulmozhivarman, International journal of speech technology . 2016,第4期

机译：强大的音素分类功能，可使用混合功能和混合学习模型进行自动语音识别
2. Data-driven spectral basis functions for automatic speech recognition [J] . Naren Malayath, Hynek Hermansky Speech Communication . 2003,第4期

机译：数据驱动的频谱基础功能可实现自动语音识别
3. Analysis of the sensitivity of the End-Of-Turn Detection task to errors generated by the Automatic Speech Recognition process [J] . Cesar Montenegro, Roberto Santana, Jose A. Lozano Engineering Applications of Artificial Intelligence . 2021,第Apra期

机译：转向末端检测任务对自动语音识别过程产生的错误的敏感性分析
4. Effective data-driven feature learning for detecting name errors in automatic speech recognition [C] . Ji He, Marin Alex, Ostendorf Mari IEEE Workshop on Spoken Language Technology . 2014

机译：有效的数据驱动功能学习，用于检测自动语音识别中的名称错误
5. Design of loss functions and feature transformation for minimum classification error based automatic speech recognition [D] . Ratnagiri, Madhavi Vedula 2011

机译：基于最小分类误差的自动语音识别损失函数设计和特征变换
6. Words from spontaneous conversational speech can be recognized with human-like accuracy by an error-driven learning algorithm that discriminates between meanings straight from smart acoustic features bypassing the phoneme as recognition unit [O] . Denis Arnold, Fabian Tomaschek, Konstantin Sering, -1

机译：通过错误驱动的学习算法可以区分自发会话语音中的单词其准确性与人类类似可以从智能声学特征中区分出含义而绕过音素作为识别单元
7. Detecting and Correcting Automatic Speech Recognition Errors with A New Model [O] . 2021

机译：用新模型检测和纠正自动语音识别错误

Effective data-driven feature learning for detecting name errors in automatic speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅