Noise robust speech recognition using dynamic synapse neural networks.

机译：使用动态突触神经网络的噪声鲁棒语音识别。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Training parameters of Dynamic Synapse Neural Networks (DSNNs) is an ill-conditioned problem, due to nonlinearity, high dimensionality, and large-scale information processing. In addition to convergence problem, a trained neural network might be unstable which causes a poor generalization for unseen data. Therefore, dynamic networks have been lacking proper learning algorithm to extract complex information from spatio-temporal patterns.; In this dissertation, a new discrete version of DSNN based on the Wavelet filter bank and the Genetic Algorithms (GAs) learning method is introduced to optimize the DSNN parameters. Despite the success of GAs on training of a small-scaled DSNN, training of a large-scale DSNN with GAs becomes a challenging problem. To address this issue, a general class of DSNNs from biological and mathematical perspective is introduced to model the average local population activity of neurons. Efficient learning methods to optimize the DSNN parameters with point-process I/O are developed based on the Gauss-Newton learning and an interior trust-region (TR) nonlinear optimization method.; In order to study the robustness of the DSNNs, the following assumptions are made to simplify the problem: (a) continuous DSNNs and (b) removing the dynamic characteristics of I/O signals---working in steady state mode. These assumptions reduce DSNNs to their sub-classes, i.e., static neural network. A robust command recognition system is developed using frequency domain analysis and radial basis function networks with the kernel learning. A system for multiword spotting in continuous speech has been developed. A proof-of-concept of this system is illustrated by a demo which is implemented to identify 27 words for a drive-through MacDonald automatic system. The sensitivity, specificity and correct classification rate of the proposed system for a slow rate speaking are 82.6%, 69.1 %, and 72.4%, respectively.; Moreover, to construct a hybrid noise robust speech recognition system, a speaker-independent phoneme recognition system is developed based on the wavelet de-noising preprocessing at front-end. The improvement of 16.31% to 20.20% on the accuracy of continuous phoneme recognition under SNR levels of 20 to -5 dB for the TIMIT corpus was obtained.

机译：由于非线性，高维和大规模信息处理，动态突触神经网络（DSNN）的训练参数是一个病态问题。除了收敛问题外，训练有素的神经网络可能会不稳定，从而导致看不见数据的泛化能力很差。因此，动态网络一直缺乏适当的学习算法来从时空模式中提取复杂的信息。本文提出了一种基于小波滤波器组和遗传算法学习方法的离散神经网络新版本，以优化神经网络参数。尽管GA在训练小型DSNN方面取得了成功，但使用GA训练大型DSNN仍然是一个具有挑战性的问题。为了解决这个问题，从生物学和数学角度介绍了一类普通的DSNN，以对神经元的平均本地种群活动进行建模。基于高斯-牛顿学习和内部信任区域非线性优化方法，开发了一种利用点过程I / O优化DSNN参数的有效学习方法。为了研究DSNN的鲁棒性，做出以下假设以简化该问题：（a）连续DSNN和（b）删除I / O信号的动态特性-在稳态模式下工作。这些假设将DSNN简化为其子类，即静态神经网络。使用频域分析和径向基函数网络以及内核学习技术开发了一种鲁棒的命令识别系统。已经开发了用于连续语音中的多单词点播的系统。该系统的概念证明由一个演示演示，该演示被实现以识别驾车通过式MacDonald自动系统的27个单词。拟议的系统慢语速的敏感性，特异性和正确分类率分别为82.6％，69.1％和72.4％。此外，为了构建混合噪声鲁棒语音识别系统，基于前端的小波消噪预处理，开发了独立于说话人的音素识别系统。在TIMIT语料库的SNR级别为20至-5 dB的情况下，连续音素识别的准确性提高了16.31％至20.20％。

著录项

作者
Heidari Namarvar, Hassan.;
展开▼
作者单位

University of Southern California.;

展开▼
授予单位 University of Southern California.;
学科 Engineering Biomedical.; Engineering Electronics and Electrical.
学位 Ph.D.
年度 2005
页码 159 p.
总页数 159
原文格式 PDF
正文语种 eng
中图分类生物医学工程;无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Switching Linear Dynamical Systems for Noise Robust Speech Recognition [J] . Mesot B., Barber D. IEEE transactions on audio, speech and language processing . 2007,第6期

机译：开关线性动力系统用于噪声鲁棒语音识别
2. Very Deep Convolutional Neural Networks for Noise Robust Speech Recognition [J] . Yanmin Qian, Mengxiao Bi, Tian Tan, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2016,第12期

机译：用于噪声鲁棒语音识别的超深度卷积神经网络
3. Real-time frequency-based noise-robust Automatic Speech Recognition using Multi-Nets Artificial Neural Networks: A multi-views multi-learners approach [J] . Seyed Reza Shahamiri, Siti Salwah Binti Salim Neurocomputing . 2014,第apra10期

机译：基于多网络人工神经网络的实时基于频率的鲁棒性自动语音识别：多视图多学习者方法
4. Robust speech recognition with dynamic synapses [C] . Jim-Shih Liaw, Berger, T.W. . 1998

机译：具有动态突触的强大语音识别
5. On the robustness of static and dynamic spectral information for speech recognition in noise. [D] . Yang, Chen. 2005

机译：静态和动态频谱信息在噪声中识别语音的鲁棒性。
6. Incorporating Noise Robustness in Speech Command Recognition by Noise Augmentation of Training Data [O] . Ayesha Pervaiz, Fawad Hussain, Huma Israr, 2020

机译：通过训练数据的噪声增强将噪声鲁棒性纳入语音命令识别中
7. Switching linear dynamical systems for noise robust speech recognition [O] . David Barber 2013

机译：用于噪声鲁棒语音识别的切换线性动态系统

Noise robust speech recognition using dynamic synapse neural networks.

摘要

著录项

相似文献

相关主题

期刊订阅