首页> 外文学位 >Noise robust speech recognition using dynamic synapse neural networks.
【24h】

Noise robust speech recognition using dynamic synapse neural networks.

机译:使用动态突触神经网络的噪声鲁棒语音识别。

获取原文
获取原文并翻译 | 示例

摘要

Training parameters of Dynamic Synapse Neural Networks (DSNNs) is an ill-conditioned problem, due to nonlinearity, high dimensionality, and large-scale information processing. In addition to convergence problem, a trained neural network might be unstable which causes a poor generalization for unseen data. Therefore, dynamic networks have been lacking proper learning algorithm to extract complex information from spatio-temporal patterns.; In this dissertation, a new discrete version of DSNN based on the Wavelet filter bank and the Genetic Algorithms (GAs) learning method is introduced to optimize the DSNN parameters. Despite the success of GAs on training of a small-scaled DSNN, training of a large-scale DSNN with GAs becomes a challenging problem. To address this issue, a general class of DSNNs from biological and mathematical perspective is introduced to model the average local population activity of neurons. Efficient learning methods to optimize the DSNN parameters with point-process I/O are developed based on the Gauss-Newton learning and an interior trust-region (TR) nonlinear optimization method.; In order to study the robustness of the DSNNs, the following assumptions are made to simplify the problem: (a) continuous DSNNs and (b) removing the dynamic characteristics of I/O signals---working in steady state mode. These assumptions reduce DSNNs to their sub-classes, i.e., static neural network. A robust command recognition system is developed using frequency domain analysis and radial basis function networks with the kernel learning. A system for multiword spotting in continuous speech has been developed. A proof-of-concept of this system is illustrated by a demo which is implemented to identify 27 words for a drive-through MacDonald automatic system. The sensitivity, specificity and correct classification rate of the proposed system for a slow rate speaking are 82.6%, 69.1 %, and 72.4%, respectively.; Moreover, to construct a hybrid noise robust speech recognition system, a speaker-independent phoneme recognition system is developed based on the wavelet de-noising preprocessing at front-end. The improvement of 16.31% to 20.20% on the accuracy of continuous phoneme recognition under SNR levels of 20 to -5 dB for the TIMIT corpus was obtained.
机译:由于非线性,高维和大规模信息处理,动态突触神经网络(DSNN)的训练参数是一个病态问题。除了收敛问题外,训练有素的神经网络可能会不稳定,从而导致看不见数据的泛化能力很差。因此,动态网络一直缺乏适当的学习算法来从时空模式中提取复杂的信息。本文提出了一种基于小波滤波器组和遗传算法学习方法的离散神经网络新版本,以优化神经网络参数。尽管GA在训练小型DSNN方面取得了成功,但使用GA训练大型DSNN仍然是一个具有挑战性的问题。为了解决这个问题,从生物学和数学角度介绍了一类普通的DSNN,以对神经元的平均本地种群活动进行建模。基于高斯-牛顿学习和内部信任区域非线性优化方法,开发了一种利用点过程I / O优化DSNN参数的有效学习方法。为了研究DSNN的鲁棒性,做出以下假设以简化该问题:(a)连续DSNN和(b)删除I / O信号的动态特性-在稳态模式下工作。这些假设将DSNN简化为其子类,即静态神经网络。使用频域分析和径向基函数网络以及内核学习技术开发了一种鲁棒的命令识别系统。已经开发了用于连续语音中的多单词点播的系统。该系统的概念证明由一个演示演示,该演示被实现以识别驾车通过式MacDonald自动系统的27个单词。拟议的系统慢语速的敏感性,特异性和正确分类率分别为82.6%,69.1%和72.4%。此外,为了构建混合噪声鲁棒语音识别系统,基于前端的小波消噪预处理,开发了独立于说话人的音素识别系统。在TIMIT语料库的SNR级别为20至-5 dB的情况下,连续音素识别的准确性提高了16.31%至20.20%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号