首页> 外文学位 >Development of a robust speaker-independent isolated-word recognition system using modular fuzzy neural networks.
【24h】

Development of a robust speaker-independent isolated-word recognition system using modular fuzzy neural networks.

机译:使用模块化模糊神经网络开发鲁棒的独立于说话人的孤立词识别系统。

获取原文
获取原文并翻译 | 示例

摘要

In this investigation, a modular artificial neural network (ANN) architecture was designed that significantly reduced the training time of a conventional ANN based speaker-independent 21-word automatic speech recognition (ASR) system. The modular ANN architecture also improved recognition rates as compared to a conventional ASR system. The introduction of a gender neural network which distinguished a female voice from a male voice at the first stage of the modular architecture further enhanced the performance of the system. The resulting multistage modular artificial neural network (MMANN) based ASR system was built. The performance of this newly designed MMANN was tested with a speaker-independent recognition task of 21 isolated words, using a new set of speech data that was not used during the training process. The MMANN based speech recognition system demonstrated a performance improvement over systems based on the conventional Dynamic Time Warping (DTW) algorithm and a single monolithic ANN architecture. Fuzzy logic was combined with artificial neural networks to further improve the performance of the MMANN based word recognition system, and a multistage modular fuzzy neural network (MMFNN) based ASR system was developed. Fuzzy logic was applied to the 21-word recognition problem because it enables incomplete or imprecise data to be distinguished in a manner similar to that of humans. In the MMFNN architecture, proper use of fuzzifiers is critical to producing peak performance. The proper fuzzifiers help increase dissimilarity among the various classes in feature vectors. Empirical studies were conducted to obtain the proper fuzzy parameters since there was no general rule to decide the best values for the fuzzifiers. As a result, an MMFNN based ASR system was developed which performed best among several conventional and unconventional systems while reducing training time considerably. The modular architecture of the proposed system was also found suitable for concurrent training of individual neural networks. A very-large-scale integration (VLSI) implementation of the entire system should be useful for achieving real time performance in an isolated-word recognition system.
机译:在这项研究中,设计了一种模块化的人工神经网络(ANN)体系结构,该体系结构显着减少了基于常规ANN的独立于说话者的21词自动语音识别(ASR)系统的训练时间。与传统的ASR系统相比,模块化的ANN体系结构还提高了识别率。性别神经网络的引入在模块化架构的第一阶段将女性语音与男性语音区分开来,进一步增强了系统的性能。建立了基于多级模块化人工神经网络(MMANN)的ASR系统。使用训练过程中未使用的一组新语音数据,使用21个独立单词的与说话者无关的识别任务测试了此新设计的MMANN的性能。基于MMANN的语音识别系统比基于常规动态时间规整(DTW)算法和单个整体式ANN架构的系统具有更高的性能。模糊逻辑与人工神经网络相结合,进一步提高了基于MMANN的单词识别系统的性能,并开发了基于多级模块化模糊神经网络(MMFNN)的ASR系统。模糊逻辑被应用于21字识别问题,因为它可以以类似于人类的方式区分不完整或不精确的数据。在MMFNN体系结构中,正确使用模糊器对于产生峰值性能至关重要。适当的模糊器有助于增加特征向量中各个类别之间的相似度。由于没有通用的规则来确定模糊器的最佳值,因此进行了实证研究以获得适当的模糊参数。结果,开发了一种基于MMFNN的ASR系统,该系统在几种常规和非常规系统中表现最佳,同时大大减少了培训时间。还发现提出的系统的模块化体系结构适合于同时训练单个神经网络。整个系统的超大规模集成(VLSI)实现对于在隔离词识别系统中实现实时性能应该是有用的。

著录项

  • 作者

    Kim, Jae Hong.;

  • 作者单位

    University of Kansas.;

  • 授予单位 University of Kansas.;
  • 学科 Electrical engineering.;Artificial intelligence.
  • 学位 Ph.D.
  • 年度 1997
  • 页码 211 p.
  • 总页数 211
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号