...
首页> 外文期刊>Journal of Signal and Information Processing >Investigation of Automatic Speech Recognition Systems via the Multilingual Deep Neural Network Modeling Methods for a Very Low-Resource Language, Chaha
【24h】

Investigation of Automatic Speech Recognition Systems via the Multilingual Deep Neural Network Modeling Methods for a Very Low-Resource Language, Chaha

机译:Chaha非常低于资源语言的多语言深神经网络建模方法对自动语音识别系统的研究

获取原文
           

摘要

Automatic speech recognition (ASR) is vital for verylow-resource languages for mitigating the extinction trouble. Chaha is one ofthe low-resource languages, which suffers from the problem of resourceinsufficiency and some of its phonological, morphological, and orthographicfeatures challenge the development and initiatives in the area of ASR. Byconsidering these challenges, this study is the first endeavor, which analyzedthe characteristics of the language, prepared speech corpus, and developed differentASR systems. A small 3-hour read speech corpus was prepared and transcribed.Different basic and rounded phone unit-based speech recognizers were exploredusing multilingual deep neural network (DNN) modeling methods. The experimentalresults demonstrated that all the basic phone and rounded phone unit-basedmultilingual models outperformed the corresponding unilingual models with therelative performance improvements of 5.47% to 19.87% and 5.74% to 16.77%,respectively. The rounded phone unit-based multilingual models outperformed theequivalent basic phone unit-based models with relative performance improvementsof 0.95% to 4.98%. Overall, we discovered that multilingual DNN modelingmethods are profoundly effective to develop Chaha speech recognizers. Both thebasic and rounded phone acoustic units are convenient to build Chaha ASRsystem. However, the rounded phone unit-based models are superior inperformance and faster in recognition speed over the corresponding basic phoneunit-based models. Hence, the rounded phone units are the most suitableacoustic units to develop Chaha ASR systems.
机译:自动语音识别(ASR)对于宽松资源语言至关重要,以减轻灭绝问题。 Chaha是一种低资源语言之一,遭受资产化的问题和其一些语音,形态学,矫形物,俄罗斯科矫正挑战ASR地区的发展和举措。通过考虑这些挑战,这项研究是第一次努力,分析了语言,准备的语音语料库和开发的不同态系统的特征。准备了一个小的3小时读语音语料库并转录。基于基础的基本和圆形电话单元的语音识别器进行了多语语言的深神经网络(DNN)建模方法。实验结果表明,所有基于基本的电话和圆形电话单元的司机模型显得了相应的未语模型,其性能提高分别为5.47%至19.87%和5.74%至16.77%。基于圆形的电话单元的多语言型号优于基于等平等的基础电话单元的模型,具有相对性能的改进0.95%至4.98%。总的来说,我们发现,多语种DNN造型方法对于开发Chaha语音识别人员来说是深刻的有效性。 TheBasic和圆形电话的声学单元都很方便构建Chaha Asrsystem。但是,基于圆形的电话单元的模型均匀且识别速度较快,在相应的基于基本的基本相互连接的模型上的识别速度更快。因此,圆形的电话单元是开发Chaha ASR系统的最具防护装置。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号