首页> 外文会议>International Computer Conference, Computer Society of Iran >Transfer Learning for End-to-End ASR to Deal with Low-Resource Problem in Persian Language
【24h】

Transfer Learning for End-to-End ASR to Deal with Low-Resource Problem in Persian Language

机译:转移学习结束ASR以处理波斯语的低资源问题

获取原文

摘要

End-to-end models are state of the art for Automatic Speech Recognition (ASR) systems. Despite all their advantages, they suffer a significant problem: huge amounts of training data are required to achieve excellent performance. This problem is a serious challenge for low-resource languages such as Persian. Therefore, we need some methods and techniques to overcome this issue. One simple, yet effective method towards addressing this issue is transfer learning. We aim to explore the effect of transfer learning on a speech recognition system for the Persian language. To this end, we first train the network on 960 hours of English LibriSpeech corpus. Then, we transfer the trained network and fine-tune it on only about 3.5 hours of training data from the Persian FarsDat corpus. Transfer learning exhibits better performance while needing shorter training time than the model trained from scratch. Experimental results on FarsDat corpus indicate that transfer learning with a few hours of Persian training data can achieve 31.48% relative Phoneme Error Rate (PER) reduction compared to the model trained from scratch.
机译:端到端模型是用于自动语音识别(ASR)系统的最先义。尽管他们所有的优势,但它们遭受了重大问题:需要大量的培训数据来实现出色的性能。这个问题是波斯人等低资源语言的严峻挑战。因此,我们需要一些方法和技巧来克服这个问题。解决这个问题的一个简单但有效的方法是转移学习。我们的目标是探讨转移学习对波斯语语音识别系统的影响。为此,我们首先在960小时的英语LiblisPeech语料库中培训网络。然后,我们将训练有素的网络传输,并仅仅是从波斯法尔兹语料库的大约3.5小时的培训数据。转移学习表现出更好的性能,同时需要比从头开始培训的模型更短的培训时间。 Farsdat语料库上的实验结果表明,与从头开始训练的模型相比,使用几个小时的波斯训练数据的转移学习可以实现31.48%的相对音素错误率(每)减少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号