Conference: National Conference on Biomedical Engineering; International Iranian Conference on Biomedical Engineering

Persian Language Phone Recognition Based on Robust Extraction of Acoustic Landmarks


Abstract

Acoustic landmarks are defined as the more informative parts of the speech signal and have been shown to be beneficial in designing more robust speech recognition systems. This work presents a Persian phone recognition system based on acoustic landmarks. To build it, appropriate acoustic landmarks for the Persian language were selected and used to train an artificial neural network. Then, to boost the performance of the landmark recognition system, the model's structure and the training method were modified. The goal of these modifications is to filter out variations of acoustic landmarks as much as possible. To do so, we used neural network structures to map landmarks nonlinearly to their corresponding gold landmarks, that is, the landmarks that our landmark recognition system recognizes without any error. The experiments were carried out on a Persian database named Farsdat. The best landmark recognition model is a feedforward neural network with five hidden layers, which reaches a phone error rate (PER) of 21.74. We also attained a 0.56 percent PER improvement using our best variation filtering method.
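The abstract reports its results as a phone error rate but does not say how it was computed. As a minimal sketch, assuming the conventional definition (substitutions, deletions, and insertions from a Levenshtein alignment, divided by the number of reference phones), the metric could be computed as follows. The function names and the example phone sequences are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two phone sequences."""
    # prev[j] holds the distance between the reference prefix processed
    # so far and the first j phones of the hypothesis.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def phone_error_rate(ref: Sequence[str], hyp: Sequence[str]) -> float:
    """PER in percent, assuming the conventional edit-distance definition."""
    if not ref:
        raise ValueError("reference sequence must not be empty")
    return 100.0 * edit_distance(ref, hyp) / len(ref)


# Hypothetical example: one phone of a five-phone reference is deleted.
reference = ["s", "a", "l", "a", "m"]
hypothesis = ["s", "a", "l", "m"]
print(phone_error_rate(reference, hypothesis))
```

Under this definition, a corpus-level PER is usually obtained by summing the edit distances over all utterances and dividing by the total number of reference phones, rather than by averaging per-utterance rates.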
