【24h】

Extraction of Local Features for Tri-Phone Based Bangla ASR

机译:基于三电话的Bangla ASR的局部特征提取

获取原文
获取原文并翻译 | 示例

摘要

Inherent features of the Bangla (widely used as Bengali) language like long and short vowels and many instances of allophones make it difficult to build a continuous speech recognizer for the language. Stress and accent vary in spoken Bangla language from region to region. But in formal read Bangla speech, stress and accents are ignored. There are three approaches to continuous speech recognition (CSR) based on the sub-word unit viz. word, phoneme and syllable. Pronunciation of words and sentences are strictly governed by set of linguistic rules. Many attempts have been made to build continuous speech recognizers for Bangla for small and restricted tasks. However, medium and large vocabulary CSR for Bangla is relatively new and not explored. In this paper, the authors have attempted for extracting local features (LFs) from a Bangla input speech for tri-phone based automatic speech recognition (ASR) method. The method comprises two stages, where the first stage extracts LFs from input speech and the final stage generates word strings based on trip hone hidden Markov models (HMMs). The objective of this research is to build a medium vocabulary trip hone based continuous speech recognizer for Bangla language by extracting LFs over mel frequency cepstral coefficients (MFCCs). In this experimentation using Bangla speech corpus prepared by us, the recognizer provides higher word accuracy, word correct rate as well as sentence correct rate for trained and tested sentences with fewer mixture components in HMMs.
机译:孟加拉语(广泛用作孟加拉语)语言的固有功能(如长元音和短元音以及许多变音素实例)使得难以为该语言构建连续的语音识别器。孟加拉语的口语和重音因地区而异。但是在正式的孟加拉语演讲中,压力和重音被忽略了。存在三种基于子词单元viz的连续语音识别(CSR)的方法。单词,音素和音节。单词和句子的发音严格受一套语言规则支配。已经进行了许多尝试来为孟加拉语构建用于小型任务和受限任务的连续语音识别器。但是,孟加拉语的大中型词汇CSR相对较新,尚未探索。在本文中,作者试图从孟加拉输入语音中提取局部特征(LF),以用于基于三电话的自动语音识别(ASR)方法。该方法包括两个阶段,其中第一阶段从输入语音中提取低频,而最后阶段则基于跳闸隐藏的马尔可夫模型(HMM)生成词串。这项研究的目的是通过提取mel频率倒谱系数(MFCCs)上的LFs,为孟加拉语言构建基于中等词汇旅行磨练的连续语音识别器。在使用我们准备的孟加拉语语料库的实验中,识别器为HMM中具有较少混合成分的经过训练和测试的句子提供了更高的单词准确性,单词正确率以及句子正确率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号