首页> 外文期刊>The international arab journal of information technology >Recognition of Spoken Bengali Numerals Using MLP, SVM, RF Based Models with PCA Based Feature Summarization
【24h】

Recognition of Spoken Bengali Numerals Using MLP, SVM, RF Based Models with PCA Based Feature Summarization

机译:使用基于MLP,SVM,RF的模型和基于PCA的特征汇总识别孟加拉语数字

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents a method of automatic recognition of Bengali numerals spoken in noise-free and noisy environments by multiple speakers with different dialects. Mel Frequency Cepstral Coefficients (MFCC) are used for feature extraction, and Principal Component Analysis is used as a feature summarizer to form the feature vector from the MFCC data for each digit utterance. Finally, we use Support Vector Machines, Multi-Layer Perceptrons, and Random Forests to recognize the Bengali digits and compare their performance. In our approach, we treat each digit utterance as a single indivisible entity, and we attempt to recognize it using features of the digit utterance as a whole. This approach can therefore be easily applied to spoken digit recognition tasks for other languages as well.
机译:本文提出了一种自动识别孟加拉国数字的方法,该语言在无噪声和嘈杂的环境中由具有不同方言的多个说话者组成。梅尔频率倒谱系数(MFCC)用于特征提取,主成分分析用作特征汇总器,以针对每个数字发声从MFCC数据中形成特征向量。最后,我们使用支持向量机,多层感知器和随机森林来识别孟加拉数字并比较它们的性能。在我们的方法中,我们将每个数字发声视为一个不可分割的实体,并尝试使用整个数字发声的特征来识别它。因此,该方法也可以轻松地应用于其他语言的口头数字识别任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号