首页> 外文期刊>International journal of speech technology >Identification of Indian languages using multi-level spectral and prosodic features
【24h】

Identification of Indian languages using multi-level spectral and prosodic features

机译:使用多级频谱和韵律特征识别印度语言

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper spectral and prosodic features extracted from different levels are explored for analyzing the language specific information present in speech. In this work, spectral features extracted from frames of 20 ms (block processing), individual pitch cycles (pitch synchronous analysis) and glottal closure regions are used for discriminating the languages. Prosodic features extracted from syllable, tri-syllable and multi-word (phrase) levels are proposed in addition to spectral features for capturing the language specific information. In this study, language specific prosody is represented by intonation, rhythm and stress features at syllable and tri-syllable (words) levels, whereas temporal variations in fundamental frequency (F_0 contour), durations of syllables and temporal variations in intensities (energy contour) are used to represent the prosody at multi-word (phrase) level. For analyzing the language specific information in the proposed features, Indian language speech database (IITKGP-MLILSC) is used. Gaussian mixture models are used to capture the language specific information from the proposed features. The evaluation results indicate that language identification performance is improved with combination of features. Performance of proposed features is also analyzed on standard Oregon Graduate Institute Multi-Language Telephone-based Speech (OGI-MLTS) database.
机译:本文探讨了从不同级别提取的频谱和韵律特征,以分析语音中存在的特定于语言的信息。在这项工作中,从20 ms的帧(块处理),单个音高周期(音高同步分析)和声门闭合区域中提取的频谱特征用于区分语言。除了用于捕获语言特定信息的频谱特征之外,还提出了从音节,三音节和多词(短语)级别提取的韵律特征。在这项研究中,特定语言的韵律由音节和三音节(单词)级别的语调,节奏和重音特征表示,而基本频率(F_0轮廓)的时间变化,音节的持续时间和强度的时间变化(能量轮廓)用于表示多词(短语)级别的韵律。为了分析所建议功能中的特定于语言的信息,使用了印度语言语音数据库(IITKGP-MLILSC)。高斯混合模型用于从提议的功能中捕获特定于语言的信息。评估结果表明,通过功能组合可以提高语言识别性能。在标准的俄勒冈大学研究生院基于多语言电话的语音(OGI-MLTS)数据库上还分析了建议功能的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号