iVector-based prosodic system for language identification

Abstract

Prosody is the aspect of speech that carries rhythm, stress, and intonation. Language identification systems that use prosody assume these characteristics are language dependent, so that the language can be identified from them. This paper builds an automatic language recognition system that extracts prosodic information from speech and decides on the language with a generative classifier based on iVectors. The system is tested on the NIST LRE09 dataset. The results are not yet competitive with state-of-the-art acoustic and phonotactic systems. They are promising, however: fusing the new approach with an iVector-based acoustic system brings further improvements over the acoustic system alone.
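
The abstract does not give the details of the generative classifier or of the fusion step. As a rough illustration only, the sketch below assumes a Gaussian classifier with one mean per language and a shared covariance, applied to fixed-length vectors standing in for iVectors, and a simple weighted sum for fusing the scores of two systems. All function names and the `weight` parameter are hypothetical and are not taken from the paper.

```python
import numpy as np


def fit_gaussian_classifier(X, y):
    """Fit one mean per language and a single shared covariance.

    X: (n_utterances, dim) array of iVector-like features (assumed input).
    y: (n_utterances,) array of language labels.
    """
    classes = np.unique(y)
    means = np.stack([X[y == c].mean(axis=0) for c in classes])
    # Center every utterance on the mean of its own language.
    centered = X - means[np.searchsorted(classes, y)]
    cov = centered.T @ centered / len(X)
    cov += 1e-6 * np.eye(X.shape[1])  # small ridge for numerical stability
    return classes, means, np.linalg.inv(cov)


def log_likelihood_scores(X, means, precision):
    """Per-language Gaussian log-likelihoods, up to a shared constant."""
    diff = X[:, None, :] - means[None, :, :]  # (n, n_languages, dim)
    return -0.5 * np.einsum("nkd,de,nke->nk", diff, precision, diff)


def fuse(scores_a, scores_b, weight=0.5):
    """Hypothetical score-level fusion: weighted sum of two systems' scores."""
    return weight * scores_a + (1.0 - weight) * scores_b
```

Under these assumptions, the predicted language for each utterance is the column with the highest (fused) score, for example `classes[fuse(prosodic, acoustic).argmax(axis=1)]`, where `prosodic` and `acoustic` are score matrices from the two systems.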