首页> 外文会议>2012 8th International Symposium on Chinese Spoken Language Processing. >Efficient feature extraction of speaker identification using phoneme mean F-ratio for Chinese
【24h】

Efficient feature extraction of speaker identification using phoneme mean F-ratio for Chinese

机译:基于汉语音素均值F值的说话人识别有效特征提取

获取原文
获取原文并翻译 | 示例

摘要

The features used for speaker recognition should have more speaker individual information while attenuating the linguistic information. In order to discard the linguistic information effectively, in this paper, we employed the phoneme mean F-ratio method to investigate the different contributions of different frequency region from the point of view of Chinese phoneme, and apply it for speaker identification. It is found that the speaker individual information depending on the phonemes is distributed in different frequency regions of speech sound. Based on the contribution rate, we extracted the new features and combined with GMM model. The experiment for speaker identification task is conducted with a King-ASR Chinese database. Compared with the MFCC feature, the identification error rate with the proposed feature was reduced by 32.94%. The results confirmed that the efficiency of the phoneme mean F-ratio method for improving speaker recognition performance for Chinese.
机译:用于说话人识别的功能应具有更多的说话人个人信息,同时减弱语言信息。为了有效地舍弃语言信息,本文采用音素均值F值法从汉语音素的角度研究了不同频率区域的不同贡献,并将其应用于说话人识别。已经发现,取决于音素的说话者个体信息分布在语音的不同频率区域中。基于贡献率,我们提取了新功能并与GMM模型结合。语音识别任务的实验是通过King-ASR中文数据库进行的。与MFCC特征相比,该特征的识别错误率降低了32.94%。结果证实了音素均值F比率方法对提高中文说话者识别性能的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号