Efficient feature extraction of speaker identification using phoneme mean F-ratio for Chinese

机译：基于汉语音素均值F值的说话人识别有效特征提取

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The features used for speaker recognition should have more speaker individual information while attenuating the linguistic information. In order to discard the linguistic information effectively, in this paper, we employed the phoneme mean F-ratio method to investigate the different contributions of different frequency region from the point of view of Chinese phoneme, and apply it for speaker identification. It is found that the speaker individual information depending on the phonemes is distributed in different frequency regions of speech sound. Based on the contribution rate, we extracted the new features and combined with GMM model. The experiment for speaker identification task is conducted with a King-ASR Chinese database. Compared with the MFCC feature, the identification error rate with the proposed feature was reduced by 32.94%. The results confirmed that the efficiency of the phoneme mean F-ratio method for improving speaker recognition performance for Chinese.

机译：用于说话人识别的功能应具有更多的说话人个人信息，同时减弱语言信息。为了有效地舍弃语言信息，本文采用音素均值F值法从汉语音素的角度研究了不同频率区域的不同贡献，并将其应用于说话人识别。已经发现，取决于音素的说话者个体信息分布在语音的不同频率区域中。基于贡献率，我们提取了新功能并与GMM模型结合。语音识别任务的实验是通过King-ASR中文数据库进行的。与MFCC特征相比，该特征的识别错误率降低了32.94％。结果证实了音素均值F比率方法对提高中文说话者识别性能的效率。

著录项

来源
《2012 8th International Symposium on Chinese Spoken Language Processing.》|2012年|p.345-348|共4页
会议地点 Hong Kong(HK);Hong Kong(HK)
作者
Zhao Chen; Wang Hongcui; Hyon Songgun; Wei Jianguo; Dang Jianwu;
展开▼
作者单位

School of Computer Science and Technology, Tianjin University, 92 Weijin Road, Nankai District, 300072, China;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类人工智能理论;人工智能理论;
关键词
feature extraction; phoneme mean F-ratio; speaker identification;

机译：特征提取;音素平均F比;说话人识别;;

相似文献

外文文献
中文文献
专利

1. Computationally efficient frame-averaged FM feature extraction for speaker recognition [J] . T. Thiruvaran, M. Nosratighods, E. Ambikairajah, Electronicsletters . 2009,第6期

机译：计算有效的帧平均FM特征提取，用于说话人识别
2. A network model of speaker identification with new feature extraction methods and asymmetric BLSTM [J] . Wang Xingmei, Xue Fuzhao, Wang Wei, Neurocomputing . 2020,第Auga25期

机译：具有新特征提取方法和非对称布斯特的扬声器识别网络模型
3. Speaker identification features extraction methods: A systematic review [J] . Tirumala Sreenivas Sremath, Shahamiri Seyed Reza, Garhwal Abhimanyu Singh, Expert Systems with Application . 2017,第deca30期

机译：说话人识别特征提取方法：系统综述
4. Efficient feature extraction of speaker identification using phoneme mean F-ratio for Chinese [C] . Zhao Chen, Wang Hongcui, Hyon Songgun, International Symposium on Chinese Spoken Language Processing . 2012

机译：使用音素均值的高效特征提取扬声器识别
5. Non-native speakers speak in phonemes: A phono-acoustic analysis of fricatives and affricates by native and Chinese speakers of English. [D] . Zhang, Wei. 2010

机译：非母语使用者在音素中说话：母语为英语的中国人和母语人士对擦音和副词进行语音声学分析。
6. Identification of Chinese Herbal Medicines from Zingiberaceae Family Using Feature Extraction and Cascade Classifier Based on Response Signals from E-Nose [O] . Lian Peng, Hui-Qin Zou, Rudolf Bauer, 2014

机译：基于电子鼻响应信号的特征提取和叶栅分类器鉴定姜科中草药
7. Features for Phoneme Independent Speaker Identification [O] . Jianglin Wang, An Ji, Michael T. Johnson 2015

机译：phoneme独立扬声器识别功能

Efficient feature extraction of speaker identification using phoneme mean F-ratio for Chinese

摘要

著录项

相似文献

相关主题

期刊订阅