Classification of phonemes using modulation spectrogram based features for Gujarati language

机译：使用古吉拉特语基于调制频谱图的特征对音素进行分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, features extracted from modulation spectrogram are used to classify the phonemes in Gujarati language. Modulation spectrogram which is a 2-dimensional (i.e., 2-D) feature vector, is then reduced to a smaller feature dimension by using the proposed feature extraction method. Gujarati database was manually segmented in 31 phoneme classes. These phonemes are then classified using support vector machine (SVM) classifier. Classification accuracy of phoneme classification is 94.5 % as opposed to classification with the state-of-the-art feature set Mel frequency cepstral coefficients (MFCC), which yields 92.74 % classification accuracy. Classification accuracy for broad phoneme classes, viz., vowel, stops, nasals, semivowels, affricates and fricatives is also determined. Phoneme classification in their respective classes is 95.03 % correct with the proposed feature set. Fusion of MFCC with the proposed feature set is performing even better, giving phoneme classification accuracy of 95.7%. With the fusion of features phoneme classification in sonorant and obstruent classes is found to be 97.01 % accurate.

机译：在本文中，使用从调制频谱图提取的特征对古吉拉特语中的音素进行分类。然后，通过使用所提出的特征提取方法，将作为二维（即2-D）特征向量的调制频谱图减小为较小的特征维。古吉拉特语数据库被手动划分为31个音素类。然后使用支持向量机（SVM）分类器对这些音素进行分类。音素分类的分类准确度为94.5％，与使用最新功能集梅尔频率倒谱系数（MFCC）进行分类相比，它的分类准确度为92.74％。还确定了宽音素类别的分类准确度，即元音，停止音，鼻音，半元音，附加音和摩擦音。所建议的功能集在其各自类别中的音素分类正确率为95.03％。 MFCC与提出的功能集的融合效果更好，音素分类准确率达95.7％。通过将特征音素分类合并在声音和淫秽类中，可以达到97.01％的准确率。

著录项

来源
《International conference on asian language processing》|2014年|46-49|共4页
会议地点
作者
Chittora A.; Patil H.A.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
cepstral analysis; feature extraction; natural language processing; signal classification; speech processing; support vector machines; vectors; 2-dimensional feature vector; Gujarati database; Gujarati language; MFCC; SVM classifier; broad phoneme classes; feature dimension; feature extraction method; feature set mel frequency cepstral coefficient; modulation spectrogram based features; phoneme classification accuracy; support vector machine classifier; Accuracy; Acoustics; Feature extraction; Frequency modulation; Spectrogram; Speech; Phonemes; acoustic and modulation frequency. Support vector machine classifier; modulation spectrogram;

机译：倒谱分析;特征提取;自然语言处理;信号分类;语音处理;支持向量机;向量;二维特征向量;古吉拉特语数据库;古吉拉特语; MFCC; SVM分类器;宽音素类别;特征维;特征提取方法;特征集mel频率倒谱系数;基于调制频谱图的特征;音素分类精度;支持向量机分类器;精度;声学;特征提取;频率调制;频谱图;语音;音素;声学和调制频率。支持向量机分类器;调制频谱图;

相似文献

外文文献
中文文献
专利

1. A new variance-based approach for discriminative feature extraction in machine hearing classification using spectrogram features [J] . Xie Zhipeng, McLoughlina Ian, Zhang Haomin, Digital Signal Processing . 2016,第Null期

机译：一种新的基于方差的声谱图特征在机器听力分类中的歧视性特征提取方法
2. On combining acoustic and modulation spectrograms in an attention LSTM-based system for speech intelligibility level classification [J] . Gallardo-Antolin Ascension, Montero Juan M. Neurocomputing . 2021,第Octa7期

机译：在基于LSTM的语音清晰度分类中的注意力和调制谱图中结合声学和调制谱图
3. Modulation classification based on spectrogram [J] . Jie Yang, Chenzhou Ye, Yue Zhou Systems Engineering and Electronics, Journal of . 2005,第3期

机译：基于频谱图的调制分类
4. Classification of phonemes using modulation spectrogram based features for Gujarati language [C] . Chittora A., Patil H.A. International conference on asian language processing . 2014

机译：基于古吉拉特语言的调制谱图的特征对音素进行分类
5. Automatic Modulation Classification Using Cyclic Features via Compressed Sensing [D] . Ramsey, Andrew J. 2018

机译：自动调制分类使用循环特征通过压缩感测
6. Automatic Modulation Classification Based on Deep Feature Fusion for High Noise Level and Large Dynamic Input [O] . Hui Han, Zhiyuan Ren, Lin Li, 2021

机译：基于深噪声水平和大动态输入的深色特征融合自动调制分类
7. Classification of Fricatives Using Novel Modulation Spectrogram Based Features [O] . Kewal D. Malde, Anshu Chittora, Hemant A. Patil 2013

机译：基于新型调制谱图的特征分类摩擦分类
8. Military Typesetting Equipment and Systems for Indo-Aryan and Dravidian Languages (Hindi, Marathi, Bengali, Punjabi, Gujarati, Malayalam, Tamil, and Telugu) (1961-1963) [R] . Nitenson, E. 1964

机译：印度 - 雅利安语和德拉威语的军事排版设备和系统（印地语，马拉地语，孟加拉语，旁遮普语，古吉拉特语，马拉雅拉姆语，泰米尔语和泰卢固语）（1961-1963）

Classification of phonemes using modulation spectrogram based features for Gujarati language

摘要

著录项

相似文献

相关主题

期刊订阅