首页> 外文期刊>Circuits and systems >An Efficient Approach for Segmentation, Feature Extraction and Classification of Audio Signals
【24h】

An Efficient Approach for Segmentation, Feature Extraction and Classification of Audio Signals

机译:音频信号的分割,特征提取和分类的有效方法

获取原文
获取原文并翻译 | 示例
           

摘要

Due to the presence of non-stationarities and discontinuities in the audio signal, segmentation and classification of audio signal is a really challenging task. Automatic music classification and annotation is still considered as a challenging task due to the difficulty of extracting and selecting the optimal audio features. Hence, this paper proposes an efficient approach for segmentation, feature extraction and classification of audio signals. Enhanced Mel Frequency Cepstral Coefficient (EMFCC)-Enhanced Power Normalized Cepstral Coefficients (EPNCC) based feature extraction is applied for the extraction of features from the audio signal. Then, multi-level classification is done to classify the audio signal as a musical or non-musical signal. The proposed approach achieves better performance in terms of precision, Normalized Mutual Information (NMI), F-score and entropy. The PNN classifier shows high False Rejection Rate (FRR), False Acceptance Rate (FAR), Genuine Acceptance rate (GAR), sensitivity, specificity and accuracy with respect to the number of classes.
机译:由于音频信号中存在非平稳性和不连续性,因此对音频信号进行分段和分类是一项真正具有挑战性的任务。由于难以提取和选择最佳音频特征,自动音乐分类和注释仍被认为是一项具有挑战性的任务。因此,本文提出了一种有效的音频信号分割,特征提取和分类方法。基于增强梅尔频率倒谱系数(EMFCC)-增强功率归一化倒谱系数(EPNCC)的特征提取适用于从音频信号中提取特征。然后,进行多级分类以将音频信号分类为音乐或非音乐信号。所提出的方法在精度,归一化互信息(NMI),F得分和熵方面都实现了更好的性能。 PNN分类器显示出较高的错误拒绝率(FRR),错误接受率(FAR),真实接受率(GAR),相对于类数的敏感性,特异性和准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号