首页> 外文会议>Hellenic Conference on Artificial Intelligence >Audio Feature Selection for Recognition of Non-linguistic Vocalization Sounds
【24h】

Audio Feature Selection for Recognition of Non-linguistic Vocalization Sounds

机译:识别非语言声音声音的音频特征选择

获取原文

摘要

Aiming at automatic detection of non-linguistic sounds from vocalizations, we investigate the applicability of various subsets of audio features, which were formed on the basis of ranking the relevance and the individual quality of several audio features. Specifically, based on the ranking of the large set of audio descriptors, we performed selection of subsets and evaluated them on the non-linguistic sound recognition task. During the audio parameterization process, every input utterance is converted to a single feature vector, which consists of 207 parameters. Next, a subset of this feature vector is fed to a classification model, which aims at straight estimation of the unknown sound class. The experimental evaluation showed that the feature vector composed of the 50-best ranked parameters provides a good trade-off between computational demands and accuracy, and that the best accuracy, in terms of recognition accuracy, is observed for the 150-best subset.
机译:针对从发声的自动检测非语言声音,我们调查了各种音频功能子集的适用性,这些数据集是在排序相关性和多个音频特征的各个质量的基础上形成的。具体而言,基于大量音频描述符的排名,我们执行了对子集的选择并在非语言声音识别任务上进行评估。在音频参数化过程中,每个输入话语都被转换为单个特征向量,该传感器由207个参数组成。接下来,将该特征向量的子集馈送到分类模型,该分类模型旨在直接估计未知声类。实验评估表明,由50最佳排名参数组成的特征向量在计算需求和准确性之间提供了良好的权衡,并且在150个最佳子集中观察到识别准确性的最佳精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号