Audio Feature Selection for Recognition of Non-linguistic Vocalization Sounds

机译：识别非语言声音声音的音频特征选择

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Aiming at automatic detection of non-linguistic sounds from vocalizations, we investigate the applicability of various subsets of audio features, which were formed on the basis of ranking the relevance and the individual quality of several audio features. Specifically, based on the ranking of the large set of audio descriptors, we performed selection of subsets and evaluated them on the non-linguistic sound recognition task. During the audio parameterization process, every input utterance is converted to a single feature vector, which consists of 207 parameters. Next, a subset of this feature vector is fed to a classification model, which aims at straight estimation of the unknown sound class. The experimental evaluation showed that the feature vector composed of the 50-best ranked parameters provides a good trade-off between computational demands and accuracy, and that the best accuracy, in terms of recognition accuracy, is observed for the 150-best subset.

机译：针对从发声的自动检测非语言声音，我们调查了各种音频功能子集的适用性，这些数据集是在排序相关性和多个音频特征的各个质量的基础上形成的。具体而言，基于大量音频描述符的排名，我们执行了对子集的选择并在非语言声音识别任务上进行评估。在音频参数化过程中，每个输入话语都被转换为单个特征向量，该传感器由207个参数组成。接下来，将该特征向量的子集馈送到分类模型，该分类模型旨在直接估计未知声类。实验评估表明，由50最佳排名参数组成的特征向量在计算需求和准确性之间提供了良好的权衡，并且在150个最佳子集中观察到识别准确性的最佳精度。

著录项

来源
《Hellenic Conference on Artificial Intelligence》|2014年||共11页
会议地点
作者
Theodoros Theodorou; Iosif Mporas; Nikos Fakotakis;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
Non-linguistic vocalizations; Sound recognition; Audio features; Classification algorithms;

机译：非语言发声;声音识别;音频功能;分类算法;

相似文献

外文文献
中文文献
专利

1. Supervised machine learning for audio emotion recognition Enhancing film sound design using audio features, regression models and artificial neural networks [J] . Stuart Cunningham, Harrison Ridley, Jonathan Weinel, Personal and Ubiquitous Computing . 2021,第4期

机译：监督机器学习音频情感识别使用音频特征，回归模型和人工神经网络增强电影声音设计
2. The development of emotion recognition from facial expressions and non-linguistic vocalizations during childhood [J] . Chronaki Georgia, Hadwin Julie A., Garner Matthew, The British journal of developmental psychology . 2015,第Pta2期

机译：儿童时期从面部表情和非语言发声的情感识别发展
3. Wavelet feature selection of audio and imagined/vocalized EEG signals for ANN based multimodal ASR system [J] . Mini P. P., Thomas Tessamma, Gopikakumari R. Biomedical signal processing and control . 2021,第Jana期

机译：基于ANN的多模式ASR系统的音频和想象/发声EEG信号的小波特征选择
4. Audio Feature Selection for Recognition of Non-linguistic Vocalization Sounds [C] . Theodoros Theodorou, Iosif Mporas, Nikos Fakotakis Artificial intelligence: methods and applications . 2014

机译：音频特征选择，用于识别非语言发声
5. Non-linguistic Vocalization Recognition Based on Convolutional, Long Short-term Memory, Deep Neural Networks [D] . Qiu, Liang. 2018

机译：基于卷积，长短时记忆，深度神经网络的非语言语音识别
6. A Mathematical Approach to Correlating Objective Spectro-Temporal Features of Non-linguistic Sounds With Their Subjective Perceptions in Humans [O] . Thomas Burns, Ramesh Rajan 2010

机译：一种将非语言声音的客观频谱-时间特征与其在人类中的主观感知相关联的数学方法
7. The development of emotion recognition from facial expressions and non-linguistic vocalizations during childhood [O] . Chronaki G, Hadwin JA, Garner M, 2015

机译：儿童时期面部表情和非语言发声情绪识别的发展

Audio Feature Selection for Recognition of Non-linguistic Vocalization Sounds

摘要

著录项

相似文献

相关主题

期刊订阅