A Method of Environmental Sound Classification Based on Residual Networks and Data Augmentation

Zeng Jinfang; Li Youming; Zhang YuChen Da

首页> 外文期刊>International Journal of Computational Intelligence and Applications >A Method of Environmental Sound Classification Based on Residual Networks and Data Augmentation

【24h】

A Method of Environmental Sound Classification Based on Residual Networks and Data Augmentation

机译：A Method of Environmental Sound Classification Based on Residual Networks and Data Augmentation

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

Environmental sound classication (ESC) is a challenging problem due to the complexity of sounds. To date, a variety of signal processing and machine learning techniques have been applied to ESC task, including matrix factorization, dictionary learning, waveletlterbanks and deep neural networks. It is observed that features extracted from deeper networks tend to achieve higher performance than those extracted from shallow networks. However, in ESC task, only the deep convolutional neural networks (CNNs) which contain several layers are used and the residual networks are ignored, which lead to degradation in the performance. Meanwhile, a possible explanation for the limited exploration of CNNs and the diffculty to improve on simpler models is the relative scarcity of labeled data for ESC. In this paper, a residual network called EnvResNet for the ESC task is proposed. In addition, we propose to use audio data augmentation to overcome the problem of data scarcity. The experiments will be performed on the ESC-50 database. Combined with data augmentation, the proposed model outperforms baseline implementations relying on mel-frequency cepstral coeffcients and achieves results comparable to other state-of-the-art approaches in terms of classifcation accuracy.

著录项

来源
《International Journal of Computational Intelligence and Applications》 |2021年第3期|共10页
作者
Zeng Jinfang; Li Youming; Zhang YuChen Da;
展开▼
作者单位

Xiang Tan Univ, Sch Phys & Optoelect, Xiangtan 411105, Hunan, Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种英语
中图分类人工智能理论;
关键词
Environmental sound classification; residual networks; data augmentation;

A Method of Environmental Sound Classification Based on Residual Networks and Data Augmentation

摘要

著录项

相关主题

期刊订阅