首页> 外文会议>Asilomar Conference on Signals, Systems Computers >A scalable feature learning and tag prediction framework for natural environment sounds
【24h】

A scalable feature learning and tag prediction framework for natural environment sounds

机译:用于自然环境声音的可扩展功能学习和标签预测框架

获取原文

摘要

Building feature extraction approaches that can effectively characterize natural environment sounds is challenging due to the dynamic nature. In this paper, we develop a framework for feature extraction and obtaining semantic inferences from such data. In particular, we propose a new pooling strategy for deep architectures, that can preserve the temporal dynamics in the resulting representation. By constructing an ensemble of semantic embeddings, we employ an l-reconstruction based prediction algorithm for estimating the relevant tags. We evaluate our approach on challenging environmental sound recognition datasets, and show that the proposed features outperform traditional spectral features.
机译:由于具有动态性质,因此能够有效地表征自然环境声音的建筑特征提取方法具有挑战性。在本文中,我们开发了一个用于特征提取和从此类数据中获取语义推断的框架。特别是,我们为深度架构提出了一种新的池化策略,该策略可以在生成的表示中保留时间动态。通过构造语义嵌入的集合,我们采用基于l重构的预测算法来估计相关标签。我们评估了对具有挑战性的环境声音识别数据集的方法,并表明所提出的功能优于传统的频谱功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号