首页> 外文会议>Proceedings of 2011 3rd International Conference on Awareness Science and Technology >Environmental sound recognition using time-frequency intersection patterns
【24h】

Environmental sound recognition using time-frequency intersection patterns

机译:使用时频交叉点模式的环境声音识别

获取原文
获取原文并翻译 | 示例

摘要

Environmental sound recognition is an important function of robots and intelligent computer systems. In this research, we tried to use a multi-stage perceptron type neural network system for environmental sound recognition. The input data is the one-dimensional combination of instantaneous spectrum at power peak and the power pattern in time domain. Since for almost environmental sounds, their spectrum changes are not remarkable compared with speech or voice, the combination of power and frequency pattern will preserve the major features of environmental sounds but with drastically reduced data. Two experiments were conducted using an original database and a database created by the RWCP. The recognition rate for about 45 data kinds of environmental sound was about 92%. The merit of this method is the use of a one-dimensional input which combines the power pattern and the instantaneous spectrum of sound data. Comparing with the method using only instantaneous spectrum, the new method are sufficient for larger sound database and the recognition rate was increased about 12%. The results are also comparable with the methods of HMM, while those methods require 2-dimensional spectrum time series data and more complicated computation.
机译:环境声音识别是机器人和智能计算机系统的重要功能。在本研究中,我们尝试使用多级感知器型神经网络系统进行环境声音识别。输入数据是功率峰值处的瞬时频谱和时域中的功率模式的一维组合。由于几乎对于环境声音而言,它们的频谱变化与语音或语音相比并不明显,因此功率和频率模式的组合将保留环境声音的主要特征,但数据会大大减少。使用原始数据库和RWCP创建的数据库进行了两次实验。大约45种数据类型的环境声音的识别率约为92%。此方法的优点是使用一维输入,该输入将功率模式和声音数据的瞬时频谱结合在一起。与仅使用瞬时频谱的方法相比,该新方法足以用于较大的声音数据库,并且识别率提高了约12%。结果也与HMM方法相当,而这些方法需要二维频谱时间序列数据和更复杂的计算。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号