Environmental sound recognition using time-frequency intersection patterns

机译：使用时频交叉点模式的环境声音识别

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Environmental sound recognition is an important function of robots and intelligent computer systems. In this research, we tried to use a multi-stage perceptron type neural network system for environmental sound recognition. The input data is the one-dimensional combination of instantaneous spectrum at power peak and the power pattern in time domain. Since for almost environmental sounds, their spectrum changes are not remarkable compared with speech or voice, the combination of power and frequency pattern will preserve the major features of environmental sounds but with drastically reduced data. Two experiments were conducted using an original database and a database created by the RWCP. The recognition rate for about 45 data kinds of environmental sound was about 92%. The merit of this method is the use of a one-dimensional input which combines the power pattern and the instantaneous spectrum of sound data. Comparing with the method using only instantaneous spectrum, the new method are sufficient for larger sound database and the recognition rate was increased about 12%. The results are also comparable with the methods of HMM, while those methods require 2-dimensional spectrum time series data and more complicated computation.

机译：环境声音识别是机器人和智能计算机系统的重要功能。在本研究中，我们尝试使用多级感知器型神经网络系统进行环境声音识别。输入数据是功率峰值处的瞬时频谱和时域中的功率模式的一维组合。由于几乎对于环境声音而言，它们的频谱变化与语音或语音相比并不明显，因此功率和频率模式的组合将保留环境声音的主要特征，但数据会大大减少。使用原始数据库和RWCP创建的数据库进行了两次实验。大约45种数据类型的环境声音的识别率约为92％。此方法的优点是使用一维输入，该输入将功率模式和声音数据的瞬时频谱结合在一起。与仅使用瞬时频谱的方法相比，该新方法足以用于较大的声音数据库，并且识别率提高了约12％。结果也与HMM方法相当，而这些方法需要二维频谱时间序列数据和更复杂的计算。

著录项

来源
《Proceedings of 2011 3rd International Conference on Awareness Science and Technology》|2011年|p.243-246|共4页
会议地点 Dalian(CN)
作者
Xuan Guo; Toyoda Yoshiyuki; Huankang Li; Jie Huang; Shuxue Ding; Yong Liu;
展开▼
作者单位

School of Computer Science and Engineering, Department of Information Systems, The University of Aizu, Japan;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Environmental sound recognition; Robotic audition; Time-frequency intersection pattern;

机译：环境声识别机器人试听时频交会模式;

相似文献

外文文献
中文文献
专利

1. Environmental Sound Recognition Using Time-Frequency Intersection Patterns [J] . XuanGuo, YoshiyukiToyoda, HuankangLi, Applied computational intelligence and soft computing . 2012,第8期

机译：使用时频交叉点模式的环境声音识别
2. Environmental Sound Recognition Using Time-Frequency Intersection Patterns [J] . Xuan Guo, Yoshiyuki Toyoda, Huankang Li, Applied computational intelligence and soft computing . 2012,第期

机译：使用时频交叉点模式的环境声音识别
3. The Intersection of Sound Principles of Environmental Epidemiologic Research and Ethical Guidelines and Review: An Example from Canada of an Environmental Case-Control Study [J] . Mark S. Goldberg Reviews on environmental health . 2010,第2期

机译：合理的环境流行病学研究原理与道德准则与审查的交叉点：加拿大环境案例对照研究的例子
4. Environmental sound recognition using time-frequency intersection patterns [C] . Xuan Guo, Toyoda Yoshiyuki, Huankang Li, International Conference on Awareness Science and Technology . 2011

机译：使用时频交叉模式的环境声音识别
5. The influence of sound spectrum on recognition of temporal pattern of cricket (Teleogryllus oceanicus) song. [D] . El-Feghaly, Edmond M. 1992

机译：声谱对recognition（Teleogryllus oceanicus）歌曲时间模式识别的影响。
6. Extracting time-frequency feature of single-channel vastus medialis EMG signals for knee exercise pattern recognition [O] . Yi Zhang, Peiyang Li, Xuyang Zhu, 2011

机译：提取单通道腓肠肌肌电信号的时频特征用于膝盖运动模式识别
7. Sound-Imitation Word Recognition for Environmental Sounds: Disambiguation in Determining Phonemes of Sound-Imitation Words [O] . Kazushi Ishihara, Kazunori Komatani, Tetsuya Ogata, 2005

机译：用于环境声音的声音仿制词：在确定声音仿词音素时歧义

Environmental sound recognition using time-frequency intersection patterns

摘要

著录项

相似文献

相关主题

期刊订阅