Design and implementation of a web-based software framework for real time intelligent audio coding based on speech/music discrimination

Abstract

In this work, a software framework based on a client-server architecture is implemented for real-time intelligent audio coding. A speech/music discrimination scheme analyzes the input audio signal and decides, on a frame-by-frame basis, whether the signal is speech or music. According to the decision of the speech/music discriminator, a suitable coder is selected for each frame. The framework uses the speech and audio coders incorporated into the MPEG-4 audio standard (HVXC or CELP for speech frames, TwinVQ or AAC for music frames) to evaluate the performance of an intelligent multi-mode audio coder. It supports several types of audio features (timbral texture features and rhythmic content features) and classifiers: classical Statistical Pattern Recognition (SPR) classifiers, Multilayer Perceptron Neural Networks (MLPNN), Support Vector Machines (SVM), Fuzzy Expert Systems (FES), and Hidden Markov Models (HMM). An intelligent audio coder based on speech/music discrimination has been compared with MPEG-4 AAC using audio signals representative of the two classes (speech and music). Subjective and objective tests were carried out to assess the behaviour of the intelligent audio coding scheme.
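The frame-by-frame coder selection described in the abstract can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the frame length, the zero-crossing-rate feature, the fixed threshold, and the function names are hypothetical, and the toy threshold rule merely stands in for the SPR, MLPNN, SVM, FES, or HMM classifiers that the framework supports. The coder names are labels only; no MPEG-4 encoding is performed.

```python
import math

FRAME_LEN = 256  # samples per frame (assumed value, not from the paper)

# Mapping from class label to an MPEG-4 coder name. The abstract lists
# HVXC or CELP for speech and TwinVQ or AAC for music; one of each is
# picked here arbitrarily.
CODER_FOR_CLASS = {"speech": "CELP", "music": "AAC"}


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)


def classify_frame(frame, zcr_threshold=0.1):
    """Toy stand-in discriminator (hypothetical): a single threshold on
    the zero-crossing rate. A real system would use trained classifiers
    over richer timbral and rhythmic features."""
    return "speech" if zero_crossing_rate(frame) > zcr_threshold else "music"


def select_coders(signal, frame_len=FRAME_LEN):
    """Split the signal into non-overlapping frames and return one
    (label, coder) decision per complete frame."""
    decisions = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        label = classify_frame(frame)
        decisions.append((label, CODER_FOR_CLASS[label]))
    return decisions
```

Calling `select_coders(samples)` on a list of float samples yields one decision per complete frame; trailing samples that do not fill a frame are ignored in this sketch. The point of the structure is that the discriminator and the coder table are independent, so either can be swapped without touching the framing loop.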

Bibliographic details

Source: International School of Physics "Enrico Fermi", 2007, 7 pages
Authors: J.E. Munoz-Exposito; N. Ruiz-Reyes; S. Garcia-Galan; P. Vera-Candeas
Original format: PDF
Chinese Library Classification: O4-532

Similar literature

1. Speech/music discrimination based on warping transformation and fuzzy logic for intelligent audio coding [J]. Jose Enrique Munoz-Exposito, Sebastian Garcia Galan, Nicolas Ruiz Reyes. Applied Artificial Intelligence, 2009, No. 5.
2. Efficient audio-driven multimedia indexing through similarity-based speech/music discrimination [J]. Tsipas Nikolaos, Vrysis Lazaros, Dimoulas Charalampos. Multimedia Tools and Applications, 2017, No. 24.
3. Fast time-frequency transform algorithms and their applications to real-time software implementation of AC-3 audio codec [J]. Yu-Chi Chen, Chien-Wu Tsai. IEEE Transactions on Consumer Electronics, 1998, No. 2.
4. Design and implementation of a web-based software framework for real time intelligent audio coding based on speech/music discrimination [C]. J.E. Munoz-Expósito, N. Ruiz-Reyes, S. García-Galán. Audio Engineering Society 122nd Convention, 2007.
5. An architectural framework for the specification, analysis and design of intelligent real-time monitoring agent-based software systems [D]. Aborizka, Mohamed Abdelfattah. 2002.
6. Design and Implementation of an Interactive Web-Based Near Real-Time Forest Monitoring System [O]. Arun Kumar Pratihast, Ben DeVries, Valerio Avitabile.
7. Implementation of Music Embedded System Software Using Real Time Software Analysis and Design Method [O]. Seong-Min Choi, Hoon Oh. 2008.
8. Design and Real-Time Implementation of a Robust APC Coder for Speech Transmission over 16 Kb/s Noisy Channels. Volume II. Real-Time Implementation [R]. Wolf, J. J., Field, K. D., Russell, W. H. 1980.
