首页> 外文会议>International School of Physics "Enrico Fermi" >Design and implementation of a web-based software framework for real time intelligent audio coding based on speech/music discrimination
【24h】

Design and implementation of a web-based software framework for real time intelligent audio coding based on speech/music discrimination

机译:基于网络/音乐辨别的实时智能音频编码的基于Web的软件框架的设计与实现

获取原文

摘要

In this work a software framework based on client-server architecture is implemented for real time intelligent audio coding. A speech/music discrimination scheme analyzes the input audio signal and takes a decision about the nature of the audio signal (speech or music) on a frame by frame basis. According to the decision of the speech/music discriminator, a suitable coder is selected at each frame. The designed software framework makes use of the speech and audio coders incorporated into the MPEG4 audio standard (HVXC or CELP for speech frames and TwinVQ or AAC for music frames) to evaluate the performance of an intelligent multi-mode audio coder. The framework supports several types of audio features (timbral texture features and rhythmic content features) and classifiers (classical Statistical Pattern Recognition (SPR) classifiers, Multilayer Perception Neural Networks (MLPNN), Support Vector Machines (SVM), Fuzzy Expert Systems (FES), Hidden Markov Models (HMM)) Comparison between a speech/music discrimination based-intelligent audio coder and MPEG4-AAC has been performed using audio signals representative of the two corresponding classes (speech and music). Subjective and objective tests have been accomplished aiming at assessing the behaviour of the intelligent audio coding scheme.
机译:在这项工作中,基于客户端 - 服务器架构的软件框架用于实时智能音频编码。语音/音乐鉴别方案分析输入音频信号,并通过帧的基础上帧上的音频信号(语音或音乐)的性质。根据语音/音乐鉴别器的决定,在每个帧中选择合适的编码器。设计的软件框架利用包含在MPEG4音频标准(HVXC或CELP用于语音帧和TWINVQ或AAC的MECVQ或TWINAMS)中的语音和音频编码器来评估智能多模式音频编码器的性能。该框架支持几种类型的音频功能(Timbral纹理特征和节奏内容特征)和分类器(经典统计模式识别(SPR)分类器,多层感知神经网络(MLPNN),支持向量机(SVM),模糊专家系统(FES) ,隐马尔可夫模型(HMM))一个语音/音乐鉴别基于智能音频编码器和MPEG4-AAC已经使用代表两个相应的类(语音和音乐)的音频信号进行比较。已经完成了主观和客观测试,旨在评估智能音频编码方案的行为。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号