首页> 外文期刊>IEEE Journal on Selected Areas in Communications >Quantization of cepstral parameters for speech recognition over theWorld Wide Web
【24h】

Quantization of cepstral parameters for speech recognition over theWorld Wide Web

机译:量化倒频谱参数,以实现万维网上的语音识别

获取原文
获取原文并翻译 | 示例
           

摘要

We examine alternative architectures for a client-server model ofnspeech-enabled applications over the World Wide Web (WWW). We compare anserver-only processing model where the client encodes and transmits thenspeech signal to the server, to a model where the recognition front endnruns locally at the client and encodes and transmits the cepstralncoefficients to the recognition server over the Internet. We follow annovel encoding paradigm, trying to maximize recognition performanceninstead of perceptual reproduction, and we find that by transmitting thencepstral coefficients we can achieve significantly higher recognitionnperformance at a fraction of the bit rate required when encoding thenspeech signal directly. We find that the required bit rate to achieventhe recognition performance of high-quality unquantized speech is justn2000 bits per second
机译:我们研究了万维网(WWW)上启用nspeech的应用程序的客户端-服务器模型的替代体系结构。我们比较了一个仅服务器处理模型,在该模型中,客户端进行编码,然后将语音信号传输到服务器,再将模型与识别前端在客户端本地运行,然后对倒谱系数进行编码,并通过Internet将其传输到识别服务器。我们遵循nonovel编码范例,试图最大化识别性能而不是感知再现,并且我们发现通过传输正弦系数,我们可以在直接编码语音信号时所需的比特率的一小部分上实现更高的识别性能。我们发现,实现高质量非量化语音的识别性能所需的比特率仅为每秒2000比特

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号