Quantization of cepstral parameters for speech recognition over theWorld Wide Web

Digalakis V.V.; Neumeyer L.G.; Perakakis M.

首页> 外文期刊>IEEE Journal on Selected Areas in Communications >Quantization of cepstral parameters for speech recognition over theWorld Wide Web

【24h】

Quantization of cepstral parameters for speech recognition over theWorld Wide Web

机译：量化倒频谱参数，以实现万维网上的语音识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We examine alternative architectures for a client-server model ofnspeech-enabled applications over the World Wide Web (WWW). We compare anserver-only processing model where the client encodes and transmits thenspeech signal to the server, to a model where the recognition front endnruns locally at the client and encodes and transmits the cepstralncoefficients to the recognition server over the Internet. We follow annovel encoding paradigm, trying to maximize recognition performanceninstead of perceptual reproduction, and we find that by transmitting thencepstral coefficients we can achieve significantly higher recognitionnperformance at a fraction of the bit rate required when encoding thenspeech signal directly. We find that the required bit rate to achieventhe recognition performance of high-quality unquantized speech is justn2000 bits per second

机译：我们研究了万维网（WWW）上启用nspeech的应用程序的客户端-服务器模型的替代体系结构。我们比较了一个仅服务器处理模型，在该模型中，客户端进行编码，然后将语音信号传输到服务器，再将模型与识别前端在客户端本地运行，然后对倒谱系数进行编码，并通过Internet将其传输到识别服务器。我们遵循nonovel编码范例，试图最大化识别性能而不是感知再现，并且我们发现通过传输正弦系数，我们可以在直接编码语音信号时所需的比特率的一小部分上实现更高的识别性能。我们发现，实现高质量非量化语音的识别性能所需的比特率仅为每秒2000比特

著录项

来源
《IEEE Journal on Selected Areas in Communications》 |1999年第1期|p.82-90|共9页
作者
Digalakis V.V.; Neumeyer L.G.; Perakakis M.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;
关键词
Internet; cepstral analysis; client-server systems; information resources; speech coding; speech recognition; 2000 bit/s; WWW; World Wide Web; bit rate; cepstral coefficients; cepstral parameters quantization; client-server model; encoding paradigm; high-quality unqua;

机译：互联网;倒频谱分析;客户端-服务器系统;信息资源;语音编码;语音识别;2000 bit / s;WWW;万维网;比特率;倒频谱系数;倒频谱参数量化;客户端-服务器模型;编码范例;高质量不合格;

相似文献

外文文献
中文文献
专利

1. Quantization of cepstral parameters for speech recognition over the World Wide Web [J] . Digalakis V.V., Neumeyer L.G. IEEE Journal on Selected Areas in Communications . 1999,第1期

机译：量化倒频谱参数，以实现万维网上的语音识别
2. Speech Recognition for Isolated Words using Mel-Frequency Cepstral Coefficients (MFCC) and Vector Quantization (VQ) [J] . Yogesh S. Angal, R. H. Chile, R. S. Holambe Journal of the Instrument Society of India: Proceedings of the national symposium on instrumentation . 2011,第3期

机译：使用Mel频率倒谱系数（MFCC）和矢量量化（VQ）对孤立单词进行语音识别
3. Predictive Trellis-Coded Quantization of the Cepstral Coefficients for the Distributed Speech Recognition [J] . Sangwon KANG, Joonseok LEE IEICE Transactions on Communications . 2007,第6期

机译：分布语音识别的倒谱系数的预测网格编码量化
4. Quantization of cepstral parameters for speech recognition over the World Wide Web [C] . Digalakis, V., Neumeyer, . 1998

机译：量化倒频谱参数，以实现万维网上的语音识别
5. Estimation of cepstral coefficients for robust speech recognition. [D] . Indrebo, Kevin M. 2008

机译：倒频谱系数的估计，用于鲁棒的语音识别。
6. The application of fractional Mel cepstral coefficient in deceptive speech detection [O] . Xinyu Pan, Heming Zhao, Yan Zhou -1

机译：分数梅尔倒谱系数在欺骗性语音检测中的应用
7. Quantization of cepstral parameters for speech recognition over the World Wide Web [O] . V. Digalakis, L. Neumeyer, M. Perakakis 1999

机译：量化倒频谱参数，以实现万维网上的语音识别

Quantization of cepstral parameters for speech recognition over theWorld Wide Web

摘要

著录项

相似文献

相关主题

期刊订阅