首页> 外文学位 >Compression algorithms for distributed classification with applications to distributed speech recognition.
【24h】

Compression algorithms for distributed classification with applications to distributed speech recognition.

机译:用于分布式分类的压缩算法及其在分布式语音识别中的应用。

获取原文
获取原文并翻译 | 示例

摘要

With the wide proliferation of mobile devices coupled with the explosion of new multimedia applications, there is a need for adopting a client-server architecture to enable clients with low complexity/memory to support complex multimedia applications. In these client-server systems compression is vital to minimize the communication channel bandwidth requirements by compressing the transmitted data. Traditionally, compression techniques have been designed to minimize perceptual distortion, i.e., the compressed data was intended to be heard/viewed by humans. However, recently there has been an emergence of applications in which the compressed data is processed by an algorithm. Examples include distributed estimation or classification. In these applications, for best system performance, rather than minimizing perceptual distortion, the compression algorithm should be optimized to have the least effect on the estimation/classification capability of the processing algorithm. In this work novel compression techniques optimized for classification are proposed.; The first application considered is remote speech recognition, where the speech recognizer uses compressed data to recognize the spoken utterance. For this application, a scalable encoder designed to maximize recognition performance is proposed. The scalable encoder is shown to have superior rate-recognition performance compared to conventional speech encoders. Additionally, a scalable recognition system capable of trading off recognition performance for reduced complexity is also proposed. These are useful in distributed speech recognition systems where several clients are accessing a single server and efficient server design becomes important to both reduce the computational complexity and the bandwidth requirement at the server.; The second application considered is distributed classification, where the classifier operates on the compressed and transmitted data to make the class decision. A novel algorithm is proposed which is shown to significant reduce the misclassification penalty with a small sacrifice in distortion performance. The generality of this algorithm is demonstrated by extending it to improve the performance of table-lookup encoders. It is shown that by designing product vector quantizers (PVQ) to approximate a higher dimension vector quantizer (VQ), a significant improvement in PSNR performance over conventional PVQ design is possible while not increasing the encoding time significantly over conventional table-lookup encoding.; Finally, a new distortion metric, mutual information (MI) loss , is proposed for designing quantizers in distributed classification applications. It is shown that the MI loss optimized quantizers are able to provide significant improvements in classification performance when compared to mean square error optimized quantizers. Empirical quantizer design and rate allocation algorithms are provided to optimize quantizers for minimizing MI loss. Additionally, it is shown that the MI loss metric can be used to design quantizers operating on low dimension vectors. This is a vital requirement in classification systems employing high dimension classifiers as it enables design of optimal and practical minimum MI loss quantizers implementable on low complexity/memory clients.
机译:随着移动设备的广泛扩散以及新的多媒体应用程序的爆炸式增长,需要采用客户端-服务器架构以使具有低复杂度/存储器的客户端能够支持复杂的多媒体应用程序。在这些客户端-服务器系统中,压缩对于通过压缩传输的数据来最小化通信信道带宽需求至关重要。传统上,压缩技术已被设计为使感知失真最小化,即,压缩数据旨在被人听到/观看。然而,近来出现了其中通过算法来处理压缩数据的应用。示例包括分布式估计或分类。在这些应用中,为了获得最佳的系统性能,而不是最小化感知失真,应该对压缩算法进行优化,以使其对处理算法的估计/分类能力的影响最小。在这项工作中,提出了针对分类进行优化的新颖压缩技术。所考虑的第一个应用是远程语音识别,其中语音识别器使用压缩数据来识别语音。对于此应用,提出了一种旨在使识别性能最大化的可伸缩编码器。与传统的语音编码器相比,可伸缩编码器具有更高的速率识别性能。另外,还提出了一种能够折衷识别性能以降低复杂度的可扩展识别系统。这些在分布式语音识别系统中非常有用,在分布式语音识别系统中,多个客户端正在访问单个服务器,有效的服务器设计对于降低服务器的计算复杂性和带宽要求非常重要。所考虑的第二个应用是分布式分类,其中分类器对压缩和传输的数据进行操作以做出分类决策。提出了一种新颖的算法,该算法被证明可以显着减少误分类的损失,同时牺牲很小的失真性能。通过扩展算法来提高查表编码器的性能,证明了该算法的普遍性。结果表明,通过设计乘积矢量量化器(PVQ)近似于更高维的矢量量化器(VQ),与传统的PVQ设计相比,可以显着改善PSNR性能,而不会比传统的查表编码显着增加编码时间。最后,提出了一种新的失真度量,互信息(MI)损失,用于设计分布式分类应用中的量化器。结果表明,与均方误差优化的量化器相比,MI损失优化的量化器能够显着提高分类性能。提供了经验化的量化器设计和速率分配算法,以优化量化器以最小化MI损失。另外,示出了MI损耗度量可以用于设计在低维向量上运行的量化器。这对于使用高维分类器的分类系统是至关重要的要求,因为它可以设计可在低复杂度/内存客户端上实现的最佳和实用的最小MI损失量化器。

著录项

  • 作者

    Srinivasamurthy, Naveen.;

  • 作者单位

    University of Southern California.$bElectrical Engineering.;

  • 授予单位 University of Southern California.$bElectrical Engineering.;
  • 学科 Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 2007
  • 页码 185 p.
  • 总页数 185
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 无线电电子学、电信技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号