首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Classification and recognition with direct segment models
【24h】

Classification and recognition with direct segment models

机译:直接段模型分类和识别

获取原文

摘要

Segment based direct models have recently been used to improve the output of existing state-of-the-art speech recognizers. To date, however, they have relied on an existing HMM system to provide segment boundaries. This paper takes initial steps at using these models on their own, first by developing a segment-based maximum entropy phone classifier, and then by utilizing the features in a segmental conditional random field for recognition. To produce a feature representation that is independent of segment length, we utilize a set of ngram features based on vector-quantized representations of the acoustic input. We find that the models are able to integrate information at different granularities and from different streams. Contextual information from around the segment boundaries is particularly important. We obtain competitive results for TIMIT phone classification, and present initial recognition results.
机译:基于段的直接模型最近用于改善现有最先进的语音识别器的输出。但是,迄今为止,它们依赖于现有的HMM系统来提供分段边界。本文首先通过开发基于段的最大熵电话分类器,然后利用分段条件随机字段中的特征来使用这些模型,首先使用这些模型。为了生成独立于段长度的特征表示,我们利用了基于声学输入的矢量量化表示的一组ngram特征。我们发现该模型能够以不同的粒度和不同的流集成信息。来自段边界周围的上下文信息尤为重要。我们获得了Timit Phone Classification的竞争结果,并提出了初始识别结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号