【24h】

Classification and recognition with direct segment models

机译:使用直接细分模型进行分类和识别

获取原文

摘要

Segment based direct models have recently been used to improve the output of existing state-of-the-art speech recognizers. To date, however, they have relied on an existing HMM system to provide segment boundaries. This paper takes initial steps at using these models on their own, first by developing a segment-based maximum entropy phone classifier, and then by utilizing the features in a segmental conditional random field for recognition. To produce a feature representation that is independent of segment length, we utilize a set of ngram features based on vector-quantized representations of the acoustic input. We find that the models are able to integrate information at different granularities and from different streams. Contextual information from around the segment boundaries is particularly important. We obtain competitive results for TIMIT phone classification, and present initial recognition results.
机译:最近,基于片段的直接模型已用于改善现有的最新语音识别器的输出。但是,迄今为止,他们已经依靠现有的HMM系统来提供路段边界。本文首先采取了自己使用这些模型的步骤,首先是开发基于分段的最大熵电话分类器,然后利用分段条件随机字段中的特征进行识别。为了产生与片段长度无关的特征表示,我们基于声音输入的矢量量化表示利用了一组ngram特征。我们发现这些模型能够以不同的粒度和不同的流集成信息。来自段边界周围的上下文信息尤其重要。我们获得TIMIT手机分类的竞争结果,并提供初步的识别结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号