首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Unsupervised neural network based feature extraction using weak top-down constraints
【24h】

Unsupervised neural network based feature extraction using weak top-down constraints

机译:基于弱自顶向下约束的基于无监督神经网络的特征提取

获取原文

摘要

Deep neural networks (DNNs) have become a standard component in supervised ASR, used in both data-driven feature extraction and acoustic modelling. Supervision is typically obtained from a forced alignment that provides phone class targets, requiring transcriptions and pronunciations. We propose a novel unsupervised DNN-based feature extractor that can be trained without these resources in zero-resource settings. Using unsupervised term discovery, we find pairs of isolated word examples of the same unknown type; these provide weak top-down supervision. For each pair, dynamic programming is used to align the feature frames of the two words. Matching frames are presented as input-output pairs to a deep autoencoder (AE) neural network. Using this AE as feature extractor in a word discrimination task, we achieve 64% relative improvement over a previous state-of-the-art system, 57% improvement relative to a bottom-up trained deep AE, and come to within 23% of a supervised system.
机译:深度神经网络(DNN)已成为受监督ASR的标准组件,用于数据驱动的特征提取和声学建模。监督通常是从提供电话类目标的强制对齐中获得的,这些目标需要转录和发音。我们提出了一种新颖的无监督基于DNN的特征提取器,无需零资源设置中的这些资源即可对其进行训练。使用无监督术语发现,我们发现了成对的相同未知类型的孤立单词对。这些提供了薄弱的自上而下的监管。对于每一对,使用动态编程来对齐两个单词的特征帧。匹配帧作为输入输出对呈现给深度自动编码器(AE)神经网络。使用此AE作为单词识别任务中的特征提取器,我们比以前的先进系统实现了64%的相对改进,相对于自下而上训练的深度AE达到了57%的改进,并且在23%的范围内监督系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号