首页> 外国专利> USER SPECIFIED KEYWORD SPOTTING USING LONG SHORT TERM MEMORY NEURAL NETWORK FEATURE EXTRACTOR

USER SPECIFIED KEYWORD SPOTTING USING LONG SHORT TERM MEMORY NEURAL NETWORK FEATURE EXTRACTOR

机译:用户使用长短期记忆神经网络特征提取器指定关键字

摘要

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for recognizing keywords using a long short term memory neural network. One of the methods includes receiving, by a device for each of multiple variable length enrollment audio signals, a respective plurality of enrollment feature vectors that represent features of the respective variable length enrollment audio signal, processing each of the plurality of enrollment feature vectors using a long short term memory (LSTM) neural network to generate a respective enrollment LSTM output vector for each enrollment feature vector, and generating, for the respective variable length enrollment audio signal, a template fixed length representation for use in determining whether another audio signal encodes another spoken utterance of the enrollment phrase by combining at most a quantity k of the enrollment LSTM output vectors for the enrollment audio signal.
机译:方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于使用长期短期记忆神经网络识别关键字。该方法之一包括:由设备针对多个可变长度登记音频信号中的每一个接收代表各个可变长度登记音频信号的特征的相应的多个登记特征向量,使用多个信号处理每个登记特征向量。长短期记忆(LSTM)神经网络为每个注册特征向量生成相应的注册LSTM输出向量,并为相应的可变长度注册音频信号生成模板固定长度表示形式,用于确定另一个音频信号是否编码另一个音频信号通过组合最多为注册音频信号的注册LSTM输出向量的数量k,获得注册短语的口语。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号