首页> 外国专利> Dilated convolutions and gating for efficient keyword spotting

Dilated convolutions and gating for efficient keyword spotting

机译:扩张卷积和门控有效关键词斑点

摘要

A method for detection of a keyword in a continuous stream of audio signal, by using a dilated convolutional neural network (DCNN), implemented by one or more computers embedded on a device, the dilated convolutional network (DCNN) comprising a plurality of dilation layers (DL), including an input layer (IL) and an output layer (OL), each layer of the plurality of dilation layers (DL) comprising gated activation units, and skip-connections to the output layer (OL), the dilated convolutional network (DCNN) being configured to generate an output detection signal when a predetermined keyword is present in the continuous stream of audio signal, the generation of the output detection signal being based on a sequence (SSM) of successive measurements (SM) provided to the input layer (IL), each successive measurement (SM) of the sequence (SSM) being measured on a corresponding frame from a sequence of successive frames extracted from the continuous stream of audio signal, at a plurality of successive time steps.
机译:通过使用扩张的卷积神经网络(DCNN),通过嵌入在设备上的一个或多个计算机实现的扩张卷积神经网络(DCNN)来检测连续的音频信号中的关键字的方法,扩张的卷积网络(DCNN)包括多个扩张层(DL),包括输入层(IL)和输出层(OL),多个扩张层(DL)的每层包括门控激活单元,并跳过与输出层(OL)的连接,膨胀卷积器当在连续的音频信号中存在预定关键字时,被配置为生成输出检测信号的网络(DCNN),产生输出检测信号的产生基于提供给的连续测量(SM)的序列(SSM)。输入层(IL),序列的每个连续测量(SM)(SSM)在从从连续的音频信号中提取的连续流中提取的连续帧序列测量,在多个SUC中助攻时间步。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号