
METHODS AND SYSTEMS FOR COCKPIT SPEECH RECOGNITION ACOUSTIC MODEL TRAINING WITH MULTI-LEVEL CORPUS DATA AUGMENTATION


Abstract

A method for initializing a device for performing acoustic speech recognition (ASR) using an ASR model, by a computer system including at least one processor and a system memory element. The method includes obtaining a plurality of voice data articulations of predetermined phrases, by the at least one processor via a user interface. The plurality of voice data articulations includes a first quantity of audio samples of actual articulated voice data, and each of the plurality of voice data articulations includes one of the audio samples including acoustic frequency components. The method further includes performing a plurality of augmentations to the plurality of voice data articulations of predetermined phrases, to generate a corpus audio data set that includes the first quantity of audio samples and a second quantity of audio samples including augmented versions of the first quantity of audio samples.
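The corpus-building step described in the abstract can be illustrated with a minimal Python sketch. The abstract does not say which augmentations are applied, so the two used here (additive white noise and a speed change) are assumptions chosen only for illustration, and the names `add_noise`, `change_speed`, and `build_corpus` are hypothetical. The sketch shows only the structure of the result: the original recordings (the "first quantity") are kept, and augmented versions of them (the "second quantity") are added alongside.

```python
import numpy as np


def add_noise(samples, snr_db, rng):
    """Mix in white Gaussian noise at the requested signal-to-noise ratio (assumed augmentation)."""
    signal_power = np.mean(samples ** 2)
    noise_power = signal_power / (10 ** (snr_db / 10))
    noise = rng.normal(0.0, np.sqrt(noise_power), size=samples.shape)
    return samples + noise


def change_speed(samples, factor):
    """Resample by linear interpolation; factor > 1 shortens the clip (assumed augmentation)."""
    n_out = int(round(len(samples) / factor))
    old_idx = np.arange(len(samples))
    new_idx = np.linspace(0, len(samples) - 1, n_out)
    return np.interp(new_idx, old_idx, samples)


def build_corpus(originals, rng):
    """Return the original clips followed by augmented versions of each clip."""
    corpus = list(originals)  # first quantity: the actual recorded articulations
    for clip in originals:    # second quantity: augmented copies of those recordings
        corpus.append(add_noise(clip, snr_db=10.0, rng=rng))
        corpus.append(change_speed(clip, factor=1.1))
    return corpus


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    t = np.arange(16000) / 16000.0  # one second at an assumed 16 kHz sample rate
    recordings = [np.sin(2 * np.pi * f * t) for f in (220.0, 440.0, 880.0)]
    corpus = build_corpus(recordings, rng)
    print(len(recordings), "original clips ->", len(corpus), "clips in corpus")
```

Sine tones stand in for the recorded phrases so that the sketch runs without audio files; a real pipeline would load the recorded articulations instead and would likely apply a wider set of augmentations.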

Bibliographic data

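  • Publication number: US2020335084A1
  • Patent type:
  • Publication date: 2020-10-22
  • Original format: PDF
  • Applicant/assignee: HONEYWELL INTERNATIONAL INC.
  • Application number: US201916388647
  • Inventors: LUNING WANG; WEI YANG; ZHIYONG DAI
  • Filing date: 2019-04-18
  • Classification: G10L15/06; G10L21/0364; B64D43
  • Country: US
  • Database entry time: 2022-08-21 11:24:28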
