A Speech Enhancement Algorithm Based on Multiple Convolutional Neural Networks in a Speech Recognition System

Abstract

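The present invention relates to CNN-based noise recognition and to a speech enhancement model, SFTRLS-CNN, that combines a CNN with the stabilized fast transversal recursive least squares (SFTRLS) algorithm. First, 648-dimensional features, including MFCCs, are extracted from the noise in a noisy audio segment and fed into a trained first convolutional neural network, which identifies the type of noise environment. Next, the extracted audio features, the signal-to-noise ratio, and the noise-type value are combined into a 658-dimensional feature vector, and a second convolutional neural network adaptively selects the optimal forgetting factor for SFTRLS-based speech enhancement. Finally, SFTRLS performs the noise reduction in each environment. The algorithm makes the enhancement model applicable to different noise environments and improves its adaptability. Compared with conventional SFTRLS, it also achieves better speech quality evaluation scores.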
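To illustrate the role of the forgetting factor mentioned in the abstract, the following is a minimal sketch and not the patented method. It swaps in the conventional exponentially weighted RLS recursion as a stand-in for SFTRLS, and it uses a toy system-identification task instead of real speech. The names `rls_filter` and `demo`, the filter taps, and all default values are assumptions made for illustration. In the scheme the abstract describes, the value passed as `lam` would be the one selected by the second CNN.

```python
import random


def rls_filter(x, d, order=4, lam=0.99, delta=100.0):
    """Conventional exponentially weighted RLS (a stand-in, not SFTRLS).

    x: input samples, d: desired samples, lam: forgetting factor in (0, 1].
    Returns (outputs, a-priori errors, final weights).
    """
    if not 0.0 < lam <= 1.0:
        raise ValueError("forgetting factor must lie in (0, 1]")
    w = [0.0] * order
    # Inverse correlation matrix estimate, initialised to delta * I.
    P = [[delta if i == j else 0.0 for j in range(order)] for i in range(order)]
    u = [0.0] * order  # tapped delay line: u[k] holds x[n - k]
    outputs, errors = [], []
    for xn, dn in zip(x, d):
        u = [xn] + u[:-1]
        Pu = [sum(P[i][j] * u[j] for j in range(order)) for i in range(order)]
        denom = lam + sum(u[i] * Pu[i] for i in range(order))
        k = [v / denom for v in Pu]              # gain vector
        y = sum(w[i] * u[i] for i in range(order))
        e = dn - y                               # a-priori error
        w = [w[i] + k[i] * e for i in range(order)]
        # P <- (P - k u^T P) / lam, using u^T P = (P u)^T because P is symmetric.
        P = [[(P[i][j] - k[i] * Pu[j]) / lam for j in range(order)]
             for i in range(order)]
        outputs.append(y)
        errors.append(e)
    return outputs, errors, w


def demo(lam=0.99, n=500, seed=0):
    """Identify a hypothetical 4-tap FIR system from noise-free data."""
    rng = random.Random(seed)
    h = [0.5, -0.3, 0.2, 0.1]  # hypothetical unknown system, illustration only
    x = [rng.gauss(0.0, 1.0) for _ in range(n)]
    d = [sum(h[k] * x[i - k] for k in range(len(h)) if i - k >= 0)
         for i in range(n)]
    _, errors, w = rls_filter(x, d, order=len(h), lam=lam)
    return h, w, errors


if __name__ == "__main__":
    h, w, errors = demo()
    print("true taps:     ", h)
    print("estimated taps:", [round(v, 4) for v in w])
```

A forgetting factor close to 1 averages over a long history and suits slowly varying noise, while a smaller value tracks changes faster at the cost of noisier estimates. That trade-off is why choosing the factor per noise environment, as the abstract proposes, can matter.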

Bibliographic Data

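Publication/announcement number: CN108172238A
Patent type: Invention patent
Publication/announcement date: 2018-06-15
Original format: PDF
Applicant/patentee: 广州音书科技有限公司
Application/patent number: CN201810012748.1
Inventors: 陈国强; 石城川; 彭驷庆
Filing date: 2018-01-06
Classification: G10L21/0264 (2013.01); G10L25/30 (2013.01)
Address: 510006 Room 26 of Room 106, First Floor, South China University of Technology Library Building, Guangzhou Higher Education Mega Center, Xiaoguwei Subdistrict, Panyu District, Guangzhou, Guangdong
Added to database: 2023-06-19 05:42:43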

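Legal Information

Announcement date: 2018-08-10
Legal status: Entry into substantive examination
Details: Entry into substantive examination; IPC (main classification): G10L21/0264; filing date: 2018-01-06

Announcement date: 2018-06-15
Legal status: Publication
Details: Publication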

Similar Documents

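1. A Speech Enhancement Algorithm Based on Multiple Convolutional Neural Networks in a Speech Recognition System [P]. Chinese patent: CN108172238B. 2021-08-13
2. A Speech Enhancement Algorithm Based on Multiple Convolutional Neural Networks in a Speech Recognition System [P]. Chinese patent: CN108172238A. 2018-06-15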
3. An Electronic Device for Playing a Reel-Based Game with Mini-Reels. The present invention is a device embodying a reel-based game having a plurality of reels and a plurality of mini-reels. The mini-reels replace one or more of the plurality of reels or may replace one or more symbol-bearing positions of the reels. The added feature of the plurality of mini-reels enables the possible attainment of a greater number of symbol combinations and winning outcomes by replacing standard paylines associated with the reels or reel position with sets of paylines that cover all mini-reel based outcomes. [P]. AU2014203255A1. 2015-01-22
4. System and Method of Video Telecommunication to Compress and Decompress Digital Color Video Data. The present invention relates to a method for compressing digital color video data in a video telecommunication system that has a means for generating a video signal for a plurality of color video frames, with every frame image consisting of a plurality of scanning lines composed of a plurality of pixels, and each pixel in the image frame consisting of digital color components. The method comprises the steps of (a) determining a luminance function for each pixel based on at least one of the three digital color components; (b) identifying at least one decision parameter for at least a significant portion of pixels in the scan lines of a current image frame based on the difference of the function in luminance between the pixels at a predetermined distance from at least one pixel in each scan line; and at least a (c) comparison of decision parameter with [P]. MX166516B. 1993-01-11
5. A method and apparatus for providing an integrated feature map using a plurality of output ensembles from convolutional neural networks [P]. JP6863619B2. 2021-04-21