Research on Generative Adversarial Networks for Speech Enhancement

Sun Chengli; Wang Haiwu


Abstract

With the rise of artificial intelligence, a variety of deep learning models have emerged, and generative adversarial networks (GANs) have become a research hotspot among them. GANs have been applied successfully to image processing, but how to apply them to speech enhancement still needs study. Applying a GAN to speech enhancement follows the same principle as the GAN itself: two models are constructed, a generative model and a discriminative model, also called the generator and the discriminator, and the two are trained by competing against each other. The ultimate goal of a GAN is to generate new data; in speech enhancement, that means producing denoised speech. This paper studies the application of GANs to speech enhancement. It models speech enhancement with the conventional GAN mathematical model, improves that model by adding a sparse factor, and compares the GAN-enhanced speech with conventional speech enhancement methods. The experimental results show that GAN-enhanced speech scores higher on segSNR (segmental signal-to-noise ratio) and PESQ (Perceptual Evaluation of Speech Quality) than the conventional methods, which indicates that the GAN approach is superior to them.
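
The abstract reports segSNR as one of its two evaluation scores but does not state the formula or settings used. As a rough illustration only, the sketch below computes segmental SNR in its commonly used form: the per-frame SNR in dB between the clean reference and the enhanced signal, clamped to a fixed range and averaged over frames. The function name `segmental_snr`, the 256-sample non-overlapping frames, and the [-10, 35] dB clamp are assumptions chosen for this example, not values taken from the paper.

```python
import math


def segmental_snr(clean, enhanced, frame_len=256, min_db=-10.0, max_db=35.0):
    """Average per-frame SNR in dB between a clean and an enhanced signal.

    frame_len, min_db and max_db are illustrative defaults (assumptions),
    not settings reported in the paper.
    """
    if len(clean) != len(enhanced):
        raise ValueError("signals must have the same length")

    eps = 1e-10  # avoids division by zero and log of zero
    frame_snrs = []

    # Walk over non-overlapping frames; a trailing partial frame is ignored.
    for start in range(0, len(clean) - frame_len + 1, frame_len):
        c = clean[start:start + frame_len]
        e = enhanced[start:start + frame_len]

        signal_energy = sum(x * x for x in c)
        noise_energy = sum((x - y) ** 2 for x, y in zip(c, e))

        snr_db = 10.0 * math.log10((signal_energy + eps) / (noise_energy + eps))

        # Clamp each frame so silent or perfectly matched frames
        # do not dominate the average.
        frame_snrs.append(min(max(snr_db, min_db), max_db))

    if not frame_snrs:
        raise ValueError("signal is shorter than one frame")

    return sum(frame_snrs) / len(frame_snrs)
```

For instance, an enhanced signal equal to the clean reference scaled by 0.9 leaves a residual with 1% of each frame's energy, so every frame scores about 20 dB under these assumptions.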

Bibliographic details

Source
Computer Technology and Development (《计算机技术与发展》) | 2019, No. 2 | pp. 152-156, 161 | 6 pages
Authors
Sun Chengli; Wang Haiwu
Author affiliation

School of Information Engineering, Nanchang Hangkong University, Nanchang, Jiangxi 330063 (both authors)

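Original format: PDF
Language of text: Chinese
Chinese Library Classification: Computer software
Keywords
artificial intelligence; generative adversarial networks; generator; discriminator; speech enhancement
Date indexed: 2023-07-24 21:47:18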

