What’s all the Fuss about Free Universal Sound Separation Data?

机译：关于免费通用声音分离数据的所有大惊小怪是什么？

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We introduce the Free Universal Sound Separation (FUSS) dataset, a new corpus for experiments in separating mixtures of an unknown number of sounds from an open domain of sound types. The dataset consists of 23 hours of single-source audio data drawn from 357 classes, which are used to create mixtures of one to four sources. To simulate reverberation, an acoustic room simulator is used to generate impulse responses of box-shaped rooms with frequency-dependent reflective walls. Additional open-source data augmentation tools are also provided to produce new mixtures with different combinations of sources and room simulations. Finally, we introduce an open-source baseline separation model, based on an improved time-domain convolutional network (TDCN++), that can separate a variable number of sources in a mixture. This model achieves 9.8 dB of scale-invariant signal-to-noise ratio improvement (SI-SNRi) on mixtures with two to four sources, while reconstructing single-source inputs with 35.8 dB absolute SI-SNR. We hope this dataset will lower the barrier to new research and allow for fast iteration and application of novel techniques from other machine learning domains to the sound separation challenge.

机译：我们介绍了自由的通用声音分离（FUSS）DataSet，一种新的语料库，用于从声音类型的开放域分离未知数量的声音的混合物。 DataSet由33小时的单源音频数据组成，从357类绘制，用于创建一个到四个源的混音。为了模拟混响，声学室模拟器用于产生带频率依赖性反射壁的盒形房间的脉冲响应。还提供了附加的开源数据增强工具，以产生具有不同组合的新混合物和房间模拟。最后，我们引入了一个基于改进的时域卷积网络（TDCN ++）的开源基线分离模型，其可以将变量数量分离在混合物中。该模型在具有两到四个源的混合物上实现了9.8 dB的比例不变信噪比改进（SI-SNRI），同时重建35.8 dB绝对SI-SNR的单源输入。我们希望这个数据集将降低新研究的障碍，并允许从其他机器学习域中快速迭代和应用新颖的技术到声音分离挑战。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2021年|186-190|共5页
会议地点
作者
Scott Wisdom; Hakan Erdogan; Daniel P. W. Ellis; Romain Serizel; Nicolas Turpault; Eduardo Fonseca; Justin Salamon; Prem Seetharaman; John R. Hershey;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Shape; Machine learning; Tools; Data models; Reverberation; Time-domain analysis; Task analysis;

机译：形状;机器学习;工具;数据模型;混响;时间域分析;任务分析;

相似文献

外文文献
中文文献
专利

1. Land Surface Temperature and Emissivity Separation from Cross-Track Infrared Sounder Data with Atmospheric Reanalysis Data and ISSTES Algorithm [J] . Yu-Ze Zhang, Xiao-Guang Jiang, HuaWu, Advances in Meteorology . 2017,第Pta3期

机译：用大气再分析数据和ISSTES算法从跨轨道红外发声器数据和isstes算法的陆地表面温度和发射率分离
2. Universal experimental test for the role of free charge carriers in the thermal Casimir effect within a micrometer separation range [J] . G. Bimonte, G. L. Klimchitskaya, V. M. Mostepanenko Physical Review, A . 2017,第5aPta1期

机译：自由电荷载体在千分尺分离范围内的热Casimir效应中的作用的通用实验试验
3. A new and universal free/bound separation technique for the "CENTRIA" automated radioimmunoassay system. [J] . B Mériadec, J P Jolu, R Henry Clinical Chemistry: Journal of the American Association for Clinical Chemists . 1979,第9期

机译：一种用于“ CENTRIA”自动放射免疫分析系统的新型通用自由/结合分离技术。
4. Improving Universal Sound Separation Using Sound Classification [C] . Efthymios Tzinis, Scott Wisdom, John R. Hershey, IEEE International Conference on Acoustics, Speech and Signal Processing . 2020

机译：使用声音分类改善通用声音分离
5. Interaction of sound with sound by novel mechanisms: Ultrasonic four-wave mixing mediated by a suspension and ultrasonic three-wave mixing at a free surface. [D] . Simpson, Harry Jay. 1992

机译：声音和声音通过新机制相互作用：通过悬浮作用介导的超声四波混合和在自由表面进行超声三波混合。
6. Universal Linear Fit Identification: A Method Independent of Data Outliers and Noise Distribution Model and Free of Missing or Removed Data Imputation [O] . K. K. L. B. Adikaram, M. A. Hussein, M. Effenberger, -1

机译：通用线性拟合识别：一种独立于数据离群值和噪声分布模型且无缺失或缺失数据插补的方法
7. Improving Universal Sound Separation Using Sound Classification [O] . Efthymios Tzinis, Scott Wisdom, John R. Hershey, 2020

机译：使用声音分类改善通用声音分离

What’s all the Fuss about Free Universal Sound Separation Data?

摘要

著录项

相似文献

相关主题

期刊订阅