Conference: IEEE International Symposium on Multimedia

Kara1k: A Karaoke Dataset for Cover Song Identification and Singing Voice Analysis

Abstract

We introduce Kara1k, a new music dataset of 2,000 analyzed songs, built through a partnership with a karaoke company. The dataset consists of 1,000 cover songs provided by the Recisio Karafun application and the corresponding 1,000 songs by the original artists. Kara1k is intended mainly for cover song identification and singing voice analysis. For both tasks it opens up novel approaches, because each cover song is a studio recording with the same arrangement as the original recording but with different singers and musicians. Essentia, harmony-analyser, Marsyas, Vamp plugins and YAAFE were used to extract audio features for each track in Kara1k. We provide metadata such as the title, genre, original artist, year and International Standard Recording Code, along with ground truths for the singer's gender, backing vocals, duets and the language of the lyrics. Additionally, we provide the instrumental track and the pure singing voice track for each cover song. We showcase two use-case experiments for Kara1k. In the cover song identification task, using the Dynamic Time Warping method, we compare traditional and new features: chroma and MFCC features, chords and keys, and chroma and chord distances. We obtain 84-89% identification accuracy for three of the features, which justifies our focus on karaoke songs. In the supporting experiment on singer gender classification, we evaluate the difference in performance between two conditions: pure singing voice, and singing voice mixed with the background music. The Kara1k dataset is freely available on the KaraMIR project website.
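
As a rough illustration of the cover song identification experiment mentioned in the abstract, the sketch below compares two feature sequences with Dynamic Time Warping and picks the closest original for a given cover. It is not the authors' pipeline: the abstract does not state the frame distance, the normalization or the ranking rule, so the cosine frame distance, the division by the summed sequence lengths, and the helper names `cosine_distance`, `dtw_distance` and `identify_original` are all assumptions made for this example. The input is assumed to be a list of per-frame feature vectors (for instance 12-dimensional chroma frames) already extracted by some other tool.

```python
import math


def cosine_distance(u, v):
    """Cosine distance between two feature frames (assumed frame metric)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 1.0  # treat a silent (all-zero) frame as maximally distant
    return 1.0 - dot / (norm_u * norm_v)


def dtw_distance(seq_a, seq_b, frame_distance=cosine_distance):
    """Classic DTW alignment cost between two sequences of frames.

    The cost is divided by len(seq_a) + len(seq_b) so that long and short
    songs are comparable; this normalization is an assumption.
    """
    n, m = len(seq_a), len(seq_b)
    if n == 0 or m == 0:
        raise ValueError("both sequences must contain at least one frame")
    # cost[i][j] = cheapest alignment of the first i frames of seq_a
    # with the first j frames of seq_b.
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_distance(seq_a[i - 1], seq_b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j],      # stretch seq_b
                                 cost[i][j - 1],      # stretch seq_a
                                 cost[i - 1][j - 1])  # advance both
    return cost[n][m] / (n + m)


def identify_original(cover, originals):
    """Return the key of the original whose features are closest to the cover.

    `originals` is a hypothetical dict mapping a song id to its frame list.
    """
    return min(originals, key=lambda name: dtw_distance(cover, originals[name]))


# Toy two-dimensional "features" standing in for real chroma frames.
song_a = [[1, 0], [0, 1]]
song_c = [[0, 1], [1, 0]]
cover_of_a = [[1, 0], [1, 0], [0, 1]]  # same content as song_a, slower start
print(identify_original(cover_of_a, {"song_a": song_a, "song_c": song_c}))
```

Because DTW allows frames to be repeated, the slower cover still aligns with `song_a` at zero cost, which is why tempo differences between a cover and its original do not by themselves prevent a match.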
