【24h】

Generating Music Medleys via Playing Music Puzzle Games

机译:通过播放音乐益智游戏生成音乐Medleys

获取原文

摘要

Generating music medleys is about finding an optimal permutation of a given set of music clips. Toward this goal, we propose a self-supervised learning task, called the music puzzle game, to train neural network models to learn the sequential patterns in music. In essence, such a game requires machines to correctly sort a few multisecond music fragments. In the training stage, we learn the model by sampling multiple non-overlapping fragment pairs from the same songs and seeking to predict whether a given pair is consecutive and is in the correct chronological order. For testing, we design a number of puzzle games with different difficulty levels, the most difficult one being music medley, which requiring sorting fragments from different songs. On the basis of state-of-the-art Siamese convolutional network, we propose an improved architecture that learns to embed frame-level similarity scores computed from the input fragment pairs to a common space, where fragment pairs in the correct order can be more easily identified. Our result shows that the resulting model, dubbed as the similarity embedding network (SEN), performs better than competing models across different games, including music jigsaw puzzle, music sequencing, and music medley. Example results can be found at our project website, https://remyhuang.github.io/DJnet.
机译:生成音乐Medleys是关于找到给定的一组音乐剪辑的最佳排列。为了实现这一目标,我们提出了一种称为音乐益智游戏的自我监督的学习任务,培训神经网络模型,以学习音乐中的连续模式。从本质上讲,这种游戏需要机器正确地排序一些多重音乐碎片。在培训阶段,我们通过从相同的歌曲中采样多个非重叠片段对并寻求预测给定对是连续的,以正确的按时间顺序进行预测。对于测试,我们设计了许多具有不同难度级别的益智游戏,最困难的是音乐混合的乐趣,这需要从不同歌曲中排序片段。在最先进的暹罗卷积网络的基础上,我们提出了一种改进的架构,该架构学习将从输入片段对计算的帧级相似度分数嵌入到公共空间,其中换流顺序的片段对可以更多容易识别。我们的结果表明,由此产生的模型称为相似性嵌入网络(SEN),比在不同游戏中的竞争模型更好,包括音乐拼图,音乐测序和音乐混合。可以在我们的项目网站,https://remyhuang.github.io/djnet中找到示例结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号