首页> 外文会议>Grace Hopper Celebration India Conference >Music genre classification using multi-modal deep learning based fusion
【24h】

Music genre classification using multi-modal deep learning based fusion

机译:音乐流派分类使用多模态深度基于融合的分类

获取原文

摘要

Music genre classification is extensively used in almost all music streaming applications and websites. Most of them use it either to recommend playlists to their customers (such as Spotify, Soundcloud) or simply as a product (e.g. Shazam and MusixMatch). In this paper, we present a novel approach to classify a given song by encoding both textual and music features. The contribution of this work is twofold, i) We propose a multi modal fusion network approach which enables music genre classification utilizing both the textual features (lyrics) and musical features (mel spectrogram) achieving an accuracy of 90.4%. ii) We also propose a multiframe convolutional recurrent neural network (CRNN) based classifier that uses K-nearest neighbor approach over the predictions of every frame to predict the genre of a given song. In multi-modal fusion approach, we utilize co-attention between the textual and musical features for training classification network. The advantage of CRNN based multi frame approach is that it not only enriches the classification process but also enables to generate more training data from a smaller number of music files and thus helps in data augmentation. Our models and code are available on https://github.com/laishawadhwa/Multi-modal-music-genre-classification.
机译:几乎所有音乐流应用和网站都广泛使用音乐类型分类。他们中的大多数人都使用它来推荐给客户的播放列表(例如Spotify,SoundCloud)或简单地作为产品(例如Shazam和MusixMatch)。在本文中,我们提出了一种通过编码文本和音乐功能来分类给定歌的新方法。这项工作的贡献是双重的,i)我们提出了一种多模态融合网络方法,它能够利用文本特征(歌词)和音乐特征(MEL谱图)实现90.4%的精确度。 ii)我们还提出了一种基于多帧卷积经常性神经网络(CRNN)的分类器,该分类器使用K-最近邻近对每个帧的预测来预测给定歌曲的类型。在多模态融合方法中,我们利用文本和音乐特征之间的共同关注进行培训分类网络。基于CRNN的多帧方法的优点是它不仅丰富了分类过程,还可以从较少数量的音乐文件生成更多培训数据,从而有助于数据增强。我们的模型和代码可在https://github.com/laishawadhwa/multi-modal-genre-classification上获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号