首页> 外文会议>International conference on advances in computing, communications and informatics >Multi-band sum of spectrogram based audio fingerprinting of Indian film songs for multi-lingual song retrieval
【24h】

Multi-band sum of spectrogram based audio fingerprinting of Indian film songs for multi-lingual song retrieval

机译:基于频谱图的多频带总和的印度电影歌曲音频指纹识别,用于多语言歌曲检索

获取原文

摘要

Film music compositions are highly diversified, exhibiting not just changes in background scores and singer's voices, but even the lyrical embellishments are morphed into different languages to suit regional audiences. Given this diversified prevalence amongst recorded film music, retrieval becomes extremely challenging. In this paper we propose an approach based on a multi-band sum of spectrogram, executing a delicate tradeoff between sensitivity to pitch jitters incurred by lyrical and singer voice changes while keeping the melodic signature intact. The top-3 retrieval accuracy for the multi-band sum of spectrogram has been found to be around 91% for an STFT window size of 128ms.
机译:电影音乐作品高度多样化,不仅表现出背景乐谱和歌手声音的变化,而且甚至抒情的装饰物也变身为不同的语言,以适应当地观众的需求。鉴于已录制电影音乐中的这种普遍流行,检索变得极具挑战性。在本文中,我们提出了一种基于频谱图多频带总和的方法,在保持对乐曲和歌手声音变化的音高抖动敏感度之间进行微妙的权衡,同时保持旋律签名完整。对于128ms的STFT窗口大小,已发现多波段频谱图总和的前3位检索精度约为91%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号