首页> 外文会议>Adaptive Multimedia Retrieval: Retrieval, User, and Semantics >A System for Automatic Chord Transcription from Audio Using Genre-Specific Hidden Markov Models
【24h】

A System for Automatic Chord Transcription from Audio Using Genre-Specific Hidden Markov Models

机译:使用特定于类型的隐马尔可夫模型从音频自动和弦转录的系统

获取原文
获取原文并翻译 | 示例

摘要

We describe a system for automatic chord transcription from the raw audio using genre-specific hidden Markov models trained on audio-from-symbolic data. In order to avoid enormous amount of human labor required to manually annotate the chord labels for ground-truth, we use symbolic data such as MIDI files to automate the labeling process. In parallel, we synthesize the same symbolic files to provide the models with the sufficient amount of observation feature vectors along with the automatically generated annotations for training. In doing so, we build different models for various musical genres, whose model parameters reveal characteristics specific to their corresponding genre. The experimental results show that the HMMs trained on synthesized data perform very well on real acoustic recordings. It is also shown that when the correct genre is chosen, simpler, genre-specific model yields performance better than or comparable to that of more complex model that is genre-independent. Furthermore, we also demonstrate the potential application of the proposed model to the genre classification task.
机译:我们描述了一种系统,用于使用从符号数据中提取的音频训练的特定于类型的隐藏马尔可夫模型,从原始音频中自动提取和弦。为了避免为地面真实的和弦标签手动添加注释所需的大量人工,我们使用诸如MIDI文件之类的符号数据来自动执行标签处理。同时,我们合成相同的符号文件,以为模型提供足够数量的观察特征向量以及自动生成的用于训练的注释。通过这样做,我们为各种音乐流派建立了不同的模型,其模型参数揭示了其相应流派的特定特征。实验结果表明,在合成数据上训练的HMM在真实的声音记录中表现很好。还显示出,当选择正确的体裁时,与体裁无关的更复杂模型相比,更简单的体裁特定模型所产生的效果更好或更可比。此外,我们还演示了该模型在体裁分类任务中的潜在应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号