A System for Automatic Chord Transcription from Audio Using Genre-Specific Hidden Markov Models

机译：使用特定于类型的隐马尔可夫模型从音频自动和弦转录的系统

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

We describe a system for automatic chord transcription from the raw audio using genre-specific hidden Markov models trained on audio-from-symbolic data. In order to avoid enormous amount of human labor required to manually annotate the chord labels for ground-truth, we use symbolic data such as MIDI files to automate the labeling process. In parallel, we synthesize the same symbolic files to provide the models with the sufficient amount of observation feature vectors along with the automatically generated annotations for training. In doing so, we build different models for various musical genres, whose model parameters reveal characteristics specific to their corresponding genre. The experimental results show that the HMMs trained on synthesized data perform very well on real acoustic recordings. It is also shown that when the correct genre is chosen, simpler, genre-specific model yields performance better than or comparable to that of more complex model that is genre-independent. Furthermore, we also demonstrate the potential application of the proposed model to the genre classification task.

机译：我们描述了一种系统，用于使用从符号数据中提取的音频训练的特定于类型的隐藏马尔可夫模型，从原始音频中自动提取和弦。为了避免为地面真实的和弦标签手动添加注释所需的大量人工，我们使用诸如MIDI文件之类的符号数据来自动执行标签处理。同时，我们合成相同的符号文件，以为模型提供足够数量的观察特征向量以及自动生成的用于训练的注释。通过这样做，我们为各种音乐流派建立了不同的模型，其模型参数揭示了其相应流派的特定特征。实验结果表明，在合成数据上训练的HMM在真实的声音记录中表现很好。还显示出，当选择正确的体裁时，与体裁无关的更复杂模型相比，更简单的体裁特定模型所产生的效果更好或更可比。此外，我们还演示了该模型在体裁分类任务中的潜在应用。

著录项

来源
《Adaptive Multimedia Retrieval: Retrieval, User, and Semantics》|2008年|P.134-146|共13页
会议地点 Paris(FR);Paris(FR)
作者
Kyogu Lee;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类多媒体技术与多媒体计算机;
关键词

相似文献

外文文献
中文文献
专利

1. Automatic Transcription of Guitar Chords and Fingering From Audio [J] . Barbancho A.M., Klapuri A., Tardon L.J., Audio, Speech, and Language Processing, IEEE Transactions on . 2012,第3期

机译：吉他和弦的自动转录和音频中的指法
2. Hidden Markov Model Inversion for Audio-to-Visual Conversion in an MPEG-4 Facial Animation System [J] . KYOUNGHO CHOI, YING LUO, JENQ-NENG HWANG Journal of VLSI signal processing . 2001,第1a2期

机译：MPEG-4面部动画系统中用于视听转换的隐马尔可夫模型反演
3. A supervised hidden markov model framework for efficiently segmenting tiling array data in transcriptional and chIP-chip experiments: systematically incorporating validated biological knowledge [J] . Du J, Rozowsky JS, Korbel JO, Bioinformatics . 2006,第24期

机译：一个有监督的隐马尔可夫模型框架，可有效地分割转录和芯片实验中的切片阵列数据：系统地整合经过验证的生物学知识
4. A System for Automatic Chord Transcription from Audio Using Genre-Specific Hidden Markov Models [C] . Kyogu Lee International Workshop on Adaptive Multimedial Retrieval . 2008

机译：使用类型特定的隐马尔可夫模型从音频自动弦转录系统
5. A system for acoustic chord transcription and key extraction from audio using hidden Markov models trained on synthesized audio. [D] . Lee, Kyogu. 2008

机译：一种使用在合成音频上训练的隐马尔可夫模型从音频进行和弦转录和音调提取的系统。
6. Image segmentation for automatic particle identification in electron micrographs based on hidden Markov random field models and expectation maximization [O] . Vivek Singh, Dan C. Marinescu, Timothy S. Baker -1

机译：基于隐马尔可夫随机场模型和期望最大化的电子显微图像中颗粒自动识别的图像分割
7. A System for Automatic Chord Transcription from Audio Using Genre-Specific Hidden Markov Models [O] . Kyogu Lee 2008

机译：一种基于体裁特定隐马尔可夫模型的音频自动和弦转录系统

A System for Automatic Chord Transcription from Audio Using Genre-Specific Hidden Markov Models

摘要

著录项

相似文献

相关主题

期刊订阅