首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Vocal timbre analysis using latent Dirichlet allocation and cross-gender vocal timbre similarity
【24h】

Vocal timbre analysis using latent Dirichlet allocation and cross-gender vocal timbre similarity

机译:使用潜在Dirichlet分配和跨性别的人声音色相似度进行人声音色分析

获取原文

摘要

This paper presents a vocal timbre analysis method based on topic modeling using latent Dirichlet allocation (LDA). Although many works have focused on analyzing characteristics of singing voices, none have dealt with “latent” characteristics (topics) of vocal timbre, which are shared by multiple singing voices. In the work described in this paper, we first automatically extracted vocal timbre features from polyphonic musical audio signals including vocal sounds. The extracted features were used as observed data, and mixing weights of multiple topics were estimated by LDA. Finally, the semantics of each topic were visualized by using a word-cloud-based approach. Experimental results for a singer identification task using 36 songs sung by 12 singers showed that our method achieved a mean reciprocal rank of 0.86. We also proposed a method for estimating cross-gender vocal timbre similarity by generating pitch-shifted (frequency-warped) signals of every singing voice. Experimental results for a cross-gender singer retrieval task showed that our method discovered interesting similar pitch-shifted singers.
机译:本文提出了一种基于基于潜在狄利克雷分配(LDA)主题建模的人声音色分析方法。尽管许多作品都专注于分析歌声的特征,但没有一件作品涉及人声音色的“潜在”特征(主题),这些特征被多个歌声共享。在本文所述的工作中,我们首先自动从包括人声在内的复音音乐音频信号中提取人声音色特征。提取的特征用作观察数据,并通过LDA估算多个主题的混合权重。最后,使用基于词云的方法将每个主题的语义可视化。使用12位歌手演唱的36首歌曲进行歌手识别任务的实验结果表明,我们的方法获得的平均倒数排名为0.86。我们还提出了一种方法,可以通过生成每个歌声的音高偏移(频率扭曲)信号来估计跨性别的人声音色相似度。跨性别歌手检索任务的实验结果表明,我们的方法发现了有趣的类似音高变化的歌手。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号