首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2010 >Multi-Pitch Estimation by a Joint 2-D Representation of Pitch and Pitch Dynamics
【24h】

Multi-Pitch Estimation by a Joint 2-D Representation of Pitch and Pitch Dynamics

机译:节距和节距动力学的联合二维表示的多节距估计

获取原文

摘要

Multi-pitch estimation of co-channel speech is especially challenging when the underlying pitch tracks are close in pitch value (e.g., when pitch tracks cross). Building on our previous work in [1], we demonstrate the utility of a two-dimensional (2-D) analysis method of speech for this problem by exploiting its joint representation of pitch and pitch-derivative information from distinct speakers. Specifically, we propose a novel multi-pitch estimation method consisting of 1) a data-driven classifier for pitch candidate selection, 2) local pitch and pitch-derivative estimation by k-means clustering, and 3) a Kalman filtering mechanism for pitch tracking and assignment. We evaluate our method on a database of all-voiced speech mixtures and illustrate its capability to estimate pitch tracks in cases where pitch tracks are separate and when they are close in pitch value (e.g., at crossings).
机译:当基础音调轨道的音调值接近时(例如,当音调轨道交叉时),同频道语音的多音调估计尤其具有挑战性。基于我们在[1]中的先前工作,我们通过利用其对不同说话者的音高和音高导数信息的联合表示,证明了针对该问题的二维(2-D)语音分析方法的实用性。具体而言,我们提出了一种新颖的多音高估计方法,该方法包括:1)用于音高候选选择的数据驱动分类器; 2)通过k-means聚类进行局部音高和音高微分估计; 3)用于音高跟踪的卡尔曼滤波机制和分配。我们在全语音混合语音数据库中评估了我们的方法,并说明了在音高音轨分开且音高值接近时(例如在交叉路口)的情况下估计音高音轨的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号