首页> 外文会议>International Conference on Neural Information Processing >Speaker Segmentation Based on Subsegmental Features and Neural Network Models
【24h】

Speaker Segmentation Based on Subsegmental Features and Neural Network Models

机译:基于子段特征和神经网络模型的扬声器分割

获取原文

摘要

In this paper, we propose an alternate approach for detecting speaker changes in a multispeaker speech signal. Current approaches for speaker segmentation employ features based on characteristics of the vocal tract system and they rely on the dissimilarity between the distributions of two sets of feature vectors. This statistical approach to a point phenomenon (speaker change) fails when the given conversation involves short speaker turns (< 5 s duration). The excitation source signal plays an important role in characterizing a speakers voice. We use autoassociative neural network (AANN) models to capture the characteristics of the excitation source that are present in the linear prediction (LP) residual of speech signal. The AANN models are then used to detect the speaker changes. Results show that excitation source features provide better evidence for speaker segmentation as compared to vocal tract features.
机译:在本文中,我们提出了一种替代方法,用于检测多方位箱语音信号中的扬声器变化的方法。 当前扬声器分割方法采用基于声道系统特征的特征,并且它们依赖于两组特征向量的分布之间的不相似性。 当给定的对话涉及短扬声器时(<5次持续时间)时,这种点现象(扬声器变化)的统计方法失败。 激励源信号在表征扬声器语音中起着重要作用。 我们使用自动关联神经网络(AANN)模型来捕获在语音信号的线性预测(LP)残余中存在的激励源的特性。 然后使用AANN模型来检测扬声器的变化。 结果表明,与声带特征相比,激励源特征为扬声器分割提供了更好的证据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号