首页> 外文会议>6th International Conference on Spoken Language Processing ICSLP 2000 Oct.16-Oct.20 2000 Beijing International Convention Center, Beijing, China >A Combined Adaptive And Decision Tree based Speech Spearation Technique for Telemedicine applications
【24h】

A Combined Adaptive And Decision Tree based Speech Spearation Technique for Telemedicine applications

机译:基于自适应和决策树相结合的语音分离技术在远程医疗中的应用

获取原文

摘要

We present a novel technique for separation of doctor and patient's speech i nconversations over a telemedicine network. The mixed speech signals acquired at doctor's site is first broken into single talkers' speech segments and background by uisng thresholds of energy and duration. The speech segments are then identified as spoken by doctor or patient in two steps. In the first step, Gaussian mixture models (GMM) of doctor and patient are used, where the docotor's model is obtained fro mhis/her training speech, and the patient's model is initialized by a general speaker model and hten adapted by the patient's speech. In the second step, a decision tree that uses contextual and confidence features is applied to refine the identification results. Preliminary experiemnts were performed on three data sets collected in telemedicine. Without adaptation and decision tree, error rates at the segment-level and frame-level were 25.44
机译:我们提出了一种通过远程医疗网络将医生和患者的语音对话分离的新技术。首先,通过设置能量和持续时间的阈值,将在医生现场获得的混合语音信号分解为单个讲话者的语音片段和背景。然后,通过两个步骤将语音片段识别为医生或患者所说的话。第一步,使用医生和患者的高斯混合模型(GMM),在该模型中,他/她的训练语音获得了医生的模型,患者的模型由一般说话者模型初始化,并根据患者的语音进行调整。在第二步中,将使用上下文和置信度特征的决策树应用于优化识别结果。对远程医疗中收集的三个数据集进行了初步实验。如果没有适应和决策树,则段级别和帧级别的错误率均为25.44

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号