
Leveraging the speaker and noise space for effective in-set/out-of-set speaker recognition.


Abstract

This study addresses the problem of identifying in-set versus out-of-set speakers in noise, using speech segments of limited train/test duration, in situations where rapid detection and tracking are required. The objective is to decide whether the current input speaker is accepted as a member of the enrolled in-set group or rejected as an outside speaker. A new scoring algorithm is developed that combines scores across an energy-frequency grid, fusing scores from high-energy, speaker-dependent frames with weighted scores from low-energy, noise-dependent frames. By leveraging the balance between the speaker and the background noise environment, it is possible to improve overall equal error rate (EER) performance. Using speakers from the TIMIT database with 5 seconds of training data and 2 seconds of test data, the average optimum relative EER improvement for the proposed full selective leveraging approach is 31.6%. The results confirm that, when the background environment type remains constant between training and testing, an in-set/out-of-set speaker recognition system can be formulated that takes advantage of information gathered from the environmental noise and realizes a significant improvement.
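The fusion and evaluation ideas in the abstract can be illustrated with a small sketch. This is not the thesis's algorithm: the abstract does not give the fusion rule, the energy-frequency grid layout, or the weights, so the code below assumes a single energy threshold (no frequency dimension), a simple mean-plus-weighted-mean fusion, and a threshold-sweep EER estimate. All function and parameter names (`fused_score`, `energy_threshold`, `noise_weight`, and so on) are hypothetical.

```python
from statistics import mean


def fused_score(speaker_scores, noise_scores, energies, energy_threshold, noise_weight):
    """Assumed fusion rule: mean speaker score over high-energy frames plus
    a weighted mean noise score over low-energy frames."""
    high = [s for s, e in zip(speaker_scores, energies) if e >= energy_threshold]
    low = [n for n, e in zip(noise_scores, energies) if e < energy_threshold]
    speaker_part = mean(high) if high else 0.0
    noise_part = mean(low) if low else 0.0
    return speaker_part + noise_weight * noise_part


def equal_error_rate(in_set_scores, out_of_set_scores):
    """Approximate the EER by sweeping the decision threshold over all
    observed scores and taking the point where the false-accept and
    false-reject rates are closest."""
    best_gap, eer = float("inf"), None
    for t in sorted(set(in_set_scores) | set(out_of_set_scores)):
        # False reject: an in-set trial scoring below the threshold.
        frr = sum(s < t for s in in_set_scores) / len(in_set_scores)
        # False accept: an out-of-set trial scoring at or above the threshold.
        far = sum(s >= t for s in out_of_set_scores) / len(out_of_set_scores)
        gap = abs(far - frr)
        if gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2
    return eer


def relative_improvement(baseline_eer, proposed_eer):
    """Relative EER reduction, as a fraction of the baseline EER."""
    return (baseline_eer - proposed_eer) / baseline_eer
```

Under this definition, the 31.6% figure quoted in the abstract would correspond to `relative_improvement(...)` returning about 0.316. The abstract does not state the underlying baseline and proposed EER values, so none are assumed here.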

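Bibliographic record

  • Author

    Leonard, Matthew Ryan

  • Author affiliation

    The University of Texas at Dallas

  • Degree-granting institution: The University of Texas at Dallas
  • Discipline: Engineering, Electronics and Electrical
  • Degree: M.S.E.E.
  • Year: 2008
  • Pages: 48 p.
  • Total pages: 48
  • Original format: PDF
  • Text language: English
  • Chinese Library Classification: Radio electronics, telecommunications technology
  • Keywords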
