首页> 外文会议>International Conference on speech and computer >Optimized Active Learning Strategy for Audiovisual Speaker Recognition
【24h】

Optimized Active Learning Strategy for Audiovisual Speaker Recognition

机译:视听说话人识别的优化主动学习策略

获取原文

摘要

The purpose of this work is to investigate the improved recognition accuracy caused from exploiting optimization stages for tuning parameters of an Active Learning (AL) classifier. Since plenty of data could be available during Speaker Recognition (SR) tasks, the AL concept, which incorporates human entities inside its learning kernel for exploring hidden insights into unlabeled data, seems extremely suitable, without demanding much expertise on behalf of the human factor. Six datasets containing 8 and 16 speakers' utterances under different recording setups, are described by audiovisual features and evaluated through the time-efficient Uncertainty Sampling query strategy (UncS). Both Support Vector Machines (SVMs) and Random Forest (RF) algorithms were selected to be tuned over a small subset of the initial training data and then applied iteratively for mining the most suitable instances from a corresponding pool of unlabeled instances. Useful conclusions are drawn concerning the values of the selected parameters, allowing future optimization attempts to get employed into more restricted regions, while remarkable improvements rates were obtained using an ideal annotator.
机译:这项工作的目的是研究由于利用优化阶段来调整主动学习(AL)分类器的参数而导致的提高的识别准确性。由于在说话人识别(SR)任务期间可以获取大量数据,因此AL概念将人类实体整合到其学习内核中,以探索对未标记数据的隐藏见解,这似乎非常适合,不需要代表人为因素的大量专业知识。视听功能描述了六个数据集,这些数据集包含不同录制设置下的8和16个说话者的话语,并通过省时的不确定性采样查询策略(UncS)进行了评估。支持向量机(SVM)和随机森林(RF)算法均被选择在初始训练数据的一小部分上进行调整,然后迭代应用以从相应的未标记实例池中挖掘最合适的实例。关于所选参数的值,得出了有用的结论,允许将来的优化尝试被应用到更多的受限区域中,同时使用理想的注释器可以获得显着的改善率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号