首页> 外文期刊>IEEE transactions on systems, man, and cybernetics. Part A, Systems and humans >A Low-Complexity Parabolic Lip Contour Model With Speaker Normalization for High-Level Feature Extraction in Noise-Robust Audiovisual Speech Recognition
【24h】

A Low-Complexity Parabolic Lip Contour Model With Speaker Normalization for High-Level Feature Extraction in Noise-Robust Audiovisual Speech Recognition

机译:具有说话人归一化功能的低复杂度抛物线形嘴唇轮廓模型,用于噪声鲁棒的视听语音识别中的高级特征提取

获取原文
           

摘要

This paper proposes a novel low-complexity lip contour model for high-level optic feature extraction in noise-robust audiovisual (AV) automatic speech recognition systems. The model is based on weighted least-squares parabolic fitting of the upper and lower lip contours, does not require the assumption of symmetry across the horizontal axis of the mouth, and is therefore realistic. The proposed model does not depend on the accurate estimation of specific facial points, as do other high-level models. Also, we present a novel low-complexity algorithm for speaker normalization of the optic information stream, which is compatible with the proposed model and does not require parameter training. The use of the proposed model with speaker normalization results in noise robustness improvement in AV isolated-word recognition relative to using the baseline high-level model.
机译:本文提出了一种新颖的低复杂度嘴唇轮廓模型,用于在鲁棒视听(AV)自动语音识别系统中进行高级光学特征提取。该模型基于上嘴唇轮廓和下嘴唇轮廓的加权最小二乘抛物线拟合,不需要在嘴的水平轴上对称的假设,因此是现实的。所提出的模型不像其他高级模型那样依赖于特定面部的准确估计。此外,我们提出了一种新颖的低复杂度算法,用于光学信息流的说话人归一化,它与提出的模型兼容,并且不需要参数训练。相对于使用基线高级模型,将建议的模型与说话者归一化配合使用可提高AV孤立单词识别的噪声鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号