首页> 外文会议>NICTA-HCSNet Multimodal User Interaction Workshop 2005(MMUI2005); 200511; Sydney(AU) >A New Lip Feature Representation Method for Video-based Bimodal Authentication
【24h】

A New Lip Feature Representation Method for Video-based Bimodal Authentication

机译:一种基于视频的双峰认证的唇形特征表示新方法

获取原文
获取原文并翻译 | 示例

摘要

As the low-cost video transmission becomes popular, video based bimodal (audio and visual) authentication has great potential in various applications that require access control. It is especially useful for handheld terminals, which are often used under adverse environments, where the signal quality is rather poor. When human voice is used for authentication, one of the most relevant visual features is the dynamic movement of lips. In this research, we investigate on the use of static and dynamic features of speaking lips in the context of voice based authentication. A new feature representation that preserves both appearance and motion pattern of speaking lips is proposed. The dimension of extracted features is reduced by multiple discriminant analysis (MDA) and the method of nearest neighbor is used for classification. Our method can achieve an identification rate of 98% with only lips features for 200 clients of the XM2VTS database. Experiments on speaker verification using fused audio and visual features are on-going.
机译:随着低成本视频传输的普及,基于视频的双峰(音频和视频)身份验证在需要访问控制的各种应用中具有巨大的潜力。对于手持终端尤其有用,手持终端通常在信号质量很差的恶劣环境下使用。当使用人类语音进行身份验证时,最相关的视觉功能之一就是嘴唇的动态运动。在这项研究中,我们研究了在基于语音的身份验证的情况下对口唇的静态和动态特征的使用。提出了一种既保留说话嘴唇的外观又保留运动模式的新特征表示。通过多重判别分析(MDA)减少了提取特征的维数,并使用最近邻法进行分类。我们的方法仅凭嘴唇特征即可为XM2VTS数据库的200个客户端实现98%的识别率。正在进行使用融合的音频和视频功能进行说话人验证的实验。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号