首页> 外文会议>Mobile Multimedia/Image Processing for Military and Security Applications >Comparison of weighting strategies in early and late fusion approaches to audio-visual person authentication
【24h】

Comparison of weighting strategies in early and late fusion approaches to audio-visual person authentication

机译:早期和晚期融合方法在视听人员身份验证中的加权策略比较

获取原文
获取原文并翻译 | 示例

摘要

Person authentication can be strongly enhanced by the combination of different modalities. This is also true for the face and voice signals, which can be obtained with minimal inconvenience for the user. However, features from each modality can be combined at various different levels of processing and for face and voice signals the advantage of fusion depends strongly on the way they are combined. The aim of the work presented is to investigate the optimal strategy for combining voice and face modalities for signals of varying quality. The experimental data are taken from a newly acquired database using a PDA, which contains audio-visual recordings in different conditions. Voice features use rnel-frequency cepstral coefficients, while the face signal is parameterised using wavelet coefficients in certain subbands. Results are presented for both early (feature-level) and late (score-level) fusion. At each level different fixed and variable weightings are used, both to weight between frames within each modality and to weight between modalities, where weights are based on some measure of signal reliability, such as the accuracy of automatic face detection or the audio signal to noise ratio. In addition, the contribution to authentication of information from different areas of the face is explored to determine a regional weighting for the face coefficients.
机译:通过组合不同的方式,可以大大增强人员身份验证。对于脸部和语音信号也是如此,这对于用户来说是最小的麻烦。但是,每个模态的特征可以在各种不同的处理级别进行组合,对于脸部和语音信号,融合的优势很大程度上取决于它们的组合方式。提出的工作的目的是研究将语音和面部模式组合在一起以改变质量信号的最佳策略。实验数据是使用PDA从新获取的数据库中获取的,该数据库包含不同条件下的视听记录。语音特征使用轴频倒频谱系数,而面部信号则使用某些子带中的小波系数进行参数化。呈现了早期(功能级)和晚期(分数级)融合的结果。在每个级别使用不同的固定权重和可变权重,既对每个模态内的帧之间进行加权,又对模态之间进行加权,其中权重基于信号可靠性的某种度量,例如自动人脸检测或音频信号对噪声的准确性。比。另外,探索了对来自面部的不同区域的信息的认证的贡献,以确定面部系数的区域加权。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号