首页> 外文会议>International Conference on Multimedia Modeling >A Collaborative Multi-modal Fusion Method Based on Random Variational Information Bottleneck for Gesture Recognition
【24h】

A Collaborative Multi-modal Fusion Method Based on Random Variational Information Bottleneck for Gesture Recognition

机译:一种基于随机变分信息瓶颈进行手势识别的协同多模态融合方法

获取原文

摘要

Gesture is a typical human-machine interaction manner, accurate and robust gesture recognition can assist to achieve more natural interaction and understanding. Multi-modal gesture recognition can improve the recognition performance with the help of complex multi-modal relationship. However, it still faces the challenge of how to effectively balance the correlation and redundancy among different modalities, so as to guarantee the accuracy and robustness of the recognition. Hence, in this paper, a collaborative multi-modal learning method based on Random Variational Information Bottleneck (RVIB) is proposed. With random local information selection strategy, some information is compressed by information bottleneck, and the rest is retained directly, so as to make full use of effective redundant information while eliminating invalid redundant information. Experiments on open dataset show that the proposed method can achieve 95.77% recognition accuracy for 21 dynamic gestures, and can guarantee the recognition accuracy when some modality is missing.
机译:手势是一种典型的人机交互方式,准确且鲁棒的手势识别可以帮助实现更自然的相互作用和理解。多模态手势识别可以通过复杂的多模态关系提高识别性能。然而,它仍然面临如何在不同模式之间有效地平衡相关性和冗余的挑战,以保证识别的准确性和鲁棒性。因此,在本文中,提出了一种基于随机变分信息瓶颈(RVIB)的协同多模态学习方法。对于随机本地信息选择策略,一些信息被信息瓶颈压缩,其余的被直接保留,以便充分利用有效的冗余信息,同时消除无效的冗余信息。 Open DataSet的实验表明,该方法可以实现21个动态手势的95.77%的识别准确性,并可以保证缺少某些模态时的识别准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号