首页> 外文会议>International Conference on Multimedia Modeling >A Collaborative Multi-modal Fusion Method Based on Random Variational Information Bottleneck for Gesture Recognition

【24h】

A Collaborative Multi-modal Fusion Method Based on Random Variational Information Bottleneck for Gesture Recognition

机译：一种基于随机变分信息瓶颈进行手势识别的协同多模态融合方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Gesture is a typical human-machine interaction manner, accurate and robust gesture recognition can assist to achieve more natural interaction and understanding. Multi-modal gesture recognition can improve the recognition performance with the help of complex multi-modal relationship. However, it still faces the challenge of how to effectively balance the correlation and redundancy among different modalities, so as to guarantee the accuracy and robustness of the recognition. Hence, in this paper, a collaborative multi-modal learning method based on Random Variational Information Bottleneck (RVIB) is proposed. With random local information selection strategy, some information is compressed by information bottleneck, and the rest is retained directly, so as to make full use of effective redundant information while eliminating invalid redundant information. Experiments on open dataset show that the proposed method can achieve 95.77% recognition accuracy for 21 dynamic gestures, and can guarantee the recognition accuracy when some modality is missing.

机译：手势是一种典型的人机交互方式，准确且鲁棒的手势识别可以帮助实现更自然的相互作用和理解。多模态手势识别可以通过复杂的多模态关系提高识别性能。然而，它仍然面临如何在不同模式之间有效地平衡相关性和冗余的挑战，以保证识别的准确性和鲁棒性。因此，在本文中，提出了一种基于随机变分信息瓶颈（RVIB）的协同多模态学习方法。对于随机本地信息选择策略，一些信息被信息瓶颈压缩，其余的被直接保留，以便充分利用有效的冗余信息，同时消除无效的冗余信息。 Open DataSet的实验表明，该方法可以实现21个动态手势的95.77％的识别准确性，并可以保证缺少某些模态时的识别准确性。

著录项

来源
《International Conference on Multimedia Modeling》|2021年|62-74|共13页
会议地点
作者
Yang Gu; Yajie Li; Yiqiang Chen; Jiwei Wang; Jianfei Shen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Multi-modal fusion; Information bottleneck; Random regularization; Gesture recognition;

机译：多模态融合;信息瓶颈;随机正规化;姿态识别;

相似文献

外文文献
中文文献
专利

1. Nonparametric Feature Matching Based Conditional Random Fields for Gesture Recognition from Multi-Modal Video [J] . Ju Yong Chang IEEE Transactions on Pattern Analysis and Machine Intelligence . 2016,第8期

机译：基于非参数特征匹配的条件随机场用于多模态视频的手势识别
2. Multi-modal user interaction method based on gaze tracking and gesture recognition [J] . Heekyung Lee, Seong Yong Lim, Injae Lee, Signal Processing. Image Communication: A Publication of the the European Association for Signal Processing . 2013,第2期

机译：基于凝视跟踪和手势识别的多模式用户交互方法
3. A Fusion Recognition Method Based on Multifeature Hidden Markov Model for Dynamic Hand Gesture [J] . Guoliang Chen, Kaikai Ge Computational intelligence and neuroscience . 2020,第4期

机译：一种基于Multififure Hidden Markov模型动态手势的融合识别方法
4. A Variational Information Bottleneck Based Method to Compress Sequential Networks for Human Action Recognition [C] . Ayush Srivastava, Oshin Dutta, Jigyasa Gupta, IEEE Winter Conference on Applications of Computer Vision . 2021

机译：基于变分信息瓶颈压缩人类行为识别顺序网络的方法
5. Observation Points Based Multi-modal Fusion Systems for Skeleton Action Recognition [D] . Singh, Iqbal. 2020

机译：基于观测点的骨架动作识别多模态融合系统
6. Multi-Modal Fusion Emotion Recognition Method of Speech Expression Based on Deep Learning [O] . Dong Liu, Zhiyong Wang, Lifeng Wang, 2021

机译：基于深度学习的语音表达多模态融合情绪识别方法
7. A Fusion Recognition Method Based on Multifeature Hidden Markov Model for Dynamic Hand Gesture [O] . Guoliang Chen, Kaikai Ge 2020

机译：一种基于Multififure Hidden Markov模型动态手势的融合识别方法

A Collaborative Multi-modal Fusion Method Based on Random Variational Information Bottleneck for Gesture Recognition

摘要

著录项

相似文献

相关主题

期刊订阅