Explore Efficient Local Features from RGB-D Data for One-Shot Learning Gesture Recognition

Jun Wan; Guodong Guo; Stan Z. Li

Abstract

The availability of handy RGB-D sensors has brought about a surge of gesture recognition research and applications. Among the various approaches, one-shot learning is advantageous because it requires a minimal amount of data. Here, we provide a thorough review of one-shot learning gesture recognition from RGB-D data and propose a novel spatiotemporal feature extracted from RGB-D data, namely mixed features around sparse keypoints (MFSK). In the review, we analyze the challenges we face and point out some future research directions that may enlighten researchers in this field. The proposed MFSK feature is robust and invariant to scale, rotation, and partial occlusions. To alleviate the insufficiency of one-shot training samples, we augment the training samples by artificially synthesizing versions at various temporal scales, which helps in coping with gestures performed at varying speeds. We evaluate the proposed method on the ChaLearn gesture dataset (CGD). The results show that our approach outperforms all currently published approaches on the challenging data of CGD, such as the translated, scaled, and occluded subsets. When applied to RGB-D datasets that are not one-shot (e.g., the Cornell Activity Dataset-60 and the MSR Daily Activity 3D dataset), the proposed feature also produces very promising results under leave-one-out cross validation or one-shot learning.
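
The temporal-scale augmentation mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how the synthetic versions are produced, so the nearest-frame resampling below, the function names `resample_temporal` and `augment_temporal`, and the scale values are all assumptions made for illustration, not the authors' procedure.

```python
import numpy as np


def resample_temporal(video: np.ndarray, scale: float) -> np.ndarray:
    """Return `video` (frames on axis 0) stretched or shrunk to `scale` times its length.

    Assumption: frames are picked by nearest-index sampling; the paper may
    use a different scheme.
    """
    if scale <= 0:
        raise ValueError("scale must be positive")
    n_frames = video.shape[0]
    n_out = max(1, int(round(n_frames * scale)))
    # Evenly spaced positions in the source clip, rounded to the nearest frame.
    idx = np.rint(np.linspace(0, n_frames - 1, n_out)).astype(int)
    return video[idx]


def augment_temporal(video: np.ndarray, scales=(0.5, 0.75, 1.0, 1.5, 2.0)):
    """Build one synthetic clip per temporal scale (hypothetical scale set)."""
    return [resample_temporal(video, s) for s in scales]


if __name__ == "__main__":
    clip = np.arange(8).reshape(8, 1, 1)  # 8 dummy frames of 1x1 pixels
    versions = augment_temporal(clip)
    print([v.shape[0] for v in versions])  # [4, 6, 8, 12, 16]
```

Scales below 1 simulate a gesture performed faster than the single training sample, and scales above 1 simulate a slower performance. For RGB-D input, the same frame indices would presumably be applied to both the color and the depth stream so that they stay aligned.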

Bibliographic details

Source
IEEE Transactions on Pattern Analysis and Machine Intelligence | 2016, Issue 8 | pp. 1626-1639 | 14 pages
Authors
Jun Wan; Guodong Guo; Stan Z. Li
Author affiliation
Center for Biometrics and Security Research & National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Room 1411, Intelligent Building, 95 Zhongguancun Donglu, Haidian District, Beijing, China
Original format
PDF
Language
English
Keywords
One-shot learning; RGB-D data; bag of visual words model; gesture recognition

Similar literature

1. Adaptive Local Spatiotemporal Features from RGB-D Data for One-Shot Learning Gesture Recognition [J]. Jia Lin, Xiaogang Ruan, Naigong Yu. Sensors, 2016, Issue 12.
2. One-shot Learning Gesture Recognition from RGB-D Data Using Bag of Features [J]. Jun Wan, Qiuqi Ruan, Wei Li. Journal of Machine Learning Research, 2013, April issue.
3. MultiD-CNN: A multi-dimensional feature learning approach based on deep convolutional networks for gesture recognition in RGB-D image sequences [J]. Elboushaki Abdessamad, Hannane Rachida, Afdel Karim. Expert Systems with Applications, 2020, Jan. issue.
4. One-shot learning gesture recognition based on improved 3D SMoSIFT feature descriptor from RGB-D videos [C]. Lin Jia, Ruan Xiaogang, Yu Naigong. Chinese Control and Decision Conference, 2015.
5. Viewpoint invariant gesture recognition and 3D hand pose estimation using RGB-D [D]. Doliotis, Paul. 2013.
6. Adaptive Local Spatiotemporal Features from RGB-D Data for One-Shot Learning Gesture Recognition [O]. Jia Lin, Xiaogang Ruan, Naigong Yu. 2016.
