...
首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >Explore Efficient Local Features from RGB-D Data for One-Shot Learning Gesture Recognition
【24h】

Explore Efficient Local Features from RGB-D Data for One-Shot Learning Gesture Recognition

机译:通过RGB-D数据探索高效的局部特征,实现一键式学习手势识别

获取原文
获取原文并翻译 | 示例
           

摘要

Availability of handy RGB-D sensors has brought about a surge of gesture recognition research and applications. Among various approaches, one shot learning approach is advantageous because it requires minimum amount of data. Here, we provide a thorough review about one-shot learning gesture recognition from RGB-D data and propose a novel spatiotemporal feature extracted from RGB-D data, namely mixed features around sparse keypoints (MFSK). In the review, we analyze the challenges that we are facing, and point out some future research directions which may enlighten researchers in this field. The proposed MFSK feature is robust and invariant to scale, rotation and partial occlusions. To alleviate the insufficiency of one shot training samples, we augment the training samples by artificially synthesizing versions of various temporal scales, which is beneficial for coping with gestures performed at varying speed. We evaluate the proposed method on the Chalearn gesture dataset (CGD). The results show that our approach outperforms all currently published approaches on the challenging data of CGD, such as translated, scaled and occluded subsets. When applied to the RGB-D datasets that are not one-shot (e.g., the Cornell Activity Dataset-60 and MSR Daily Activity 3D dataset), the proposed feature also produces very promising results under leave-one-out cross validation or one-shot learning.
机译:方便的RGB-D传感器的可用性带来了手势识别研究和应用的激增。在各种方法中,一种射击学习方法是有利的,因为它需要最少的数据量。在这里,我们提供了有关从RGB-D数据中进行一次学习手势识别的详尽综述,并提出了从RGB-D数据中提取的新颖时空特征,即稀疏关键点(MFSK)周围的混合特征。在这篇综述中,我们分析了我们面临的挑战,并指出了一些未来的研究方向,可能会启发该领域的研究人员。所提出的MFSK特征是鲁棒的,并且对于缩放,旋转和部分遮挡不变。为了减轻一次射击训练样本的不足,我们通过人工合成各种时间尺度的版本来增加训练样本,这对于应对以不同速度执行的手势是有益的。我们在Chalearn手势数据集(CGD)上评估了提出的方法。结果表明,在CGD具有挑战性的数据上,我们的方法优于所有已发布的方法,例如翻译,缩放和遮挡的子集。当应用于非一次性的RGB-D数据集(例如,康奈尔活动数据集60和MSR每日活动3D数据集)时,建议的功能在留一法交叉验证或一分法下也能产生非常有希望的结果射击学习。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号