首页> 外文会议>SPIE Conference on Multispectral Image Processing and Pattern Recognition >Hand Pose Estimation in Depth Image using CNN and Random Forest
【24h】

Hand Pose Estimation in Depth Image using CNN and Random Forest

机译:使用CNN和随机林的深度图像中的手姿态估计

获取原文

摘要

Thanks to the availability of low cost depth cameras, like Microsoft Kinect, 3D hand pose estimation attracted special research attention in these years. Due to the large variations in hand' s viewpoint and the high dimension of hand motion. 3D hand pose estimation is still challenging. In this paper we propose a two-stage framework which joint with CNN and Random Forest to boost the performance of hand pose estimation. First, we use a standard Convolutional Neural Network (CNN) to regress the hand joints' locations. Second, using a Random Forest to refine the joints from the first stage. In the second stage, we propose a pyramid feature which merges the information flow of the CNN. Specifically, we get the rough joints' location from first stage, then rotate the convolutional feature maps (and image). After this, for each joint, we map its location to each feature map (and image) firstly, then crop features at each feature map (and image) around its location, put extracted features to Random Forest to refine at last. Experimentally, we evaluate our proposed method on ICVL dataset and get the mean error about 1lmm, our method is also real-time on a desktop.
机译:由于低成本深度摄像头的可用性,如Microsoft Kinect,3D手姿势估计这些年来吸引了特殊的研究。由于手动观点的巨大变化和手动运动的高尺寸。 3D手姿势估计仍然具有挑战性。在本文中,我们提出了一种与CNN和随机林关节的两级框架,以提高手姿势估计的性能。首先,我们使用标准卷积神经网络(CNN)来分配手关节的位置。其次,使用随机森林来优化第一阶段的关节。在第二阶段,我们提出了一个金字塔特征,其合并了CNN的信息流。具体而言,我们从第一阶段获取粗糙的关节位置,然后旋转卷积特征映射(和图像)。在此之后,对于每个关节,我们首先将其位置映射到每个特征映射(和图像),然后在其位置周围的每个特征映射(和图像)的裁剪特征,将提取的功能放到随机林中以便最后改进。实验,我们在ICVL数据集上评估我们提出的方法,并获得大约1LMM的平均误差,我们的方法也是桌面上的实时。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号