首页> 外文学位 >Image-based gesture recognition with support vector machines.
【24h】

Image-based gesture recognition with support vector machines.

机译:支持向量机基于图像的手势识别。

获取原文
获取原文并翻译 | 示例

摘要

Recent advances in various display and virtual technologies, coupled with an explosion in available computing power, have given rise to a number of novel human-computer interaction (HCI) modalities, among which gesture recognition is undoubtedly the most grammatically structured and complex. However, despite the abundance of novel interaction devices, the naturalness and efficiency of HCI has remained low. This is due in particular to the lack of robust sensory data interpretation techniques. To address the task of gesture recognition, this dissertation establishes novel probabilistic approaches based on support vector machines (SVM). Of special concern in this dissertation are the shapes of contact images on a multi-touch input device for both 2D and 3D. Five main topics are covered in this work.;The first topic deals with the hand pose recognition problem. To perform classification of different gestures, a recognition system must attempt to leverage between class variations (semantically varying gestures), while accommodating potentially large within-class variations (different hand poses to perform certain gestures). For recognition of gestures, a sequence of hand shapes should be recognized. We present a novel shape recognition approach using Active Shape Model (ASM) based matching and SVM based classification. Firstly, a set of correspondences between the reference shape and query image are identified through ASM. Next, a dissimilarity measure is created to measure how well any correspondence in the set aligns the reference shape and candidate shape in the query image. Finally, SVM classification is employed to search through the set to find the best match from the kernel defined by the dissimilarity measure above. Results presented show better recognition results than conventional segmentation and template matching methods.;In the second topic, dynamic time alignment (DTA) based SVM gesture recognition is addressed. In particular, the proposed method combines DTA and SVM by establishing a new kernel. The gesture data is first projected into a common eigenspace formed by principal component analysis (PCA) and a distance measure is derived from the DTA. By incorporating DTA in the kernel function, general classification problems with variable-sized sequential data can be handled.;In the third topic, a C++ based gesture recognition application for the multi-touchpad is implemented. It uses the proposed gesture classification method along with a recursive neural networks approach to recognize definable gestures in real time, then runs an associated command. This application can further enable users with different disabilities or preferences to custom define gestures and enhance the functionality of the multi-touchpad.;Fourthly, an SVM-based classification method that uses the DTW to measure the similarity score is presented. The key contribution of this approach is the extension of trajectory based approaches to handle shape information, thereby enabling the expansion of the system's gesture vocabulary. It consists of two steps: converting a given set of frames into fixed-length vectors and training an SVM from the vectorized manifolds. Using shape information not only yields discrimination among various gestures, but also enables gestures that cannot be characterized solely based on their motion information to be classified, thus boosting overall recognition scores.;Finally, a computer vision based gesture command and communication system is developed. This system performs two major tasks: the first is to utilize the 3D traces of laser pointing devices as input to perform common keyboard and mouse control; the second is supplement free continuous gesture recognition, i.e., data gloves or other assistive devices are not necessary for 3D gestures recognition. As a result, the gesture can be used as a text entry system in wearable computers or mobile communication devices, though the recognition rate is lower than the approaches with the assistive tools. The purpose of this system is to develop new perceptual interfaces for human computer interaction based on visual input captured by computer vision systems, and to investigate how such interfaces can complement or replace traditional interfaces.;Original contributions of this work span the areas of SVMs and interpretation of computer sensory inputs, such as gestures for advanced HCI. In particular, we have addressed the following important issues: (1) ASM base kernels for shape recognition. (2) DTA based sequence kernels for gesture classification. (3) Recurrent neural networks (RNN). (4) Exploration of a customizable HCI. (5) Computer vision based 3D gesture recognition algorithms and system.
机译:各种显示和虚拟技术的最新进展,再加上可用计算能力的爆炸式增长,带来了许多新颖的人机交互(HCI)模式,其中手势识别无疑是最语法化和最复杂的。然而,尽管有许多新颖的相互作用装置,但HCl的自然性和效率仍然很低。这尤其是由于缺乏健壮的感官数据解释技术。为了解决手势识别的任务,本文建立了基于支持向量机的新型概率方法。本文特别关注的是2D和3D的多点触摸输入设备上的接触图像的形状。这项工作涵盖五个主要主题。;第一个主题涉及手势识别问题。为了执行不同手势的分类,识别系统必须尝试在类别变化(语义上变化的手势)之间发挥杠杆作用,同时适应潜在的较大的类别内变化(执行某些手势的不同手势)。为了识别手势,应识别一系列手形。我们提出了一种基于主动形状模型(ASM)的匹配和基于SVM的分类的新颖形状识别方法。首先,通过ASM识别参考形状和查询图像之间的一组对应关系。接下来,创建相异度度量以度量集合中的任何对应关系对齐查询图像中的参考形状和候选形状的程度。最后,采用SVM分类来搜索集合,以从由上述差异度量定义的内核中找到最佳匹配。提出的结果显示出比常规分割和模板匹配方法更好的识别结果。在第二个主题中,解决了基于动态时间对齐(DTA)的SVM手势识别。特别地,所提出的方法通过建立新内核将DTA和SVM相结合。首先将手势数据投影到由主成分分析(PCA)形成的公共特征空间中,然后从DTA导出距离度量。通过将DTA合并到内核函数中,可以处理具有可变大小的顺序数据的一般分类问题。在第三个主题中,实现了基于C ++的多触摸板手势识别应用程序。它使用提出的手势分类方法以及递归神经网络方法实时识别可定义的手势,然后运行关联的命令。此应用程序还可以使具有不同残疾或偏好的用户自定义定义手势并增强多点触摸板的功能。第四,提出了一种使用DTW来衡量相似度得分的基于SVM的分类方法。这种方法的主要作用是扩展了基于轨迹的方法来处理形状信息,从而扩展了系统的手势词汇。它包括两个步骤:将一组给定的帧转换为固定长度的向量,并从向量化的流形训练SVM。使用形状信息不仅可以区分各种手势,而且还可以对不能仅基于其运动信息进行表征的手势进行分类,从而提高总体识别分数。最后,开发了一种基于计算机视觉的手势命令和通信系统。该系统执行两个主要任务:首先是利用激光定点设备的3D轨迹作为输入来执行常见的键盘和鼠标控制。第二个是无补充的连续手势识别,即3D手势识别不需要数据手套或其他辅助设备。结果,尽管识别率低于使用辅助工具的方法,但是该手势可以用作可穿戴计算机或移动通信设备中的文本输入系统。该系统的目的是基于计算机视觉系统捕获的视觉输入来开发用于人机交互的新感知界面,并研究此类界面如何补充或替代传统界面。解释计算机的感觉输入,例如高级HCI的手势。特别是,我们解决了以下重要问题:(1)用于形状识别的ASM基本内核。 (2)基于DTA的序列核用于手势分类。 (3)递归神经网络(RNN)。 (4)探索可定制的HCI。 (5)基于计算机视觉的3D手势识别算法和系统。

著录项

  • 作者

    Yuan, Yu.;

  • 作者单位

    University of Delaware.;

  • 授予单位 University of Delaware.;
  • 学科 Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 2008
  • 页码 155 p.
  • 总页数 155
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 无线电电子学、电信技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号