...
首页> 外文期刊>Image and Vision Computing >Statistical adaptive metric learning in visual action feature set recognition
【24h】

Statistical adaptive metric learning in visual action feature set recognition

机译:视觉动作特征集识别中的统计自适应度量学习

获取原文
获取原文并翻译 | 示例
           

摘要

Great variances in visual features often present significant challenges in human action recognitions. To address this common problem, this paper proposes a statistical adaptive metric learning (SAML) method by exploring various selections and combinations of multiple statistics in a unified metric learning framework. Most statistics have certain advantages in specific controlled environments, and systematic selections and combinations can adapt them to more realistic "in the wild" scenarios. In the proposed method, multiple statistics, include means, covariance matrices and Gaussian distributions, are explicitly mapped or generated in the Riemannian manifolds. Typically, d-dimensional mean vectors in R-d are mapped to a R-d x d space of symmetric positive definite (SPD) matrices Sym-(+)(d). Subsequently, by embedding the heterogeneous manifolds in their tangent Hilbert space, subspace combination with minimal deviation is selected from multiple statistics. Then Mahalanobis metrics are introduced to map them back into the Euclidean space. Unified optimizations are finally performed based on the Euclidean distances. In the proposed method, subspaces with smaller deviations are selected before metric learning. Therefore, by exploring different metric combinations, the final learning is more representative and effective than exhaustively learning from all the hybrid metrics. Experimental evaluations are conducted on human action recognitions in both static and dynamic scenarios. Promising results demonstrate that the proposed method performs effectively for human action recognitions in the wild. (C) 2016 Elsevier B.V. All rights reserved.
机译:视觉特征的巨大差异通常会给人类动作识别带来巨大挑战。为了解决这个常见问题,本文提出了一种统计自适应度量学习(SAML)方法,该方法通过探索统一度量学习框架中多种统计数据的各种选择和组合。大多数统计数据在特定的受控环境中具有某些优势,系统的选择和组合可以使它们适应更实际的“野外”场景。在提出的方法中,多个统计量,包括均值,协方差矩阵和高斯分布,都在黎曼流形中被明确映射或生成。通常,将R-d中的d维平均向量映射到对称正定(SPD)矩阵Sym-(+)(d)的R-d x d空间。随后,通过将异质流形嵌入其切线希尔伯特空间中,可以从多个统计信息中选择偏差最小的子空间组合。然后引入了Mahalanobis度量,以将其映射回欧几里得空间。最后根据欧几里得距离执行统一的优化。在提出的方法中,在度量学习之前选择具有较小偏差的子空间。因此,通过探索不同的指标组合,最终学习比从所有混合指标中进行详尽学习要更具代表性和有效性。在静态和动态场景中都对人类动作识别进行了实验评估。有希望的结果表明,所提出的方法对于野外的人类动作识别有效。 (C)2016 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号