首页> 外文期刊>Image Processing, IEEE Transactions on >High-Order Local Pooling and Encoding Gaussians Over a Dictionary of Gaussians
【24h】

High-Order Local Pooling and Encoding Gaussians Over a Dictionary of Gaussians

机译:高斯字典上的高阶局部池化和高斯编码

获取原文
获取原文并翻译 | 示例
           

摘要

Local pooling (LP) in configuration (feature) space proposed by Boureau et al. explicitly restricts similar features to be aggregated, which can preserve as much discriminative information as possible. At the time it appeared, this method combined with sparse coding achieved competitive classification results with only a small dictionary. However, its performance lags far behind the state-of-the-art results as only the zero-order information is exploited. Inspired by the success of high-order statistical information in existing advanced feature coding or pooling methods, we make an attempt to address the limitation of LP. To this end, we present a novel method called high-order LP (HO-LP) to leverage the information higher than the zero-order one. Our idea is intuitively simple: we compute the first- and second-order statistics per configuration bin and model them as a Gaussian. Accordingly, we employ a collection of Gaussians as visual words to represent the universal probability distribution of features from all classes. Our problem is naturally formulated as encoding Gaussians over a dictionary of Gaussians as visual words. This problem, however, is challenging since the space of Gaussians is not a Euclidean space but forms a Riemannian manifold. We address this challenge by mapping Gaussians into the Euclidean space, which enables us to perform coding with common Euclidean operations rather than complex and often expensive Riemannian operations. Our HO-LP preserves the advantages of the original LP: pooling only similar features and using a small dictionary. Meanwhile, it achieves very promising performance on standard benchmarks, with either conventional, hand-engineered features or deep learning-based features.
机译:Boureau等人提出的配置(功能)空间中的局部池(LP)。明确限制要聚合的相似功能,这样可以保留尽可能多的区分性信息。当时看来,这种方法与稀疏编码相结合,仅用很小的字典就可以达到竞争性的分类结果。但是,由于仅利用零阶信息,其性能远远落后于最新技术的结果。受现有高级特征编码或合并方法中高阶统计信息成功的启发,我们尝试解决LP的局限性。为此,我们提出了一种称为高阶LP(HO-LP)的新颖方法,以利用高于零阶的信息。我们的想法直观上很简单:我们计算每个配置bin的一阶和二阶统计量并将其建模为高斯模型。因此,我们采用高斯集合作为视觉词来表示所有类别特征的普遍概率分布。我们的问题很自然地被公式化为在高斯字典上将高斯编码为视觉单词。但是,由于高斯空间不是欧几里得空间而是形成黎曼流形,因此这个问题具有挑战性。我们通过将高斯映射到欧几里得空间中来应对这一挑战,这使我们能够使用普通的欧几里得运算而不是复杂且通常是昂贵的黎曼运算来执行编码。我们的HO-LP保留了原始LP的优点:仅合并相似的功能并使用小的字典。同时,无论是传统的手工设计功能还是基于深度学习的功能,它在标准基准测试上均具有非常可观的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号