首页> 外文期刊>Mathematical Programming >Sample size selection in optimization methods for machine learning
【24h】

Sample size selection in optimization methods for machine learning

机译:机器学习优化方法中的样本量选择

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents a methodology for using varying sample sizes in batch-type optimization methods for large-scale machine learning problems. The first part of the paper deals with the delicate issue of dynamic sample selection in the evaluation of the function and gradient. We propose a criterion for increasing the sample size based on variance estimates obtained during the computation of a batch gradient. We establish an ${O(1/epsilon)}$ complexity bound on the total cost of a gradient method. The second part of the paper describes a practical Newton method that uses a smaller sample to compute Hessian vector-products than to evaluate the function and the gradient, and that also employs a dynamic sampling technique. The focus of the paper shifts in the third part of the paper to L 1-regularized problems designed to produce sparse solutions. We propose a Newton-like method that consists of two phases: a (minimalistic) gradient projection phase that identifies zero variables, and subspace phase that applies a subsampled Hessian Newton iteration in the free variables. Numerical tests on speech recognition problems illustrate the performance of the algorithms.
机译:本文提出了一种在大规模机器学习问题的批处理类型优化方法中使用不同样本量的方法。本文的第一部分讨论了在函数和梯度评估中动态样本选择的微妙问题。我们提出了一个标准,用于基于批次梯度计算过程中获得的方差估计来增加样本量。我们建立了梯度方法总成本的$ {O(1 / epsilon)} $复杂度。本文的第二部分描述了一种实用的牛顿方法,该方法使用较小的样本来计算Hessian向量乘积,而不是评估函数和梯度,并且还采用了动态采样技术。本文的重点在本文的第三部分转移到了旨在产生稀疏解的L 1 正则化问题。我们提出一种类似于牛顿的方法,该方法包括两个阶段:(最小)梯度投影阶段,该阶段标识零变量;子空间阶段,在自由变量中应用子采样的Hessian Newton迭代。语音识别问题的数值测试说明了算法的性能。

著录项

  • 来源
    《Mathematical Programming》 |2012年第1期|p.127-155|共29页
  • 作者单位

    Department of Computer Science, University of Colorado, Boulder, CO, USA;

    Department of Industrial Engineering and Management Sciences, Northwestern University, Evanston, IL, USA;

    Department of Industrial Engineering and Management Sciences, Northwestern University, Evanston, IL, USA;

    Google Inc., Mountain View, CA, USA;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    49M15; 49M37; 65K05;

    机译:49 m15;49 pt;65 k0;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号