Annual Conference on Neural Information Processing Systems

Accelerating Stochastic Gradient Descent using Predictive Variance Reduction



Abstract

Stochastic gradient descent is popular for large scale optimization but has slow convergence asymptotically due to the inherent variance. To remedy this problem, we introduce an explicit variance reduction method for stochastic gradient descent which we call stochastic variance reduced gradient (SVRG). For smooth and strongly convex functions, we prove that this method enjoys the same fast convergence rate as those of stochastic dual coordinate ascent (SDCA) and Stochastic Average Gradient (SAG). However, our analysis is significantly simpler and more intuitive. Moreover, unlike SDCA or SAG, our method does not require the storage of gradients, and thus is more easily applicable to complex problems such as some structured prediction problems and neural network learning.
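The abstract describes the method only at a high level. Below is a minimal Python sketch of a variance-reduced SGD loop in the spirit of SVRG, assuming a finite-sum objective (1/n) * sum_i f_i(w) and a user-supplied per-component gradient grad_i; the function name svrg, the hyperparameters (eta, m, num_epochs), and the snapshot choice are illustrative assumptions, not taken from the paper's theoretical analysis.

import numpy as np

def svrg(grad_i, w0, n, eta=0.01, num_epochs=20, m=None, rng=None):
    """Sketch of SVRG for minimizing (1/n) * sum_i f_i(w).

    grad_i(w, i): gradient of the i-th component f_i at w (numpy array).
    Hyperparameter values here are illustrative placeholders.
    """
    rng = np.random.default_rng() if rng is None else rng
    m = n if m is None else m            # inner-loop length (assumed O(n) here)
    w_tilde = w0.copy()                  # snapshot point
    for _ in range(num_epochs):
        # Full gradient at the snapshot, computed once per outer iteration.
        mu = np.mean([grad_i(w_tilde, i) for i in range(n)], axis=0)
        w = w_tilde.copy()
        for _ in range(m):
            i = rng.integers(n)
            # Variance-reduced stochastic gradient: unbiased estimate of the
            # full gradient whose variance shrinks as w and w_tilde approach
            # the optimum, so no decaying step size is needed.
            g = grad_i(w, i) - grad_i(w_tilde, i) + mu
            w = w - eta * g
        # Here the last inner iterate is taken as the next snapshot; the paper
        # also analyzes choosing a uniformly random inner iterate instead.
        w_tilde = w
    return w_tilde

Note that, as the abstract states, only the snapshot w_tilde and its full gradient mu are stored between inner steps, rather than a table of per-example gradients as in SAG or SDCA.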
