Annual Conference on Information Sciences and Systems

Escaping Saddle Points for Zeroth-order Non-convex Optimization using Estimated Gradient Descent


Abstract

Gradient descent and its variants are widely used in machine learning. However, oracle access to the gradient may not be available in many applications, limiting the direct use of gradient descent. This paper proposes a method that estimates the gradient in order to perform gradient descent and converges to a second-order stationary point for general non-convex optimization problems. Beyond first-order stationarity, second-order stationarity is important in machine learning applications for achieving better performance. We show that the proposed model-free non-convex optimization algorithm returns an ε-second-order stationary point with $\tilde{O}\left(\frac{d^{2+\frac{\theta}{2}}}{\varepsilon^{8+\theta}}\right)$ queries of the function for any arbitrary θ > 0.
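
Concretely, a zeroth-order method of this kind replaces the true gradient with an estimate built from function-value queries and injects small random perturbations near first-order stationary points so that the iterates can escape strict saddles. The sketch below illustrates the idea with a two-point finite-difference estimator; the function names (`estimated_gradient`, `perturbed_zo_gd`) and all constants (`mu`, `eta`, `radius`, `eps`) are illustrative assumptions, not the paper's exact algorithm or parameter choices.

```python
import numpy as np

def estimated_gradient(f, x, mu=1e-4):
    # Two-point finite-difference estimate of the gradient built from
    # 2d function queries; mu is a smoothing radius (illustrative value).
    d = x.shape[0]
    g = np.zeros(d)
    for i in range(d):
        e = np.zeros(d)
        e[i] = 1.0
        g[i] = (f(x + mu * e) - f(x - mu * e)) / (2.0 * mu)
    return g

def perturbed_zo_gd(f, x0, eta=0.05, radius=1e-2, eps=1e-3, steps=2000, seed=0):
    # Gradient descent on the estimated gradient. When the estimate is
    # small (a near first-order stationary point), add a small uniform
    # perturbation so the iterates can leave a strict saddle point.
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float).copy()
    for _ in range(steps):
        g = estimated_gradient(f, x)
        if np.linalg.norm(g) <= eps:
            x = x + rng.uniform(-radius, radius, size=x.shape)
        else:
            x = x - eta * g
    return x

# Toy objective with a strict saddle at the origin
# and minima at (0, +1/sqrt(2)) and (0, -1/sqrt(2)).
f = lambda z: z[0] ** 2 + z[1] ** 4 - z[1] ** 2
print(perturbed_zo_gd(f, [0.0, 0.0]))
```

On this toy objective the estimated gradient vanishes at the origin, so plain estimated-gradient descent would stall at the saddle; the perturbation pushes the iterates off the unstable direction, after which descent carries them toward one of the two minima at (0, ±1/√2).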
