Conference paper · AAAI Conference on Artificial Intelligence

A Stochastic Derivative-Free Optimization Method with Importance Sampling: Theory and Learning to Control



Abstract

We consider the problem of unconstrained minimization of a smooth objective function in R^n in a setting where only function evaluations are possible. While importance sampling is one of the most popular techniques used by machine learning practitioners to accelerate the convergence of their models when applicable, there is not much existing theory for this acceleration in the derivative-free setting. In this paper, we propose the first derivative-free optimization method with importance sampling and derive new improved complexity results on non-convex, convex and strongly convex functions. We conduct extensive experiments on various synthetic and real LIBSVM datasets confirming our theoretical results. We test our method on a collection of continuous control tasks on MuJoCo environments with varying difficulty. Experiments show that our algorithm is practical for high dimensional continuous control problems where importance sampling results in a significant sample complexity improvement.
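To make the idea concrete, here is a minimal sketch of derivative-free descent with importance sampling. It is not the paper's exact algorithm: the function name `dfo_is`, the coordinate-wise forward-difference gradient estimator, and sampling coordinates with probability proportional to assumed per-coordinate smoothness constants are all illustrative assumptions; the paper's sampling distribution and update rule may differ.

```python
import numpy as np

def dfo_is(f, x0, smoothness, iters=3000, mu=1e-4, seed=0):
    """Coordinate-wise derivative-free descent with importance sampling.

    Coordinate i is drawn with probability proportional to its assumed
    smoothness constant smoothness[i]; its partial derivative is estimated
    with a forward finite difference of width mu. Illustrative sketch only.
    """
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float).copy()
    p = smoothness / smoothness.sum()  # importance-sampling distribution
    for _ in range(iters):
        i = rng.choice(len(x), p=p)    # non-uniform coordinate selection
        e = np.zeros_like(x)
        e[i] = 1.0
        g = (f(x + mu * e) - f(x)) / mu  # finite-difference derivative estimate
        x[i] -= g / smoothness[i]        # 1/L_i step along coordinate i
    return x

# Toy quadratic whose per-coordinate curvatures differ by two orders of
# magnitude, so importance sampling concentrates on the stiff coordinate.
L = np.array([1.0, 10.0, 100.0])
f = lambda x: 0.5 * np.sum(L * x ** 2)
x0 = np.ones(3)
x_final = dfo_is(f, x0, L)
```

On this quadratic the method drives the objective from f(x0) = 55.5 down toward zero while spending most of its function evaluations on the high-curvature coordinate, which is the kind of sample-complexity saving the abstract refers to.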


