Systems and Information Engineering Design Symposium

Exploring the use of adaptive gradient methods in effective deep learning systems



Abstract

Successful applications of Deep Learning have brought about breakthroughs in natural language understanding, speech recognition, and computer vision. One of the major challenges of designing powerful Deep Learning solutions for tasks such as image classification and text parsing, however, is the difficulty of training Deep Neural Networks (DNNs) properly. Recent research has raised serious doubts about the use of adaptive gradient methods, which have been popularized for running faster and requiring less parameter tuning than nonadaptive gradient methods. A recent study shows that adaptive gradient methods are worse than nonadaptive gradient methods in terms of training loss and test error. In this paper, we aim to revisit this problem, evaluating several nonadaptive and adaptive gradient methods including a recently-proposed adaptive gradient algorithm, AMSGrad, which seeks to solve some of the problems present in previous adaptive gradient methods. We focus on the benchmark MNIST optical character recognition task, one of the most widely-used in machine learning research, to investigate the differences in using adaptive gradient methods and nonadaptive gradient methods to train DNNs.
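For context on the algorithm the abstract highlights: AMSGrad (Reddi et al.) modifies Adam by keeping a running maximum of the second-moment estimate, which guarantees a non-increasing effective step size per coordinate and fixes a convergence flaw in Adam. The sketch below is a minimal NumPy illustration of that update rule on a toy quadratic, not code from the paper; the function name and hyperparameter defaults are illustrative.

```python
import numpy as np

def amsgrad_step(params, grads, state, lr=0.01,
                 beta1=0.9, beta2=0.999, eps=1e-8):
    """One AMSGrad update. The only change relative to Adam is that
    the denominator uses v_hat = max(v_hat, v) instead of v itself,
    so the per-coordinate learning rate can never grow back up."""
    m, v, v_hat = state
    m = beta1 * m + (1 - beta1) * grads        # first-moment (momentum) estimate
    v = beta2 * v + (1 - beta2) * grads ** 2   # second-moment estimate
    v_hat = np.maximum(v_hat, v)               # AMSGrad: running maximum of v
    params = params - lr * m / (np.sqrt(v_hat) + eps)
    return params, (m, v, v_hat)

# Toy demonstration: minimize f(x) = x^2, whose gradient is 2x.
x = np.array([5.0])
state = (np.zeros_like(x), np.zeros_like(x), np.zeros_like(x))
for _ in range(5000):
    x, state = amsgrad_step(x, 2 * x, state)
print(x[0])  # converges toward the minimum at 0
```

An Adam implementation would differ only by dropping the `np.maximum` line and dividing by `np.sqrt(v)` directly, which is why AMSGrad is often described as a one-line fix.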
