Systems and Information Engineering Design Symposium

Exploring the use of adaptive gradient methods in effective deep learning systems



Abstract

Successful applications of Deep Learning have brought about breakthroughs in natural language understanding, speech recognition, and computer vision. One of the major challenges of designing powerful Deep Learning solutions for tasks such as image classification and text parsing, however, is the difficulty of training Deep Neural Networks (DNNs) properly. Recent research has raised serious doubts about the use of adaptive gradient methods, which have been popularized for running faster and requiring less parameter tuning than nonadaptive gradient methods. A recent study shows that adaptive gradient methods are worse than nonadaptive gradient methods in terms of training loss and test error. In this paper, we aim to revisit this problem, evaluating several nonadaptive and adaptive gradient methods including a recently-proposed adaptive gradient algorithm, AMSGrad, which seeks to solve some of the problems present in previous adaptive gradient methods. We focus on the benchmark MNIST optical character recognition task, one of the most widely-used in machine learning research, to investigate the differences in using adaptive gradient methods and nonadaptive gradient methods to train DNNs.
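The AMSGrad algorithm mentioned in the abstract modifies Adam by keeping a running maximum of the second-moment estimate, so the effective step size can never grow between iterations. A minimal one-dimensional sketch of that update rule, compared against plain (nonadaptive) gradient descent on the toy objective f(x) = x² — not the paper's MNIST setup, and the learning rates and step counts here are illustrative choices:

```python
import math

def amsgrad(grad_fn, x0, lr=0.05, beta1=0.9, beta2=0.999, eps=1e-8, steps=500):
    """One-dimensional AMSGrad sketch: Adam's moment estimates plus a
    non-decreasing cap on the second moment (the AMSGrad correction)."""
    x, m, v, v_hat = x0, 0.0, 0.0, 0.0
    for _ in range(steps):
        g = grad_fn(x)
        m = beta1 * m + (1 - beta1) * g       # biased first-moment estimate
        v = beta2 * v + (1 - beta2) * g * g   # biased second-moment estimate
        v_hat = max(v_hat, v)                 # AMSGrad: never let v_hat shrink
        x -= lr * m / (math.sqrt(v_hat) + eps)
    return x

def sgd(grad_fn, x0, lr=0.05, steps=500):
    """Plain nonadaptive gradient descent for comparison."""
    x = x0
    for _ in range(steps):
        x -= lr * grad_fn(x)
    return x

# Minimize f(x) = x^2, whose gradient is 2x; both methods should approach 0.
grad = lambda x: 2.0 * x
x_ams = amsgrad(grad, 3.0)
x_sgd = sgd(grad, 3.0)
```

In a full DNN setting each scalar above becomes a per-parameter vector, but the `max` cap on `v_hat` is the only change AMSGrad makes relative to Adam.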

