Annual conference on Neural Information Processing Systems

Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks

Abstract

We present weight normalization: a reparameterization of the weight vectors in a neural network that decouples the length of those weight vectors from their direction. By reparameterizing the weights in this way we improve the conditioning of the optimization problem and we speed up convergence of stochastic gradient descent. Our reparameterization is inspired by batch normalization but does not introduce any dependencies between the examples in a minibatch. This means that our method can also be applied successfully to recurrent models such as LSTMs and to noise-sensitive applications such as deep reinforcement learning or generative models, for which batch normalization is less well suited. Although our method is much simpler, it still provides much of the speed-up of full batch normalization. In addition, the computational overhead of our method is lower, permitting more optimization steps to be taken in the same amount of time. We demonstrate the usefulness of our method on applications in supervised image recognition, generative modelling, and deep reinforcement learning.
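
The reparameterization described in the abstract can be written down in a few lines. The following is a minimal sketch, assuming a single dense neuron; the variable names `v`, `g`, and `x` are illustrative and not taken from the paper.

```python
import numpy as np

# Weight normalization expresses a weight vector as w = g * v / ||v||,
# so its length is controlled by the scalar g alone while v only
# determines its direction.
rng = np.random.default_rng(0)

v = rng.normal(size=5)           # direction parameter (unnormalized)
g = 1.0                          # scalar length parameter
x = rng.normal(size=5)           # a single input example

w = g * v / np.linalg.norm(v)    # reparameterized weight vector
y = w @ x                        # pre-activation of one neuron

print(np.linalg.norm(w))         # equals |g|, independent of v
```

In training, gradients are taken with respect to `g` and `v` rather than `w`, which is what decouples the length of the weight vector from its direction during optimization.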
