首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Dynamical Isometry and a Mean Field Theory of RNNs: Gating Enables Signal Propagation in Recurrent Neural Networks
【24h】

Dynamical Isometry and a Mean Field Theory of RNNs: Gating Enables Signal Propagation in Recurrent Neural Networks

机译:动力学等距和RNN的平均场理论:门控能够在经常性神经网络中的信号传播

获取原文
       

摘要

Recurrent neural networks have gained widespread use in modeling sequence data across various domains. While many successful recurrent architectures employ a notion of gating, the exact mechanism that enables such remarkable performance is not well understood. We develop a theory for signal propagation in recurrent networks after random initialization using a combination of mean field theory and random matrix theory. To simplify our discussion, we introduce a new RNN cell with a simple gating mechanism that we call the minimalRNN and compare it with vanilla RNNs. Our theory allows us to define a maximum timescale over which RNNs can remember an input. We show that this theory predicts trainability for both recurrent architectures. We show that gated recurrent networks feature a much broader, more robust, trainable region than vanilla RNNs, which corroborates recent experimental findings. Finally, we develop a closed-form critical initialization scheme that achieves dynamical isometry in both vanilla RNNs and minimalRNNs. We show that this results in significantly improved training dynamics. Finally, we demonstrate that the minimalRNN achieves comparable performance to its more complex counterparts, such as LSTMs or GRUs, on a language modeling task.
机译:经常性神经网络在各个域跨越序列数据的广泛使用。虽然许多成功的经常性架构采用了门控的概念,但能够很好地理解实现这种显着性能的确切机制。在使用平均场理论和随机矩阵理论的组合之后,在随机初始化之后,在随机初始化之后开发一种反复网络中信号传播的理论。为了简化我们的讨论,我们介绍了一个新的RNN单元,具有简单的门控机制,我们称之为最小值并将其与香草RNN进行比较。我们的理论允许我们定义最大时间尺度,RNN可以记住输入。我们表明该理论预测了两种经常性架构的可训练。我们表明,门控复发网络具有比香草RNN更广泛,更强大的可训练区域,其证实了最近的实验结果。最后,我们开发了一种封闭形式的关键初始化方案,该方案在香草RNNS和MINEALRNN中实现了动态等距。我们表明这导致显着改善的培训动态。最后,我们证明了在语言建模任务上,最小值验证了对其更复杂的对应物,例如LSTMS或GRUS的比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号