首页> 外文期刊>Neural computation >The Remarkable Robustness of Surrogate Gradient Learning for Instilling Complex Function in Spiking Neural Networks
【24h】

The Remarkable Robustness of Surrogate Gradient Learning for Instilling Complex Function in Spiking Neural Networks

机译:代理梯度学习在尖峰神经网络中灌输复杂功能的显着稳健性

获取原文
获取原文并翻译 | 示例
           

摘要

Brains process information in spiking neural networks. Their intricate connections shape the diverse functions these networks perform. Yet how network connectivity relates to function is poorly understood, and the functional capabilities of models of spiking networks are still rudimentary. The lack of both theoretical insight and practical algorithms to find the necessary connectivity poses a major impediment to both studying information processing in the brain and building efficient neuromorphic hardware systems. The training algorithms that solve this problem for artificial neural networks typically rely on gradient descent. But doing so in spiking networks has remained challenging due to the nondifferentiable nonlinearity of spikes. To avoid this issue, one can employ surrogate gradients to discover the required connectivity. However, the choice of a surrogate is not unique, raising the question of how its implementation influences the effectiveness of the method. Here, we use numerical simulations to systematically study how essential design parameters of surrogate gradients affect learning performance on a range of classification problems.We show that surrogate gradient learning is robust to different shapes of underlying surrogate derivatives, but the choice of the derivative’s scale can substantially affect learning performance.Whenwe combine surrogate gradients with suitable activity regularization techniques, spiking networks perform robust information processing at the sparse activity limit. Our study provides a systematic account of the remarkable robustness of surrogate gradient learning and serves as a practical guide to model functional spiking neural networks.
机译:尖峰神经网络中的脑限进程信息。它们复杂的连接形状这些网络执行的不同功能。然而,网络连接如何涉及功能较差,并且尖刺网络模型的功能能力仍然是基本的。缺乏理论洞察力和实用算法来寻找必要的连接,对脑和建筑有效的神经形状硬件系统中的学习信息处理构成主要障碍。解决人工神经网络的这个问题的训练算法通常依赖于梯度下降。但是,由于尖刺的非难的非线性,因此在尖峰网络中仍然挑战。为了避免这个问题,可以使用代理梯度来发现所需的连接。然而,替代品的选择并不是独一无二的,提出了其实施方式如何影响方法的有效性。在这里,我们使用数值模拟来系统地研究代理梯度的基本设计参数如何影响一系列分类问题的学习性能。我们表明代理梯度学习对不同的替代品衍生物的不同形状是强大的,但衍生的规模可以选择基本上影响学习性能.WHENWHENWE将代理梯度与合适的活动正则化技术组合,尖峰网络以稀疏活动限制执行鲁棒信息处理。我们的研究提供了替代梯度学习的显着稳健性的系统陈述,并作为模型功能尖刺神经网络的实用指南。

著录项

  • 来源
    《Neural computation》 |2021年第4期|899-925|共27页
  • 作者

    Friedemann Zenke; Tim P.Vogels;

  • 作者单位

    Centre for Neural Circuits and Behaviour University of Oxford Oxford OX1 3SR U.K. and Friedrich Miescher Institute for Biomedical Research 4058 Basel Switzerland;

    Centre for Neural Circuits and Behaviour University of Oxford Oxford OX1 3SR U.K. and Institute for Science and Technology 3400 Klosterneuburg Austria;

  • 收录信息 美国《科学引文索引》(SCI);美国《化学文摘》(CA);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号