首页> 外国专利> METHODS AND ALGORITHMS OF REDUCING COMPUTATION FOR DEEP NEURAL NETWORKS VIA PRUNING

METHODS AND ALGORITHMS OF REDUCING COMPUTATION FOR DEEP NEURAL NETWORKS VIA PRUNING

机译:通过修剪的深层神经网络减少计算的方法和算法

摘要

A method is disclosed to reduce computational load of a deep neural network. A number of multiply-accumulate (MAC) operations is determined for each layer of the deep neural network. A pruning error allowance per weight is determined based on a computational load of each layer. For each layer of the deep neural network: a threshold estimator is initialized, and weights of each layer are pruned based on a standard deviation of all weights within the layer. A pruning error per weight is determined for the layer, and if the pruning error per weight exceeds a predetermined threshold, the threshold estimator is updated for the layer the weights of the layer are repruned using the updated threshold estimator and the pruning error per weight is re-determined until the pruning error per weight is less than the threshold. The deep neural network is then retrained.
机译:公开了一种减少深度神经网络的计算负荷的方法。为深度神经网络的每一层确定了许多乘累加(MAC)操作。基于每层的计算负荷确定每单位重量的修剪误差余量。对于深度神经网络的每一层:初始化阈值估计器,并根据该层内所有权重的标准偏差来修剪每一层的权重。确定该层的每权重修剪误差,并且如果每权重的修剪误差超过预定阈值,则为该层更新阈值估计器,使用更新的阈值估计器重新修剪该层的权重,并且每权重的修剪误差为重新确定,直到每单位重量的修剪误差小于阈值为止。然后重新训练深度神经网络。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号