首页> 外国专利> METHODS AND ALGORITHMS OF REDUCING COMPUTATION FOR DEEP NEURAL NETWORKS VIA PRUNING

METHODS AND ALGORITHMS OF REDUCING COMPUTATION FOR DEEP NEURAL NETWORKS VIA PRUNING

机译：通过修剪的深层神经网络减少计算的方法和算法

页面导航

摘要
著录项
相似文献

摘要

A method is disclosed to reduce computational load of a deep neural network. A number of multiply-accumulate (MAC) operations is determined for each layer of the deep neural network. A pruning error allowance per weight is determined based on a computational load of each layer. For each layer of the deep neural network: a threshold estimator is initialized, and weights of each layer are pruned based on a standard deviation of all weights within the layer. A pruning error per weight is determined for the layer, and if the pruning error per weight exceeds a predetermined threshold, the threshold estimator is updated for the layer the weights of the layer are repruned using the updated threshold estimator and the pruning error per weight is re-determined until the pruning error per weight is less than the threshold. The deep neural network is then retrained.

机译：公开了一种减少深度神经网络的计算负荷的方法。为深度神经网络的每一层确定了许多乘累加（MAC）操作。基于每层的计算负荷确定每单位重量的修剪误差余量。对于深度神经网络的每一层：初始化阈值估计器，并根据该层内所有权重的标准偏差来修剪每一层的权重。确定该层的每权重修剪误差，并且如果每权重的修剪误差超过预定阈值，则为该层更新阈值估计器，使用更新的阈值估计器重新修剪该层的权重，并且每权重的修剪误差为重新确定，直到每单位重量的修剪误差小于阈值为止。然后重新训练深度神经网络。

著录项

公开/公告号US2019050735A1

专利类型
公开/公告日2019-02-14

原文格式PDF
申请/专利权人 SAMSUNG ELECTRONICS CO. LTD.;
展开▼

申请/专利号US201715724267
发明设计人 ZHENGPING JI;JOHN WAKEFIELD BROTHERS;WEIRAN DENG;GEORGIOS GEORGIADIS;
展开▼

申请日2017-10-03
分类号G06N3/08;G06N3/04;
国家 US
入库时间 2022-08-21 12:07:58

相似文献

专利
外文文献
中文文献