首页> 外国专利> QUANTIZATION OF NEURAL NETWORK MODELS USING DATA AUGMENTATION

QUANTIZATION OF NEURAL NETWORK MODELS USING DATA AUGMENTATION

机译：使用数据增强的神经网络模型的量化

页面导航

摘要
著录项
相似文献

摘要

A neural network is trained at a first precision using a training dataset. The neural network is then calibrated using an augmented calibration dataset that includes a first dataset and one or more second datasets produced by modifying the first dataset. A range of values of activations of nodes in the neural network at the first precision is determined based on inputs to the neural network from the augmented calibration dataset. The activations of the nodes are then quantized to a second precision based on the range of values of the activations of the nodes at the first precision. The first precision is higher than the second precision. For example, in some cases the first precision is a 32-bit floating point precision and the second precision is an 8-bit integer precision.

机译：使用训练数据集，以第一精度培训神经网络。然后使用包含第一个数据集的增强校准数据集来校准神经网络，该数据集包括通过修改第一个数据集而产生的一个或多个第二数据集。基于来自增强校准数据集的神经网络的输入确定第一精度处的神经网络中的节点的激活值的范围。然后，基于第一精度的节点的激活的值的范围，将节点的激活量化为第二精度。第一精度高于第二精度。例如，在某些情况下，第一精度是32位浮点精度，第二精度是8位整数精度。

著录项

公开/公告号US2021406682A1

专利类型
公开/公告日2021-12-30

原文格式PDF
申请/专利权人 ADVANCED MICRO DEVICES INC.;
展开▼

申请/专利号US202017030021
发明设计人 RAJY MEEYAKHAN RAWTHER;MICHAEL L. SCHMIT;
展开▼

申请日2020-09-23
分类号G06N3/08;G06F17/18;
国家 US
入库时间 2022-08-24 23:07:28

相似文献

专利
外文文献
中文文献