首页>
外国专利>
QUANTIZATION OF NEURAL NETWORK MODELS USING DATA AUGMENTATION
QUANTIZATION OF NEURAL NETWORK MODELS USING DATA AUGMENTATION
展开▼
机译:使用数据增强的神经网络模型的量化
展开▼
页面导航
摘要
著录项
相似文献
摘要
A neural network is trained at a first precision using a training dataset. The neural network is then calibrated using an augmented calibration dataset that includes a first dataset and one or more second datasets produced by modifying the first dataset. A range of values of activations of nodes in the neural network at the first precision is determined based on inputs to the neural network from the augmented calibration dataset. The activations of the nodes are then quantized to a second precision based on the range of values of the activations of the nodes at the first precision. The first precision is higher than the second precision. For example, in some cases the first precision is a 32-bit floating point precision and the second precision is an 8-bit integer precision.
展开▼