首页> 外文会议>Design, Automation Test in Europe Conference Exhibition >Inference of quantized neural networks on heterogeneous all-programmable devices
【24h】

Inference of quantized neural networks on heterogeneous all-programmable devices

机译:在异构全可编程设备上的量化神经网络推断

获取原文

摘要

Neural networks have established as a generic and powerful means to approach challenging problems such as image classification, object detection or decision making. Their successful employment foots on an enormous demand of compute. The quantization of network parameters and the processed data has proven a valuable measure to reduce the challenges of network inference so effectively that the feasible scope of applications is expanded even into the embedded domain. This paper describes the making of a real-time object detection in a live video stream processed on an embedded all-programmable device. The presented case illustrates how the required processing is tamed and parallelized across both the CPU cores and the programmable logic and how the most suitable resources and powerful extensions, such as NEON vectorization, are leveraged for the individual processing steps. The crafted result is an extended Darknet framework implementing a fully integrated, end-to-end solution from video capture over object annotation to video output applying neural network inference at different quantization levels running at 16 frames per second on an embedded Zynq UltraScale+ (XCZU3EG) platform.
机译:神经网络建立了一个通用的有力手段接近具有挑战性的问题,如图像分类,目标检测或决策。在计算的巨大需求他们的顺利就业脚灯。的网络参数和处理的数据的量化已经证明了宝贵的措施,以便有效的是应用的可行范围被扩展,以减少网络推断的挑战甚至到嵌入域。本文描述了一种嵌入所有可编程的设备上处理一个实时视频流的实时对象检测的决策。所呈现的情况下示出了所要求的处理是如何驯服和整个CPU内核既和可编程逻辑和如何最合适的资源和强大的扩展,如NEON矢量并行化,正在利用用于各个处理步骤。该制作的结果是一个扩展的暗网框架实现完全集成的,端 - 端在对象注释从视频捕获溶液到视频输出在不同的量化等级以每秒16帧上的嵌入式ZYNQ的UltraScale +运行应用神经网络推断(XCZU3EG)平台。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号