首页> 外文会议>International Conference On Intelligent Computing in Data Sciences >HW/SW Co-Design of Cost-Efficient CNN Inference for Cognitive IoT
【24h】

HW/SW Co-Design of Cost-Efficient CNN Inference for Cognitive IoT

机译:HW / SW共同设计成本高效的CNN推论认知物联网

获取原文

摘要

Cognitive Internet of things (IoT) is a novel paradigm that outfits the contemporary IoT with a “brain” to impart high-level intelligence. Convolutional neural networks (CNNs) are an integral part of cognitive IoT that support inference and decision-making. In this paper, we demonstrate a resource-efficient hardware/software (HW/SW) co-design of a CNN architecture for cognitive IoT. We only offload image-tocolumn (im2col) and general matrix multiply (GEMM), which are the most time- and energy-consuming part of convolution layer operations, to the field-programmable gate array (FPGA)-based accelerator. We also exploit the parallelism in the operations of convolution layers to efficiently hide a non-negligible portion of execution time required for bias and activation. Experimental results demonstrate the resource, performance, and energy efficiency of our HW/SW co-design. Results indicate a speedup of $1.3mathrm{X} sim 2.0mathrm{X}$ and energy reduction of 19.4%$sim 44.3$% as compared to using only a general-purpose processor.
机译:认知物联网(物联网)是一种新颖的范式,它与“大脑”推荐了“大脑”来推动高级智能。卷积神经网络(CNNS)是支持推理和决策的认知物联网的组成部分。在本文中,我们展示了用于认知物联网的CNN架构的资源有效的硬件/软件(HW / SW)共同设计。我们仅卸载图像 - 致映像(IM2COL)和一般矩阵乘以(GEMM),这是卷积层操作的最具时间和耗能的部分,到现场可编程门阵列(FPGA)的加速器。我们还利用了卷积层的操作中的并行性,以有效地隐藏偏差和激活所需的执行时间的不可忽略部分。实验结果展示了我们HW / SW Co-Design的资源,性能和能源效率。结果表明加速1.3 mathrm {x} sim 2.0 mathrm {x} $和能量减少19.4%$ sim 44.3 $%,只使用通用处理器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号