ACM/ESDA/IEEE Design Automation Conference

SpWA: An Efficient Sparse Winograd Convolutional Neural Networks Accelerator on FPGAs

Abstract

FPGAs have been efficient accelerators for CNN inference due to their high performance, flexibility, and energy efficiency. To improve the performance of CNNs on FPGAs, fast algorithms and sparse methods have emerged as the most attractive alternatives, both of which can effectively reduce the complexity of CNNs. With fast algorithms, the feature maps are transformed into a special domain to reduce the arithmetic complexity. On the other hand, compressing CNN models by pruning unimportant connections reduces both storage and arithmetic complexity. In this paper, we introduce the sparse Winograd convolution accelerator (SpWA), which combines these two orthogonal approaches on FPGAs. First, we employ a novel dataflow that rearranges the filter layout in Winograd convolution. Then we design an efficient architecture to implement SpWA using a line-buffer design and a Compressed Sparse Column (CSC) format-based processing element. Finally, we propose an efficient algorithm based on dynamic programming to balance the computation among different processing elements. Experimental results on the VGG16 and YOLO networks show a 2.9x-3.1x speedup compared with the state-of-the-art technique.
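As a rough illustration of the two ideas the abstract combines (transforming each input tile into the Winograd domain, then multiplying it only against the nonzeros of a pruned filter stored in a CSC-like layout), below is a minimal NumPy sketch. It assumes the standard Winograd F(2x2, 3x3) transform matrices; the pruning threshold and helper names (winograd_tile, to_csc, sparse_eltwise) are hypothetical, and the sketch does not reproduce the paper's dataflow, line-buffer architecture, or dynamic-programming load balancing.

```python
# Minimal sketch: Winograd F(2x2, 3x3) convolution on one tile, plus a
# CSC-style sparse element-wise multiply for a pruned Winograd-domain filter.
# Illustrative only; not the SpWA hardware design.
import numpy as np

# Standard F(2x2, 3x3) transform matrices.
B_T = np.array([[1, 0, -1, 0],
                [0, 1,  1, 0],
                [0, -1, 1, 0],
                [0, 1,  0, -1]], dtype=np.float32)
G = np.array([[1.0,  0.0, 0.0],
              [0.5,  0.5, 0.5],
              [0.5, -0.5, 0.5],
              [0.0,  0.0, 1.0]], dtype=np.float32)
A_T = np.array([[1, 1,  1,  0],
                [0, 1, -1, -1]], dtype=np.float32)

def winograd_tile(d, g):
    """2x2 output tile from a 4x4 input tile d and a 3x3 filter g."""
    U = G @ g @ G.T          # filter in the Winograd domain (4x4)
    V = B_T @ d @ B_T.T      # input tile in the Winograd domain (4x4)
    M = U * V                # 16 element-wise multiplies instead of 36 MACs
    return A_T @ M @ A_T.T   # inverse transform to the 2x2 output tile

def to_csc(U, threshold=0.05):
    """Prune small Winograd-domain weights, store nonzeros column by column."""
    U = np.where(np.abs(U) < threshold, 0.0, U)
    values, row_idx, col_ptr = [], [], [0]
    for c in range(U.shape[1]):
        nz = np.nonzero(U[:, c])[0]
        values.extend(U[nz, c])
        row_idx.extend(nz)
        col_ptr.append(len(values))
    return np.array(values), np.array(row_idx), np.array(col_ptr)

def sparse_eltwise(values, row_idx, col_ptr, V):
    """Element-wise multiply that only visits the stored nonzeros."""
    M = np.zeros_like(V)
    for c in range(V.shape[1]):
        for k in range(col_ptr[c], col_ptr[c + 1]):
            M[row_idx[k], c] = values[k] * V[row_idx[k], c]
    return M

# Sanity check against direct correlation on one tile.
d = np.random.randn(4, 4).astype(np.float32)
g = np.random.randn(3, 3).astype(np.float32)
ref = np.array([[np.sum(d[i:i+3, j:j+3] * g) for j in range(2)]
                for i in range(2)])
assert np.allclose(winograd_tile(d, g), ref, atol=1e-4)

# With a zero threshold the CSC path keeps every weight and matches exactly.
U, V = G @ g @ G.T, B_T @ d @ B_T.T
vals, rows, ptrs = to_csc(U, threshold=0.0)
assert np.allclose(A_T @ sparse_eltwise(vals, rows, ptrs, V) @ A_T.T, ref, atol=1e-4)
```

In the accelerator described by the abstract, the element-wise multiply is the step that benefits from sparsity: once the transformed filters are pruned, only their nonzero entries need to be stored and multiplied, which is what the CSC-based processing elements exploit.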
