International Computer Symposium

The Brain Memory Architecture HW/SW Co-Design Platform with Adaptive CNN Algorithm



Abstract

As demand for machine learning, edge computing, and Internet of Things technology increases, computing efficiency and energy consumption have become important criteria when choosing a computing platform. Although the graphics processing unit (GPU) offers a high degree of parallel computing capability, its energy consumption is large and its data transfers are limited by system bus bandwidth. Our laboratory therefore previously proposed the Brain Memory Architecture, a prototype that integrates an FPGA and memory into a single computing architecture; it offers high-efficiency, low-power computation and does not require data exchange over the system bus. Building on this prototype, this paper constructs the Brain Memory Architecture HW/SW Co-Design Platform (BMCD platform), which provides a user-friendly interface so that users can easily set up a hardware/software co-design computing environment. The library supplied by the platform establishes data transmission and computation between the acceleration hardware and memory, overcoming the bandwidth limitation of the traditional system bus. The platform provides an AXI4-Stream interconnect core as a standard interface for data handshaking with the acceleration hardware, which reduces user design complexity while preserving scalability for connecting other computing IP cores. For platform evaluation, we design an adaptive CNN algorithm for the co-design platform: a data quantization method reduces the bit width of data, lowering the required data bandwidth and storage space, and a dynamic adjustment algorithm for the integer/fraction bit ratio corrects the accuracy and design problems that quantization may cause. With this adaptive CNN algorithm architecture and the BMCD platform, we construct a fast data transmission path. Finally, this paper compares the weight transmission time of different CNN models against the CPU and the GPU.
The proposed method is about 20 times faster than the CPU and about 10 times faster than the GPU.
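The abstract's "dynamic adjustment algorithm for integer and decimal ratios" suggests a fixed-point scheme that splits a fixed bit budget between integer and fraction bits based on the data's dynamic range. The paper does not give its exact algorithm, so the sketch below is only an illustration of that general idea: the function names and the 8-bit signed format are assumptions, not the authors' implementation.

```python
import numpy as np

def quantize_dynamic_fixed_point(weights, total_bits=8):
    """Quantize a float array to signed fixed-point, choosing the
    integer/fraction bit split from the data's dynamic range.
    A sketch of the 'dynamic ratio adjustment' idea, not the
    paper's actual algorithm."""
    # Integer bits needed to cover the largest magnitude in the data.
    max_abs = float(np.max(np.abs(weights)))
    int_bits = max(0, int(np.ceil(np.log2(max_abs + 1e-12))))
    frac_bits = total_bits - 1 - int_bits  # one bit reserved for the sign
    scale = 2.0 ** frac_bits
    # Round to the nearest representable value and clip to the signed range.
    qmin, qmax = -(2 ** (total_bits - 1)), 2 ** (total_bits - 1) - 1
    q = np.clip(np.round(weights * scale), qmin, qmax).astype(np.int8)
    return q, frac_bits

def dequantize(q, frac_bits):
    """Recover approximate float weights from the fixed-point codes."""
    return q.astype(np.float32) / (2.0 ** frac_bits)
```

For example, weights with maximum magnitude 1.5 get 1 integer bit and 6 fraction bits out of an 8-bit word, so values such as 0.5 and -0.25 are represented exactly while the bit width (and thus the transfer bandwidth) stays fixed.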
