首页> 外文会议>IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing >SOL: Effortless Device Support for AI Frameworks without Source Code Changes
【24h】

SOL: Effortless Device Support for AI Frameworks without Source Code Changes

机译:SOL:不更改源代码即可轻松支持AI框架的设备

获取原文

摘要

Modern high performance computing clusters heavily rely on accelerators to overcome the limited compute power of CPUs. These supercomputers run various applications from different domains such as simulations, numerical applications or artificial intelligence (AI). As a result, vendors need to be able to efficiently run a wide variety of workloads on their hardware.In the AI domain this is in particular exacerbated by the existance of a number of popular frameworks (e.g, PyTorch, TensorFlow, etc.) that have no common code base, and can vary in functionality. The code of these frameworks evolves quickly, making it expensive to keep up with all changes and potentially forcing developers to go through constant rounds of upstreaming.In this paper we explore how to provide hardware support in AI frameworks without changing the framework’s source code in order to minimize maintenance overhead. We introduce SOL, an AI acceleration middleware that provides a hardware abstraction layer that allows us to transparently support heterogenous hardware. As a proof of concept, we implemented SOL for PyTorch with three backends: CPUs, GPUs and vector processors.
机译:现代高性能计算集群严重依赖加速器来克服CPU有限的计算能力。这些超级计算机可以运行来自不同领域的各种应用程序,例如仿真,数值应用程序或人工智能(AI)。因此,供应商需要能够在其硬件上有效地运行各种各样的工作负载。在AI领域,由于存在许多流行的框架(例如PyTorch,TensorFlow等),这种情况尤其恶化了。没有通用的代码库,并且功能可能有所不同。这些框架的代码发展迅速,使其跟上所有更改的成本很高,并可能迫使开发人员不断进行上游轮回。本文探讨了如何在不按顺序更改框架源代码的情况下在AI框架中提供硬件支持。以最大程度地减少维护费用。我们介绍SOL,这是一种AI加速中间件,它提供了硬件抽象层,使我们能够透明地支持异构硬件。作为概念验证,我们为PyTorch实施了SOL,具有三个后端:CPU,GPU和矢量处理器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号