Conference: International Workshops on ISC High Performance

Bioinformatics Application with Kubeflow for Batch Processing in Clouds

Abstract

Bioinformatics pipelines make extensive use of HPC batch processing. The rapid growth of data volumes and computational complexity, especially for modern applications such as machine learning algorithms, poses significant challenges for local HPC facilities. Many attempts have been made to burst HPC batch processing into clouds with virtual machines. They all suffer from some common issues: very high overhead, slow scaling both up and down, and the near impossibility of being cloud-agnostic. We have successfully deployed and run several pipelines on Kubernetes in OpenStack, Google Cloud Platform and Amazon Web Services. In particular, we use Kubeflow on top of Kubernetes for more sophisticated job scheduling, workflow management, and first-class support for machine learning. We choose Kubeflow/Kubernetes to avoid the overhead of provisioning virtual machines, to achieve rapid scaling with containers, and to be truly cloud-agnostic. Kubeflow on Kubernetes also creates some new challenges in deployment, data access, performance monitoring, and other areas. We will discuss the details of these challenges and provide our solutions. We will demonstrate how our solutions work across all three very different clouds, both for classical pipelines and for new machine learning pipelines.