首页> 外文会议>International Symposium on Parallel and Distributed Computing >Scalable and Resilient Workflow Executions on Production Distributed Computing Infrastructures
【24h】

Scalable and Resilient Workflow Executions on Production Distributed Computing Infrastructures

机译:生产分布式计算基础架构上的可扩展和弹性工作流程执行

获取原文

摘要

In spite of the growing interest for grids and cloud infrastructures among scientific communities and the availability of such facilities at large-scale, achieving high performance in production environments remains challenging due to at least four factors: the low reliability of very large-scale distributed computing infrastructures, the performance overhead induced by shared facilities, the difficulty to obtain fair balance of all user jobs in such an heterogeneous environment, and the complexity of large-scale distributed applications deployment. All together, these difficulties make infrastructure exploitation complex, and often limited to experts. This paper introduces a pragmatic solution to tackle these four issues based on a service-oriented methodology, the reuse of existing middleware services, and the joint exploitation of local and distributed computing resources. Emphasis is put on the integrated environment ease of use. Results on an actual neuroscience application show the impact of the environment setup in terms of reliability and performance. Recommendations and best practices are derived from this experiment.
机译:尽管科学社区之间的电网和云基础设施的兴趣日益增长,并且在大规模的这种设施的可用性的可用性,但由于至少有四个因素,在生产环境中实现高性能仍然具有挑战性:非常大的分布式计算的低可靠性基础设施,共享设施引起的性能开销,难以获得在这种异构环境中的所有用户工作的公平平衡,以及大规模分布式应用部署的复杂性。总之,这些困难使基础设施开发复杂,并经常限于专家。本文介绍了一种基于面向服务的方法,重用现有的中间件服务以及本地和分布式计算资源的联合开发来解决这四个问题的务实解决方案。重点是综合环境易用。结果实际神经科学应用程序显示环境设置在可靠性和性能方面的影响。推荐和最佳实践来自该实验。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号