首页> 外文会议>IEEE International conference on cluster computing >It takes a village: Monitoring the blue waters supercomputer
【24h】

It takes a village: Monitoring the blue waters supercomputer

机译:花村:监控碧水超级计算机

获取原文

摘要

The performance of science applications on modern HPC equipment depends on many factors. Architectural features, individual hardware characteristics, and scheduler traits all have an impact on how a particular application performs, not only in isolation but when run in concert with other user applications. Being able to correlate system events and conditions at particular times can give insight into causes of good or bad performance. Unfortunately, the information we seek is not necessarily in a readily accessible form. The problem at hand is how to enable efficient query of the raw data and flexible graphical representation of the results. Web applications that access an underlying database serve this sort of functionality for many science applications quite well. Our scenario of data access is not very different. The data collected for a large HPC environment is complex and grows in size with time. This aspect is different from applications that deal with more static data. It is the dynamic nature of the data that make the problem interesting. In this work we present our approach for the analysis and visualization of HPC system performance data based on database access and web based graphical presentation. We discuss the details of how data is collected and processed from raw logs into the database, how queries are formulated, and how the data are graphically displayed. This process includes dynamic formulation of the queries. Finally we discuss how the system is utilized to analyze system performance.
机译:现代HPC设备上科学应用程序的性能取决于许多因素。架构功能,单独的硬件特征和调度程序特征都对特定应用程序的性能产生影响,不仅是孤立地而且与其他用户应用程序一起运行时。能够在特定时间关联系统事件和条件可以洞悉性能好坏的原因。不幸的是,我们寻求的信息不一定是易于获取的形式。当前的问题是如何实现对原始数据的高效查询和结果的灵活图形表示。访问基础数据库的Web应用程序可以很好地为许多科学应用程序提供这种功能。我们的数据访问方案没有太大不同。在大型HPC环境中收集的数据非常复杂,并且随着时间的推移会增加。这方面与处理更多静态数据的应用程序不同。正是数据的动态性质使问题变得有趣。在这项工作中,我们介绍了基于数据库访问和基于Web的图形表示来分析和可视化HPC系统性能数据的方法。我们讨论了如何从原始日志收集和处理数据到数据库,如何制定查询以及如何以图形方式显示数据的细节。该过程包括动态制定查询。最后,我们讨论了如何利用系统来分析系统性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号