首页> 外文会议>IEEE International Conference on Cluster Computing >Extending LDMS to Enable Performance Monitoring in Multi-core Applications
【24h】

Extending LDMS to Enable Performance Monitoring in Multi-core Applications

机译:扩展LDMS以在多核应用程序中启用性能监控

获取原文

摘要

Identifying design patterns that limit the performance of multi-core algorithms is a challenging task. There are many known methods by which threads synchronize their actions and each method may exhibit different behavior in different use cases. These use cases may vary in regards to the workload being executed, number of parallel tasks, dependencies between these tasks, and the behavior of the system scheduler. Restructuring algorithms to overcome performance limitations requires intimate knowledge on how these algorithms utilize the hardware. In our experience, we have found a lack of adequate tools to gain such knowledge. To address this, we have enhanced and implemented additional data sampler modules for OVIS's Lightweight Distributed Metric Service (LDMS) to enable scalable distributed collection of hardware performance counter data. These modules provide an interface by which LDMS can utilize the PAPI library, Linux perf tools, and RAPL to collect hardware performance data of interest. Using these samplers, we plan to monitor the intra-node behavior, including contention for node level shared resources, of multi-core applications for a diverse set of use cases. We are currently exploring how the values reported are affected by the level of concurrency, the synchronization methodologies, and progress guarantees. We hope to use this information to identify ways to restructure algorithms to increase their performance.
机译:确定限制多核算法性能的设计模式是一项艰巨的任务。有许多已知的方法可以使线程同步其动作,并且每种方法在不同的用例中可能表现出不同的行为。这些用例在执行的工作量,并行任务的数量,这些任务之间的依赖性以及系统调度程序的行为方面可能有所不同。重组算法以克服性能限制需要对这些算法如何利用硬件有深入的了解。根据我们的经验,我们发现缺少足够的工具来获取这些知识。为解决此问题,我们为OVIS的轻量级分布式度量标准服务(LDMS)增强并实施了其他数据采样器模块,以实现硬件性能计数器数据的可扩展的分布式收集。这些模块提供了一个接口,LDMS可以通过该接口利用PAPI库,Linux性能工具和RAPL来收集感兴趣的硬件性能数据。我们计划使用这些采样器,针对多种用例,监视多核应用程序的节点内行为,包括争用节点级共享资源。我们目前正在探索报告的值如何受到并发级别,同步方法和进度保证的影响。我们希望使用此信息来确定重组算法以提高其性能的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号