首页> 外文会议>IEEE International Conference on Big Data >ECL-watch: A big data application performance tuning tool in the HPCC systems platform
【24h】

ECL-watch: A big data application performance tuning tool in the HPCC systems platform

机译:ECL-watch:HPCC系统平台中的大数据应用程序性能调整工具

获取原文

摘要

The proliferation of Big Data processing environments such as Hadoop, Apache Spark, and HPCC Systems is driving the development of performance analysis tools in these distributed systems. The goal is to achieve high performance through the optimization of Big Data applications. However, tuning performance in a fine-grained manner is quite challenging due to the high complexity and massive size of the distributed systems. ECL-Watch is a data-flow based fine-grained comprehensive Big Data performance analysis tool utilizing the high level declarative dataflow programming language ECL in HPCC Systems. As a case study, we implement and optimize the Yinyang K-Means machine learning algorithm in ECL in HPCC Systems. The experimental results show that the performance of the native ECL version of the Yinyang K-Means algorithm increased significantly after tuning: from being about three times slower than the standard K-Means implementation in ECL, to become roughly 15% faster than standard K-Means.
机译:大数据处理环境(例如Hadoop,Apache Spark和HPCC Systems)的激增正在推动这些分布式系统中性能分析工具的开发。目标是通过优化大数据应用程序来实现高性能。但是,由于分布式系统的高复杂性和庞大的规模,以细粒度的方式进行性能调整非常具有挑战性。 ECL-Watch是基于数据流的细粒度综合大数据性能分析工具,它利用HPCC Systems中的高级声明性数据流编程语言ECL。作为案例研究,我们在HPCC系统的ECL中实现并优化了Yinyang K-Means机器学习算法。实验结果表明,经过调整后,阴阳K-Means算法的本机ECL版本的性能显着提高:从ECL中的标准K-Means实现慢约三倍,到比标准K快约15%。 -方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号