首页> 外文会议>IEEE International Parallel and Distributed Processing Symposium >Understanding and Modeling Lossy Compression Schemes on HPC Scientific Data
【24h】

Understanding and Modeling Lossy Compression Schemes on HPC Scientific Data

机译:了解和建模HPC科学数据上的有损压缩方案

获取原文

摘要

Scientific simulations generate large amounts of floating-point data, which are often not very compressible using the traditional reduction schemes, such as deduplication or lossless compression. The emergence of lossy floating-point compression holds promise to satisfy the data reduction demand from HPC applications; however, lossy compression has not been widely adopted in science production. We believe a fundamental reason is that there is a lack of understanding of the benefits, pitfalls, and performance of lossy compression on scientific data. In this paper, we conduct a comprehensive study on state-of-the-art lossy compression, including ZFP, SZ, and ISABELA, using real and representative HPC datasets. Our evaluation reveals the complex interplay between compressor design, data features and compression performance. The impact of reduced accuracy on data analytics is also examined through a case study of fusion blob detection, offering domain scientists with the insights of what to expect from fidelity loss. Furthermore, the trial and error approach to understanding compression performance involves substantial compute and storage overhead. To this end, we propose a sampling based estimation method that extrapolates the reduction ratio from data samples, to guide domain scientists to make more informed data reduction decisions.
机译:科学模拟会生成大量的浮点数据,使用传统的还原方案(例如重复数据删除或无损压缩)通常无法很好地压缩这些浮点数据。有损浮点压缩的出现有望满足HPC应用程序对数据缩减的需求。但是,有损压缩尚未在科学生产中广泛采用。我们认为,一个根本原因是缺乏对科学数据的有损压缩的好处,陷阱和性能的了解。在本文中,我们使用真实的和代表性的HPC数据集对包括ZFP,SZ和ISABELA在内的最新有损压缩进行了全面研究。我们的评估揭示了压缩机设计,数据功能和压缩性能之间的复杂相互作用。通过融合斑点检测的案例研究,还研究了准确性降低对数据分析的影响,从而为领域科学家提供了对保真度损失的预期见解。此外,了解压缩性能的反复试验方法涉及大量的计算和存储开销。为此,我们提出了一种基于采样的估计方法,该方法可从数据样本中推断出减少率,以指导领域科学家做出更明智的数据减少决策。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号