【24h】

Accelerating Big Data Applications on Tiered Storage System with Various Eviction Policies

机译:通过各种驱逐策略加速分层存储系统的大数据应用

获取原文

摘要

Utilizing new type devices, such as SSD, to improve I/O performance of hybrid storage has become a tendency recently. Many efforts are made to apply the new type devices to hybrid storage in distributed environment, but most of them are confined to the specific file systems, such as HDFS. Besides, the low performance of HDFS descends the performance of hybrid storage. In this paper, we improve the performance of tiered storage system (one kind of hybrid storage system) in distributed environment with a plughable eviction framework considering that the data on each node is regularly accessed. On top of the eviction framework, we provide a couple of eviction policies, including LRU, LRFU, LIRS and ARC, covering different access patterns to accelerate the upper big data applications. Moreover, our design is general for all tiered storage systems. Then we evaluate the performance of our eviction framework through three widely-used big data applications and discover that LIRS can improve 30% hit ratio than most of other policies when running KMeans and PageRank; ARC can improve maximum 30% hit ratio than other policies when running complicated SQL applications; LRFU can always achieve relatively good performance when the configuration properties are set in reasonable range. We have implemented our prototype on Alluxio, which is a widely-used memory-centric distributed storage system. In addition, these eviction policies contributed by us have been merged into Alluxio and are already being in use.
机译:利用新型设备,如SSD,提高混合储存的I / O性能已成为最近的趋势。许多努力将新型设备应用于分布式环境中的混合存储,但其中大多数被限制在特定文件系统(例如HDF)中。此外,HDF的低性能下降了混合储存的性能。在本文中,考虑到定期访问每个节点上的数据,我们提高分布式环境(一种混合存储系统)的分布式环境中的分布式环境中的性能。在驱逐框架之上,我们提供了一些驱逐政策,包括LRU,LRFU,LIR和ARC,涵盖了不同的访问模式,以加速大数据应用。此外,我们的设计对于所有分层存储系统是通用的。然后,我们通过三种广泛使用的大数据应用评估我们的驱逐框架的表现,并发现LIRS在运行邮件和Pagerank时的大多数其他策略可以提高30%的命中比率;在运行复杂的SQL应用程序时,ARC可以提高比其他策略的最高比率比其他策略更高。当配置属性设置在合理范围内时,LRFU可以始终实现相对良好的性能。我们在Alluxio上实现了我们的原型,这是一个广泛使用的内存中指的分布式存储系统。此外,我们所贡献的这些驱逐政策已被合并为Alluxio并已在使用中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号