首页> 外文会议>2nd international workshop on data-aware distributed computing 2009 >Abstract Storage: Moving File Format-Specific Abstractions into Petabyte-Scale Storage Systems
【24h】

Abstract Storage: Moving File Format-Specific Abstractions into Petabyte-Scale Storage Systems

机译:抽象存储:将特定于文件格式的抽象移动到PB级存储系统中

获取原文

摘要

High-end computing is increasingly I/O bound as computations become more data-intensive, and data transport technologies struggle to keep pace with the demands of large-scale, distributed computations. One approach to avoiding unnecessary I/O is to move the processing to the data, as seen in Google's successful, but relatively specialized, MapReduce system. This paper discusses our investigation towards a general solution for enabling in-situ computation in a peta-scale storage system. We believe our work with flexible, application-specific structured storage is the key to addressing the I/O overhead caused by data partitioning across storage nodes. In order to manage competing workloads on storage nodes, our research in system performance management is leveraged. Our ultimate goal is a general framework for in-situ data-intensive processing, indexing, and searching, which we expect to provide orders of magnitude performance increases for data-intensive workloads.
机译:高端计算越来越多地绑定,因为计算变得更加数据密集型,数据传输技术与大规模分布式计算的需求保持速度。避免不必要的I / O的一种方法是将处理移动到数据,如谷歌的成功,而且相对专业化的MapReduce系统所见。本文讨论了我们对普及尺度存储系统启用原位计算的一般解决方案的调查。我们相信我们使用灵活的应用程序特定的结构化存储的工作是解决跨存储节点数据分区引起的I / O开销的关键。为了管理存储节点上的竞争工作负载,我们在系统性能管理中的研究将利用。我们的最终目标是出于原位数据密集型处理,索引和搜索的一般框架,我们希望为数据密集型工作负载提供数量级性能的顺序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号