...
International Journal of High Performance Computing and Networking

Implementation of a deduplication cache mechanism using content-defined chunking

Abstract

Many application programs in data-intensive science read and write large files. These large files consume significant memory because their data is loaded into the page cache. Because memory is a critical resource in data-intensive computing, reducing the memory footprint of file data is essential. In this paper, we propose a cache deduplication mechanism with content-defined chunking (CDC) for the Gfarm distributed file system. CDC divides a file into variable-size blocks (chunks) based on the file's contents. The client stores the chunks in its local file system as cache files and reuses them during subsequent file accesses. Deduplicating chunks reduces the amount of data transmitted between clients and servers as well as storage and memory requirements. The experimental results demonstrate that the proposed mechanism significantly improves the performance of file-read operations and that introducing parallelism reduces the overhead of file-write operations.
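To make the mechanism concrete, below is a minimal sketch, not the paper's implementation, of content-defined chunking using a Gear-style rolling hash, paired with a content-addressed chunk cache keyed by SHA-256 in the local file system. All parameters and names (MIN_SIZE, MAX_SIZE, MASK, GEAR, ChunkCache, the /tmp/chunk_cache path) are illustrative assumptions.

import hashlib
import os
import random

# Illustrative parameters (assumptions, not from the paper): chunks are at
# least MIN_SIZE and at most MAX_SIZE bytes; a 13-bit boundary mask gives an
# expected chunk size of roughly MIN_SIZE + 8 KiB.
MIN_SIZE = 2 * 1024
MAX_SIZE = 64 * 1024
MASK = (1 << 13) - 1

# Gear table: one fixed pseudo-random 64-bit value per possible byte value.
_rng = random.Random(42)
GEAR = [_rng.getrandbits(64) for _ in range(256)]

def cdc_chunks(data: bytes):
    """Yield variable-size chunks whose boundaries depend only on content,
    so an edit early in a file shifts boundaries only near the edit."""
    start, n = 0, len(data)
    while start < n:
        h = 0
        end = min(start + MAX_SIZE, n)
        cut = end                       # fall back to MAX_SIZE (or EOF)
        for i in range(start, end):
            h = ((h << 1) + GEAR[data[i]]) & 0xFFFFFFFFFFFFFFFF
            if i + 1 - start >= MIN_SIZE and (h & MASK) == 0:
                cut = i + 1             # content-defined boundary found
                break
        yield data[start:cut]
        start = cut

class ChunkCache:
    """Content-addressed local store: each chunk is written once under the
    name of its SHA-256 digest, so identical chunks are deduplicated."""
    def __init__(self, root: str):
        self.root = root
        os.makedirs(root, exist_ok=True)

    def put(self, chunk: bytes) -> str:
        digest = hashlib.sha256(chunk).hexdigest()
        path = os.path.join(self.root, digest)
        if not os.path.exists(path):    # skip chunks already cached
            with open(path, "wb") as f:
                f.write(chunk)
        return digest

    def get(self, digest: str) -> bytes:
        with open(os.path.join(self.root, digest), "rb") as f:
            return f.read()

if __name__ == "__main__":
    cache = ChunkCache("/tmp/chunk_cache")             # hypothetical cache dir
    data = os.urandom(256 * 1024)
    recipe = [cache.put(c) for c in cdc_chunks(data)]  # file -> digest list
    assert b"".join(cache.get(d) for d in recipe) == data

Under this scheme a file is represented by its ordered list of chunk digests; on a subsequent access the client need only fetch the digests missing from its local cache, which is what reduces both the data transmitted between clients and servers and the local storage and memory footprint.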
