首页> 外文会议>IEEE/ACS International Conference on Computer Systems and Applications >Data placement strategy for massive data applications based on FCA approach
【24h】

Data placement strategy for massive data applications based on FCA approach

机译:基于FCA方法的海量数据应用数据放置策略

获取原文

摘要

Massive data applications such as E-science applications are characterized by complex treatments on large amounts of data which need to be stored in distributed data centers. In fact, when one task needs several datasets from different data centers, moving these data may cost a lot of time and cause energy's high consumption. Moreover, when the number of the data centers involved in the execution of tasks is high, the total data movement and the execution time increase dramatically and become a bottleneck, since the data centers have a limited bandwidth. Thus, we need a good data placement strategy to minimise the data movement between data centers and reduce the energy consumed. Indeed, many researches are concerned with data placement strategy that distributes data in ways that are advantageous for application execution. In this paper, our data placement strategy aims at grouping the maximum of data and of tasks in a minimal number of data centers. It is based on the Formal Concept Analysis approach (FCA) because its notion of a concept respects our idea since it faithfully represents a group of tasks and data that are required for their execution. It is based on four steps: 1) Hierarchical organization of tasks using Formal Concepts Analysis approach, 2) Selection of candidate concepts, 3) Assigning data in the appropriate data centers and 4) Data replication. Simulations show that our strategy can effectively reduce the data movement and the average query spans compared to the genetic approach.
机译:诸如E-science应用程序之类的海量数据应用程序具有对需要存储在分布式数据中心中的大量数据进行复杂处理的特征。实际上,当一项任务需要来自不同数据中心的多个数据集时,移动这些数据可能会花费大量时间,并导致能源的大量消耗。此外,当执行任务涉及的数据中心数量很多时,由于数据中心的带宽有限,总的数据移动和执行时间会急剧增加并成为瓶颈。因此,我们需要一个好的数据放置策略,以最大程度地减少数据中心之间的数据移动并减少能耗。实际上,许多研究都与数据放置策略有关,该策略以有利于应用程序执行的方式分发数据。在本文中,我们的数据放置策略旨在将最少数量的数据中心中的最大数据和任务分组。它基于正式概念分析方法(FCA),因为它的概念概念尊重我们的想法,因为它忠实地代表了执行任务所需的一组任务和数据。它基于四个步骤:1)使用正式概念分析方法对任务进行分层组织; 2)选择候选概念; 3)在适当的数据中心中分配数据;以及4)数据复制。仿真表明,与遗传方法相比,我们的策略可以有效地减少数据移动和平均查询范围。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号