首页> 外文期刊>International journal of distributed systems and technologies >Aras: A Method with Uniform Distributed Dataset to Solve Data Warehouse Problems for Big Data
【24h】

Aras: A Method with Uniform Distributed Dataset to Solve Data Warehouse Problems for Big Data

机译:Aras:一种具有统一分布式数据集的方法来解决大数据的数据仓库问题

获取原文
获取原文并翻译 | 示例
           

摘要

Because of to the high rate of data growth and the need for data analysis, data warehouse management for big data is an important issue. Single node solutions cannot manage the large amount of information. Information must be distributed over multiple hardware nodes. Nevertheless, data distribution over nodes causes each node to need data from other nodes to execute a query. Data exchange among nodes creates problems, such as the joins between data segments that exist on different nodes, network congestion, and hardware node wait for data reception. In this paper, the Aras method is proposed. This method is a MapReduce-based method that introduces a data set on each mapper. By applying this method, each mapper node can execute its query independently and without need to exchange data with other nodes. Node independence solves the aforementioned data distribution problems. The proposed method has been compared with prominent data warehouses for big data, and the Aras query execution time was much lower than other methods.%Big Data;MapReduce;Data Warehouse;Data Locality
机译:由于数据的高速增长和数据分析的需求,大数据的数据仓库管理是一个重要的问题。单节点解决方案无法管理大量信息。信息必须分布在多个硬件节点上。然而,节点上的数据分布导致每个节点需要来自其他节点的数据来执行查询。节点之间的数据交换会产生问题,例如存在于不同节点上的数据段之间的联接,网络拥塞以及硬件节点等待数据接收。本文提出了Aras方法。此方法是基于MapReduce的方法,它在每个映射器上引入一个数据集。通过应用此方法,每个映射器节点都可以独立执行其查询,而无需与其他节点交换数据。节点独立性解决了上述数据分发问题。将该方法与著名的大数据仓库进行了比较,Aras查询的执行时间比其他方法要短得多。%大数据; MapReduce;数据仓库;数据局部性

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号