首页> 外文期刊>Information Sciences: An International Journal >MR-MVPP: A map-reduce-based approach for creating MVPP in data warehouses for big data applications
【24h】

MR-MVPP: A map-reduce-based approach for creating MVPP in data warehouses for big data applications

机译:MR-MVPP:一种基于地图 - 基于地图,用于在数据仓库中创建MVPP的大数据应用程序

获取原文
获取原文并翻译 | 示例
           

摘要

Materialized view selection (MVS) is the problem of selecting an appropriate set of views to be materialized to speed up analytical query processing of data warehouses. Online analytical processing (OLAP) of queries is an essential application of the MVS problem, in which, the response times of the queries are reduced by storing the selected views. Views are intermediate results of query processing and are selected in the MVS problem to be stored and will then be exploited in answering process of several queries. Views are usually organized as a view representation structure in the MVS problem. Multiple Views Processing Plan (MVPP) is a standard structure used for view representation in the MVS problem. Due to the tremendous amount of data, constructing the MVPP is a challenge in the big data applications. The MR-MVPP (Map-Reduce-based construction of the MVPP) is the proposed method of this paper to address this problem. The MR-MVPP performs a set similarity join (similarity-based join) on the base relations and views using the map-reduce model and the hashing technique. The MVPP construction time in the proposed method is reduced by avoiding redundant calculations in the process of creating the MVPP. The performance of the proposed method is empirically evaluated. According to the results of the experiments, the execution time of the MR-MVPP method is better than the other methods. The average time improvement is about 26.5 units. This improvement is better than the other similar researches in this area and is significant due to the high volume of data in real applications. Moreover, the proposed method works well in terms of the effectiveness of the created MVPP and has about a 50% coverage rate for view selection methods. Deterministic methods are more accurate than hashing methods and can be utilized for set similarity join as future work to probably improve the effectiveness of the constructed MVPP.
机译:物化视图选择(MVS)是指选择一组合适的视图进行物化,以加速数据仓库的分析查询处理。查询的联机分析处理(OLAP)是MVS问题的一个重要应用,通过存储选定的视图,可以缩短查询的响应时间。视图是查询处理的中间结果,在MVS问题中被选择存储,然后在回答多个查询的过程中被利用。在MVS问题中,视图通常被组织为视图表示结构。多视图处理计划(MVPP)是MVS问题中用于视图表示的标准结构。由于数据量巨大,在大数据应用中构建MVPP是一个挑战。MR-MVPP(基于Map-Reduce的MVPP构造)是本文提出的解决这一问题的方法。MR-MVPP使用map reduce模型和哈希技术对基本关系和视图执行一组相似性连接(基于相似性的连接)。该方法避免了创建MVPP过程中的冗余计算,从而缩短了MVPP的构建时间。对该方法的性能进行了实证评估。根据实验结果,MR-MVPP方法的执行时间优于其他方法。平均时间改善约为26.5个单位。这一改进优于该领域的其他类似研究,并且由于实际应用中的大量数据而意义重大。此外,就创建的MVPP的有效性而言,所提出的方法工作良好,并且对于视图选择方法具有大约50%的覆盖率。确定性方法比散列方法更精确,可以用于集合相似性连接,作为未来的工作,可能会提高所构造MVPP的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号