首页> 外文会议>International Conference on Networking, Architecture, and Storage >Group-by Query Process in Middleware of Large Scale Data Intensive Systems
【24h】

Group-by Query Process in Middleware of Large Scale Data Intensive Systems

机译:组 - 通过大规模数据密集系统中间件查询过程

获取原文

摘要

Large scale data intensive systems are available in many fields in recent years, and itpsilas a severe challenge for group-by query of large volume of data in a cluster based on shared-nothing architecture. This paper proposes a design of a parallel query engine (PQE) and its asynchronous improvement (APQE) for group-by queries. PQE and APQE support for pipelined query processing and develop maximum degree of pipeline parallelism. APQE further eliminates the synchronous overhead of multi nodes parallelism, and returns part of final result as early as possible if no data dependency exists. Experimental results demonstrate that, compared to previous 2-step query engine, PQE and APQE can make a significant performance improvement for group-by query of large data sets in a shared-nothing cluster system, as well as obviously better scalability.
机译:近年来许多领域的大规模数据密集型系统可在许多领域获得,并且ITPSilas基于共享无线架构的集群中大量数据查询的群体严重挑战。本文提出了一种单独的查询引擎(PQE)和其异步改进(APQE)的设计,用于逐个查询。 PQE和APQE支持流水线查询处理,并制定最大程度的管道并行度。 APQE进一步消除了多节点并行性的同步开销,如果不存在数据依赖性,则尽早返回最终结果的一部分。实验结果表明,与之前的2步查询引擎相比,PQE和APQE可以对分组的大数据集进行大规模的性能改进,并且在共享的集群系统中,以及明显更好的可扩展性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号