...
首页> 外文期刊>Journal of database management >Map-Side Join Processing of SPARQL Queries Based on Abstract RDF Data Filtering
【24h】

Map-Side Join Processing of SPARQL Queries Based on Abstract RDF Data Filtering

机译:基于抽象RDF数据过滤的SPARQL查询的地图侧联接处理

获取原文
获取原文并翻译 | 示例
           

摘要

The amount of RDF data being published on the Web is increasing at a massive rate. MapReduce-based distributed frameworks have become the general trend in processing SPARQL queries against RDF data. Currently, query processing systems that use MapReduce have not been able to keep up with the increase of semantic annotated data, resulting in non-interactive SPARQL query processing. The principal reason is that intermediate query results from join operations in a MapReduce framework are so massive that they consume all available network bandwidth. In this article, the authors present an efficient SPARQL processing system that uses MapReduce and HBase. The system runs a job optimized query plan using their proposed abstract RDF data to decrease the number of jobs and also decrease the amount of input data. The authors also present an efficient algorithm of using Map-side joins while also using the abstract RDF data to filter out unneeded RDF data. Experimental results show that the proposed approach demonstrates better performance when processing queries with a large amount of input data than those found in previous works.
机译:在Web上发布的RDF数据量正以惊人的速度增长。基于MapReduce的分布式框架已成为处理针对RDF数据的SPARQL查询的普遍趋势。当前,使用MapReduce的查询处理系统无法跟上语义注释数据的增长,从而导致非交互式SPARQL查询处理。主要原因是,MapReduce框架中的联接操作产生的中间查询结果非常庞大,以至于它们消耗了所有可用的网络带宽。在本文中,作者提出了一种使用MapReduce和HBase的高效SPARQL处理系统。该系统使用其建议的抽象RDF数据运行作业优化的查询计划,以减少作业数量并减少输入数据量。作者还提出了一种使用Map-side联接的有效算法,同时还使用抽象RDF数据过滤掉不需要的RDF数据。实验结果表明,与以前的工作相比,该方法在处理带有大量输入数据的查询时表现出更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号