首页> 外文会议>International Conference on Big Data Analytics >Big Data Analytics Framework for Spatial Data
【24h】

Big Data Analytics Framework for Spatial Data

机译:空间数据的大数据分析框架

获取原文

摘要

In the world of mobile and Internet, large volume of data is generated with spatial components. Modern users demand fast, scalable and cost-effective solutions to perform relevant analytics on massively distributed data including spatial data. Traditional spatial data management systems are becoming less efficient to meet the current users demand due to poor scalability, limited computational power and storage. The potential approach is to develop data intensive spatial applications on parallel distributed architectures deployed on commodity clusters. The paper presents an open-source big data analytics framework to load, store, process and perform ad-hoc query processing on spatial and non-spatial data at scale. The system is built on top of Spark framework with a new input data source NoSQL database i.e. Cassandra. It is implemented by performing analytics operations like filtration, aggregation, exact match, proximity and K nearest neighbor search. It also provides an application architecture to accelerate ad-hoc query processing by diverting user queries to the suitable framework either Cassandra or Spark via a common web based REST interface. The framework is evaluated by analyzing the performance of the system in terms of latency against variable size of data.
机译:在移动和互联网的世界中,使用空间组件产生大量数据。现代用户要求快速,可扩展且经济高效的解决方案,在包括空间数据的大规模分布式数据上执行相关分析。由于可伸缩性,有限的计算能力和存储,传统的空间数据管理系统越来越高,以满足当前用户需求。潜在方法是在部署在商品集群上的并行分布式体系结构上开发数据密集型空间应用。本文提出了一个开源大数据分析框架,用于加载,存储,处理和在刻度上对空间和非空间数据执行ad-hoc查询处理。该系统基于火花框架之上,具有新的输入数据源NoSQL数据库i.E.Cassandra。它通过执行诸如过滤,聚合,精确匹配,接近度和k最近邻搜索等分析操作来实现。它还提供了一种应用架构,以通过将用户查询转移到合适的框架或通过公共Web的基于Web的REST接口将用户查询转移到合适的框架来加速ad-hoc查询处理。通过在延迟对可变数据的延迟方面分析系统的性能来评估框架。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号