Conference: IEEE International Congress on Big Data

BigExcel: A web-based framework for exploring big data in social sciences

Abstract

This paper argues that three fundamental challenges must be overcome to foster the adoption of big data technologies in disciplines outside computer science: making such technologies accessible to non-computer scientists, supporting ad hoc exploration of large data sets with minimal effort, and providing lightweight web-based frameworks for quick and easy analytics. We address these three challenges through the development of 'BigExcel', a three-tier web-based framework for exploring big data that facilitates the management of user interactions with large data sets, the construction of queries to explore the data set, and the management of the infrastructure. The feasibility of BigExcel is demonstrated using two Yahoo Sandbox datasets. The first is the Yahoo Buzz Score data set, which we use for quantitatively predicting trending technologies; the second is the Yahoo n-gram corpus, which we use for qualitatively inferring the coverage of important events. A demonstration of the BigExcel framework and its source code are available at http://bigdata.cs.st-andrews.ac.uk/projects/bigexcel-exploring-big-data-for-social-sciences/.
