...
首页> 外文期刊>Quality Control, Transactions >Toward Scalable Systems for Big Data Analytics: A Technology Tutorial
【24h】

Toward Scalable Systems for Big Data Analytics: A Technology Tutorial

机译:迈向大数据分析的可扩展系统:技术指南

获取原文
获取原文并翻译 | 示例
           

摘要

Recent technological advancements have led to a deluge of data from distinctive domains (e.g., health care and scientific sensors, user-generated data, Internet and financial companies, and supply chain systems) over the past two decades. The term big data was coined to capture the meaning of this emerging trend. In addition to its sheer volume, big data also exhibits other unique characteristics as compared with traditional data. For instance, big data is commonly unstructured and require more real-time analysis. This development calls for new system architectures for data acquisition, transmission, storage, and large-scale data processing mechanisms. In this paper, we present a literature survey and system tutorial for big data analytics platforms, aiming to provide an overall picture for nonexpert readers and instill a do-it-yourself spirit for advanced audiences to customize their own big-data solutions. First, we present the definition of big data and discuss big data challenges. Next, we present a systematic framework to decompose big data systems into four sequential modules, namely data generation, data acquisition, data storage, and data analytics. These four modules form a big data value chain. Following that, we present a detailed survey of numerous approaches and mechanisms from research and industry communities. In addition, we present the prevalent Hadoop framework for addressing big data challenges. Finally, we outline several evaluation benchmarks and potential research directions for big data systems.
机译:在过去的二十年中,最新的技术进步导致大量来自不同领域的数据(例如,医疗保健和科学传感器,用户生成的数据,互联网和金融公司以及供应链系统)。创造大数据一词是为了捕捉这种新兴趋势的含义。除了庞大的数据量外,与传统数据相比,大数据还具有其他独特的特征。例如,大数据通常是非结构化的,需要更多的实时分析。这种发展需要用于数据采集,传输,存储和大规模数据处理机制的新系统架构。在本文中,我们提供了针对大数据分析平台的文献调查和系统教程,旨在为非专业读者提供整体图片,并为高级受众灌输自己动手的精神,以定制他们自己的大数据解决方案。首先,我们介绍大数据的定义并讨论大数据的挑战。接下来,我们提供一个系统的框架,将大数据系统分解为四个顺序的模块,即数据生成,数据采集,数据存储和数据分析。这四个模块构成了大数据价值链。接下来,我们对研究和行业界的众多方法和机制进行了详细调查。此外,我们介绍了用于解决大数据挑战的流行Hadoop框架。最后,我们概述了大数据系统的几个评估基准和潜在的研究方向。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号