Real-time processing of IoT events with historic data using Apache Kafka and Apache Spark with dashing framework

机译：使用带有破折号框架的Apache Kafka和Apache Spark实时处理具有历史数据的IoT事件

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

IoT (Internet of Things) is a concept that broadens the idea of connecting multiple devices to each other over the Internet and enabling communication between these devices. Traditionally, the packets are sent over the network for communication only if both, the sender as well as the receiver, are online. This forces the sender and the receiver to be online 24×7; which is not achievable in each and every environment the devices communicates in. Considering the humongous data generated in the communication, it is necessary to store and process this data so that data insights can be identified to improve the organizational benefits. This generated data can be in two forms, real-time as well as existing or historical data. When this data is obtained in real-time and it is processed, even traditional big data technologies do not perform up to the mark. Hence to process this real-time data, streaming of this data is required; which is not a feature of traditional big data technologies. To achieve these objectives, the proposed architecture uses open source technologies such as Apache Kafka, for online and offline consumption of messages, and Apache Spark, to stream, process and provide a structure to the real-time and existing data. A framework known as Dashing is used to present the processed data in a more attractive and readable manner.

机译：物联网（IoT）是一个概念，它扩展了通过Internet将多个设备彼此连接并实现这些设备之间的通信的想法。传统上，只有在发送者和接收者都在线的情况下，才通过网络发送数据包以进行通信。这将强制发送者和接收者处于24×7在线状态；在设备进行通信的每个环境中，这都是无法实现的。考虑到通信中生成的庞大数据，有必要存储和处理此数据，以便识别数据见解以提高组织效益。生成的数据可以采用两种形式，即实时数据，现有数据或历史数据。当实时获取并处理此数据时，即使是传统的大数据技术也无法达到预期的效果。因此，要处理此实时数据，需要对这些数据进行流传输。这不是传统大数据技术的功能。为了实现这些目标，建议的体系结构使用诸如Apache Kafka之类的开源技术来在线和离线使用消息，而Apache Spark则对实时数据和现有数据进行流传输，处理并提供结构。称为短跑的框架用于以更具吸引力和可读性的方式显示处理后的数据。

著录项

来源
《2017 2nd IEEE International Conference on Recent Trends in Electronics, Information amp; Communication Technology》|2017年|1804-1809|共6页
会议地点 Bangalore(IN)
作者
Godson Michael Dsilva; Azharuddin Khan; Gaurav; Siddhesh Bari;
展开▼
作者单位

Information Technology Department, St. John College of Engineering and Technology, Palghar, India;

Information Technology DepartmentSt. John College of Engineering and Technology, Palghar, India;

Joshi, Information Technology Department St. John College of Engineering and Technology, Palghar, India;

Information Technology Department, St. John College of Engineering and Technology, Palghar, India;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Sparks; Real-time systems; Computer architecture; Big Data; Servers; Performance evaluation;

机译：Sparks;实时系统;计算机体系结构;大数据;服务器;性能评估;;

相似文献

外文文献
中文文献
专利

1. A comparison on scalability for batch big data processing on Apache Spark and Apache Flink [J] . Diego García-Gil, Sergio Ramírez-Gallego, Salvador García, Big Data Analytics . 2017,第1期

机译：Apache Spark和Apache Flink上批处理大数据处理的可伸缩性比较
2. AXS: A Framework for Fast Astronomical Data Processing Based on Apache Spark [J] . Petar Ze?evi?, Colin T. Slater, Mario Juri?, The astronomical journal . 2019,第1期

机译：轴：基于Apache Spark的快速天文数据处理框架
3. An Integrated Data Preprocessing Framework Based on Apache Spark for Fault Diagnosis of Power Grid Equipment [J] . Shi Weiwei, Zhu Yongxin, Huang Tian, Journal of VLSI signal processing systems for signal, image, and video technology . 2017,第2a3期

机译：基于Apache Spark的集成数据预处理框架用于电网设备故障诊断
4. Real-time processing of IoT events with historic data using Apache Kafka and Apache Spark with dashing framework [C] . Godson Michael Dsilva, Azharuddin Khan, Gaurav, IEEE International Conference on Recent Trends in Electronics, Information Communication Technology . 2017

机译：使用Apache Kafka和Apache Spark与历史数据的实时处理带有潇洒的框架
5. Streamlining Big Data Processing Pipelines via Unix Memory Tools, Persistent Spark Datasets, and the Apache Ignite Inmemory File System [D] . Blair, Walter 2018

机译：通过Unix内存工具，持久性Spark数据集和Apache Ignite内存文件系统简化大数据处理管道
6. Big Data Approaches for the Analysis of Large-Scale fMRI Data Using Apache Spark and GPU Processing: A Demonstration on Resting-State fMRI Data from the Human Connectome Project [O] . Roland N. Boubela, Klaudius Kalcher, Wolfgang Huf, 2015

机译：使用Apache Spark和GPU处理的大数据分析方法用于大规模fMRI数据：来自人类Connectome项目的静态fMRI数据的演示
7. A comparison on scalability for batch big data processing on Apache Spark and Apache Flink [O] . 2017

机译：Apache Spark和Apache Flink上批处理大数据处理的可伸缩性比较

Real-time processing of IoT events with historic data using Apache Kafka and Apache Spark with dashing framework

摘要

著录项

相似文献

相关主题

期刊订阅