首页> 外文会议>2017 2nd IEEE International Conference on Recent Trends in Electronics, Information amp; Communication Technology >Real-time processing of IoT events with historic data using Apache Kafka and Apache Spark with dashing framework
【24h】

Real-time processing of IoT events with historic data using Apache Kafka and Apache Spark with dashing framework

机译:使用带有破折号框架的Apache Kafka和Apache Spark实时处理具有历史数据的IoT事件

获取原文
获取原文并翻译 | 示例

摘要

IoT (Internet of Things) is a concept that broadens the idea of connecting multiple devices to each other over the Internet and enabling communication between these devices. Traditionally, the packets are sent over the network for communication only if both, the sender as well as the receiver, are online. This forces the sender and the receiver to be online 24×7; which is not achievable in each and every environment the devices communicates in. Considering the humongous data generated in the communication, it is necessary to store and process this data so that data insights can be identified to improve the organizational benefits. This generated data can be in two forms, real-time as well as existing or historical data. When this data is obtained in real-time and it is processed, even traditional big data technologies do not perform up to the mark. Hence to process this real-time data, streaming of this data is required; which is not a feature of traditional big data technologies. To achieve these objectives, the proposed architecture uses open source technologies such as Apache Kafka, for online and offline consumption of messages, and Apache Spark, to stream, process and provide a structure to the real-time and existing data. A framework known as Dashing is used to present the processed data in a more attractive and readable manner.
机译:物联网(IoT)是一个概念,它扩展了通过Internet将多个设备彼此连接并实现这些设备之间的通信的想法。传统上,只有在发送者和接收者都在线的情况下,才通过网络发送数据包以进行通信。这将强制发送者和接收者处于24×7在线状态;在设备进行通信的每个环境中,这都是无法实现的。考虑到通信中生成的庞大数据,有必要存储和处理此数据,以便识别数据见解以提高组织效益。生成的数据可以采用两种形式,即实时数据,现有数据或历史数据。当实时获取并处理此数据时,即使是传统的大数据技术也无法达到预期的效果。因此,要处理此实时数据,需要对这些数据进行流传输。这不是传统大数据技术的功能。为了实现这些目标,建议的体系结构使用诸如Apache Kafka之类的开源技术来在线和离线使用消息,而Apache Spark则对实时数据和现有数据进行流传输,处理并提供结构。称为短跑的框架用于以更具吸引力和可读性的方式显示处理后的数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号