首页> 外文会议>International conference on big data analytics >Khanan: Performance Comparison and Programming α-Miner Algorithm in Column-Oriented and Relational Database Query Languages
【24h】

Khanan: Performance Comparison and Programming α-Miner Algorithm in Column-Oriented and Relational Database Query Languages

机译:Khanan:面向列和关系数据库查询语言的性能比较和编程α-Miner算法

获取原文

摘要

Process-Aware Information Systems (PAIS) support business processes and generate large amounts of event logs from the execution of business processes. An event log is represented as a tuple of CaselD, Timestamp, Activity and Actor. Process Mining is a new and emerging field that aims at analyzing the event logs to discover, enhance and improve business processes and check conformance between run time and design time business processes. The large volume of event logs generated are stored in the databases. Relational databases perform well for a certain class of applications. However, there is a certain class of applications for which relational databases are not able to scale well. To address the challenges of scalability, NoSQL database systems emerged. Discovering a process model (workflow) from event logs is one of the most challenging and important Process Mining tasks. The α-miner algorithm is one of the first and most widely used Process Discovery techniques. Our objective is to investigate which of the databases (Relational or NoSQL) performs better for a Process Discovery application under Process Mining. We implement the α-miner algorithm on relational (row-oriented) and NoSQL (column-oriented) databases in database query languages so that our application is tightly coupled to the database. We conduct a performance benchmarking and comparison of the α-miner algorithm on row-oriented database and NoSQL column-oriented database. We present the comparison on various aspects like time taken to load large datasets, disk usage, stepwise execution time and compression technique.
机译:流程感知信息系统(PAIS)支持业务流程,并从业务流程的执行中生成大量事件日志。事件日志表示为CaselD,Timestamp,Activity和Actor的元组。流程挖掘是一个新兴领域,旨在分析事件日志以发现,增强和改进业务流程,并检查运行时和设计时业务流程之间的一致性。生成的大量事件日志存储在数据库中。关系数据库对于某些类别的应用程序表现良好。但是,在某些类型的应用程序中,关系数据库无法很好地扩展。为了解决可伸缩性的挑战,出现了NoSQL数据库系统。从事件日志中发现流程模型(工作流)是最具挑战性和最重要的流程挖掘任务之一。 α-miner算法是最早且使用最广泛的过程发现技术之一。我们的目标是调查在Process Mining下哪个数据库(关系数据库或NoSQL)对于Process Discovery应用程序的性能更好。我们以数据库查询语言在关系(行)和NoSQL(列)数据库上实现α-miner算法,以便我们的应用程序与数据库紧密耦合。我们对面向行的数据库和NoSQL面向列的数据库的α-miner算法进行了性能基准测试和比较。我们在各个方面进行了比较,例如加载大型数据集所需的时间,磁盘使用情况,逐步执行时间和压缩技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号