Implementation of scalable K-Means++ clustering for passengers temporal pattern analysis in public transportation system (BRT Trans Jogja case study)

Abstract

The popularity of Bus Rapid Transit (BRT) has made Trans Jogja an alternative mass public transportation system for urban mobility. However, if the temporal patterns of passenger behavior, and with them supply and demand, are not monitored, the number of BRT users will fall and the number of private vehicle users will rise, so traffic jams will remain difficult to avoid. The Smart Card Automated Fare Collection System (SCAFCS), currently used for e-ticketing on Trans Jogja, can be used to analyze passenger patterns with data mining approaches. This paper preprocesses SCAFCS data with a data warehouse mechanism and uses the Hadoop platform for distributed computing to improve the scalability of K-Means++ clustering on large datasets; the Trans Jogja SCAFCS dataset is both large (volume) and fast-growing (velocity). The scalable K-Means++ algorithm generates five clusters, labeled Very Low, Low, Average, High, and Very High. The clusters were used to analyze passenger patterns along three dimensions: time (temporal), passenger segmentation by the type of card used (structural), and transaction peaks by boarding location (spatial). The experiments compared the Sum of Squared Errors (SSE), that is, the total squared error of the k clusters about their centroids, across three algorithms: simple K-Means, K-Means++, and K-Means++ implemented on the Hadoop platform for parallel and distributed computing. The Hadoop implementation of K-Means++ produced a smaller SSE than simple K-Means and K-Means++, which indicates a better clustering fit by this measure.
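The clustering and the SSE comparison described in the abstract can be illustrated with a minimal single-machine sketch. It is not the authors' Hadoop implementation: the hourly transaction counts are made-up values, and the names `kmeanspp_seeds`, `lloyd`, `sse`, and `hourly_counts` are hypothetical. It shows K-Means++ seeding on one-dimensional data, refinement with standard K-Means (Lloyd) iterations, and the SSE before and after refinement, with the five centroids mapped to the labels used in the paper.

```python
import random

def sse(points, centroids):
    """Sum of squared distances from each point to its nearest centroid."""
    return sum(min((p - c) ** 2 for c in centroids) for p in points)

def kmeanspp_seeds(points, k, rng):
    """K-Means++ seeding: each new seed is drawn with probability
    proportional to its squared distance from the nearest existing seed."""
    seeds = [rng.choice(points)]
    while len(seeds) < k:
        weights = [min((p - s) ** 2 for s in seeds) for p in points]
        seeds.append(rng.choices(points, weights=weights, k=1)[0])
    return seeds

def lloyd(points, centroids, max_iter=100):
    """Standard K-Means iterations: assign points, then recompute means."""
    centroids = list(centroids)
    for _ in range(max_iter):
        groups = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: (p - centroids[i]) ** 2)
            groups[nearest].append(p)
        # An empty group keeps its previous centroid.
        updated = [sum(g) / len(g) if g else c
                   for g, c in zip(groups, centroids)]
        if updated == centroids:
            break
        centroids = updated
    return centroids

# Hypothetical hourly transaction counts (illustrative, not SCAFCS data).
hourly_counts = [12, 15, 18, 95, 102, 110, 240, 255, 260,
                 410, 425, 430, 610, 640, 655]

rng = random.Random(0)  # fixed seed so the run is repeatable
seeds = kmeanspp_seeds(hourly_counts, 5, rng)
centroids = sorted(lloyd(hourly_counts, seeds))

seed_sse = sse(hourly_counts, seeds)
final_sse = sse(hourly_counts, centroids)

# Map the sorted centroids to the five labels named in the abstract.
labels = ["Very Low", "Low", "Average", "High", "Very High"]
profile = dict(zip(labels, centroids))

print("SSE after seeding:", seed_sse)
print("SSE after K-Means iterations:", final_sse)
for label, centre in profile.items():
    print(label, round(centre, 1))
```

Sorting the centroids before labeling is a design choice of this sketch so that "Very Low" always maps to the smallest centre; how the paper assigns its labels is not stated in the abstract.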


Bibliographic details

Source: 2016, pp. 78-83 (6 pages)
Conference location: Yogyakarta, Indonesia
Authors: Fahmi Dzikrullah; Noor Akhmad Setiawan; Selo Sulistyo
Author affiliation (same for all three authors): Department of Electrical Engineering and Information Technology, Engineering Faculty, Gadjah Mada University, Yogyakarta, Indonesia
Original format: PDF
Language: English
Keywords: Clustering algorithms; Data mining; Data preprocessing; Data warehouses; Scalability; Seminars; Public transportation


Similar literature

1. Environmental co-benefits of public transportation improvement initiative: the case of Trans-Jogja bus system in Yogyakarta, Indonesia [J]. Puspita Dirgahayani. Journal of Cleaner Production, 2013, no. nova1.
2. Toward Universal Design in Public Transportation Systems: An Analysis of Low-Floor Bus Passenger Behavior with Video Observations [J]. Hwangbo Hwan, Kim Jiyeon, Kim Sunwoong. Human Factors and Ergonomics in Manufacturing & Service Industries, 2015, no. 2.
3. Comparative Study of K-Means and K-Means++ Clustering Algorithms on Crime Domain [J]. Bashar Aubaidan, Masnizah Mohd, Mohammed Albared. Journal of Computer Sciences, 2014, no. 7.
4. Implementation of scalable K-Means++ clustering for passengers temporal pattern analysis in public transportation system (BRT Trans Jogja case study) [C]. Fahmi Dzikrullah, Noor Akhmad Setiawan, Selo Sulistyo. International Annual Engineering Seminar, 2016.
5. Perceptions of public transportation passengers in Athens pertaining to the effects of the 2004 Olympic Games: A path analysis approach [D]. Doukas, Spiro G. 2007.
6. Improving public transportation systems with self-organization: A headway-based model and regulation of passenger alighting and boarding [O]. Gustavo Carreón, Carlos Gershenson, Luis A. Pineda. 2011.
7. Analisa Pelayanan Angkutan Umum Bus Trans Jogja Trayek 1A untuk Menunjang Sistem Transportasi dari dan ke Bandara Adi Sucipto Yogyakarta (Analysis Services Public Transport Bus Trans Jogja Route 1A to Support Transportation System From And To Airport Adi Sucipto Yogyakarta) [O]. Darmawan Evan Prakoso, Putra Piasco Mahendra. 2009.
