A Distributed Parallel Clustering Method Based on Trajectory Data Density Partitioning

王佳玉; 张振宇; 褚征; 吴晓红

Abstract

Advances in global positioning technology and location-based services have driven the growth of trajectory big data. Trajectory clustering, one of the most important trajectory analysis tasks, has been studied extensively. Most existing clustering methods run in single-processor mode, so processing large-scale trajectory data takes a long time and cannot meet the needs of time-sensitive trajectory analysis tasks. To address this, a distributed parallel clustering method based on trajectory data density partitioning is proposed. First, the entire trajectory dataset is abstracted into a rectangular region, and the data are divided, through a transformation of the rectangle's longest dimension, into several partitions of roughly equal workload, which form the local datasets for distributed parallel clustering. Each worker server then runs the DBSCAN clustering algorithm on its local partition, and the manager server merges and integrates the local clustering results. Experimental results confirm the effectiveness of the method and show that it improves the computational efficiency of clustering analysis to a certain degree.
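The partitioning step in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the "transformation of the longest dimension", so this sketch assumes a recursive split along the longer side of the bounding rectangle that keeps point counts balanced. The names `bounding_rectangle` and `split_balanced` are hypothetical, and the local DBSCAN runs and the merge on the manager server are not shown.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def bounding_rectangle(points: List[Point]) -> Tuple[float, float, float, float]:
    """Return (min_x, min_y, max_x, max_y) of the rectangle enclosing the points."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return min(xs), min(ys), max(xs), max(ys)


def split_balanced(points: List[Point], num_partitions: int) -> List[List[Point]]:
    """Split points into partitions holding roughly equal numbers of points.

    Assumption (not from the paper): each step cuts along the longer side
    of the current bounding rectangle, at the position that divides the
    points in proportion to the number of partitions on each side.
    """
    if num_partitions <= 1 or len(points) <= 1:
        return [points]
    min_x, min_y, max_x, max_y = bounding_rectangle(points)
    # Choose the axis along which the rectangle is longest.
    axis = 0 if (max_x - min_x) >= (max_y - min_y) else 1
    ordered = sorted(points, key=lambda p: p[axis])
    left_parts = num_partitions // 2
    right_parts = num_partitions - left_parts
    # Cut so that each side receives a share of points matching its share of partitions.
    cut = len(ordered) * left_parts // num_partitions
    return (split_balanced(ordered[:cut], left_parts)
            + split_balanced(ordered[cut:], right_parts))


if __name__ == "__main__":
    # Synthetic points in a wide, flat rectangle (illustrative data only).
    sample = [(float(i), float(i % 3)) for i in range(100)]
    for index, part in enumerate(split_balanced(sample, 4)):
        print(index, len(part), bounding_rectangle(part))
```

Balancing by point count is only a stand-in for "comparable workload"; a density-aware scheme such as the one the paper proposes may weight regions differently.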

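Bibliographic record

Source
《中国科学技术大学学报》 (Journal of University of Science and Technology of China) | 2018, No. 1 | pp. 47-56 | 10 pages
Authors
王佳玉; 张振宇; 褚征; 吴晓红
Affiliations
School of Software, Xinjiang University, Urumqi 830008;
School of Software, Xinjiang University, Urumqi 830008;
School of Information Science and Engineering, Xinjiang University, Urumqi 830046;
School of Software, Xinjiang University, Urumqi 830008;
School of Information Science and Engineering, Xinjiang University, Urumqi 830046
Original format: PDF
Text language: Chinese
CLC classification: Information processing
Keywords
Trajectory big data; distributed clustering; DBSCAN algorithm; clustering algorithm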

