Approaches for Scaling DBSCAN Algorithm to Large Spatial Databases

ZHOU Aoying; ZHOU Shuigeng; CAO Jing


Abstract

The huge amount of information stored in databases owned by corporations (e.g., retail, financial, telecom) has spurred tremendous interest in knowledge discovery and data mining. Clustering is a useful data mining technique for discovering interesting distributions and patterns in the underlying data, and it has many application fields, such as statistical data analysis, pattern recognition, image processing, and other business applications. Although researchers have worked on clustering for decades and have developed many algorithms, there is still no efficient algorithm for clustering very large databases and high-dimensional data. As an outstanding representative of clustering algorithms, the DBSCAN algorithm shows good performance in spatial data clustering. However, for large spatial databases, DBSCAN requires a large volume of memory and could incur substantial I/O costs because it operates directly on the entire database. In this paper, several approaches are proposed to scale the DBSCAN algorithm to large spatial databases. To begin with, a fast DBSCAN algorithm is developed, which considerably speeds up the original DBSCAN algorithm. Then a sampling-based DBSCAN algorithm, a partitioning-based DBSCAN algorithm, and a parallel DBSCAN algorithm are introduced in turn. Following that, a synthetic algorithm that builds on the algorithms proposed above is given. Finally, some experimental results are given to demonstrate the effectiveness and efficiency of these algorithms.
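For orientation, the following is a minimal sketch of the baseline DBSCAN procedure that the abstract refers to as the "original" algorithm. It is not the fast, sampling-based, partitioning-based, or parallel variant proposed in the paper, whose details are not given in this record. The names `dbscan`, `region_query`, `eps`, and `min_pts` are illustrative assumptions, and the neighborhood query is a plain linear scan over the whole in-memory point list, which is the kind of whole-database access the abstract identifies as costly.

```python
from math import dist  # Euclidean distance, Python 3.8+

NOISE = -1        # label for points that belong to no cluster
UNVISITED = None  # label for points not yet processed


def region_query(points, i, eps):
    """Return indices of all points within eps of points[i] (linear scan, an assumption)."""
    return [j for j, q in enumerate(points) if dist(points[i], q) <= eps]


def dbscan(points, eps, min_pts):
    """Baseline DBSCAN sketch: returns one cluster label per point."""
    labels = [UNVISITED] * len(points)
    cluster_id = 0
    for i in range(len(points)):
        if labels[i] is not UNVISITED:
            continue
        neighbors = region_query(points, i, eps)
        if len(neighbors) < min_pts:
            labels[i] = NOISE  # may later be relabeled as a border point
            continue
        # points[i] is a core point: start a new cluster and expand it
        labels[i] = cluster_id
        seeds = list(neighbors)
        while seeds:
            j = seeds.pop()
            if labels[j] == NOISE:
                labels[j] = cluster_id  # border point reached from a core point
            if labels[j] is not UNVISITED:
                continue
            labels[j] = cluster_id
            j_neighbors = region_query(points, j, eps)
            if len(j_neighbors) >= min_pts:
                seeds.extend(j_neighbors)  # j is also a core point
        cluster_id += 1
    return labels


# Illustrative data only: two tight groups and one isolated point.
sample = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10), (50, 50)]
print(dbscan(sample, eps=1.5, min_pts=3))
```

Every call to `region_query` touches all points, so the sketch performs one full scan per visited point. That cost is the motivation for the scaling approaches the abstract lists.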


Bibliographic details

Source: Journal of Computer Science & Technology, 2000, No. 6, pp. 509-526 (18 pages)
Authors: ZHOU Aoying; ZHOU Shuigeng; CAO Jing
Original format: PDF
Language: English
Chinese Library Classification: computing technology, computer technology
Keywords: spatial database; clustering; fast DBSCAN algorithm; data sam-


