Efficient Density Clustering Method for Spatial Data

Abstract

Data mining for spatial data has become increasingly important as more and more organizations are exposed to spatial data from sources such as remote sensing, geographical information systems, astronomy, computer cartography, and environmental assessment and planning. Recently, density-based clustering methods such as DENCLUE, DBSCAN, and OPTICS have been published and recognized as powerful clustering methods for data mining. These approaches have a run-time complexity of O(n log n) when using spatial index techniques such as R+-trees and grid cells. However, these methods are known to lack scalability with respect to dimensionality. In this paper, a unique approach to efficient neighborhood search and a new efficient density-based clustering algorithm using EIN-rings are developed. Our approach exploits compressed vertical data structures, Peano Trees (P-trees), and fast P-tree logical operations to accelerate the calculation of the density function within EIN-rings. This approach stands in contrast to the ubiquitous approach of vertically scanning horizontal data structures (records). The average run-time complexity of our algorithm for spatial data in d dimensions is O(dn√n). Our proposed method has cardinality scalability comparable to other density methods for small and medium-sized data sets, but superior speed and dimensional scalability.
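The abstract's idea of counting the points in a ring with logical operations on vertical structures can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes uncompressed bit slices held in Python integers in place of P-trees, and it assumes a neighborhood is the set of points that share their leading coordinate bits with the target, which gives aligned intervals rather than intervals centered on the target. The names bit_slices, neighborhood_mask, and ring_count are hypothetical.

```python
def bit_slices(values, nbits):
    """Vertical layout: one bitmask per bit position, most significant first.

    Bit i of slices[k] is set iff record i has a 1 at bit position nbits-1-k.
    (Assumption: plain integers stand in for compressed P-trees.)
    """
    slices = []
    for j in range(nbits - 1, -1, -1):
        mask = 0
        for i, v in enumerate(values):
            if (v >> j) & 1:
                mask |= 1 << i
        slices.append(mask)
    return slices


def neighborhood_mask(dim_slices, target, n, nbits, keep):
    """Bitmask of records matching the target's top `keep` bits in every dimension."""
    all_ones = (1 << n) - 1
    mask = all_ones
    for slices, t in zip(dim_slices, target):
        for k in range(keep):
            bit = (t >> (nbits - 1 - k)) & 1
            # AND with the slice, or with its complement, depending on the target bit.
            mask &= slices[k] if bit else all_ones & ~slices[k]
    return mask


def ring_count(dim_slices, target, n, nbits, inner_keep, outer_keep):
    """Count records in the outer neighborhood but not in the inner one."""
    inner = neighborhood_mask(dim_slices, target, n, nbits, inner_keep)
    outer = neighborhood_mask(dim_slices, target, n, nbits, outer_keep)
    return bin(outer & ~inner).count("1")


# Six 2-D points with 3-bit coordinates (made-up illustration data).
xs = [1, 2, 3, 6, 7, 2]
ys = [1, 1, 3, 6, 5, 7]
dim_slices = [bit_slices(xs, 3), bit_slices(ys, 3)]
print(ring_count(dim_slices, (2, 1), 6, 3, inner_keep=2, outer_keep=1))
```

No record is scanned row by row: every count comes from ANDing whole bit columns, and the work per neighborhood grows with the number of dimensions times the number of bits kept. A density estimate could then be formed as a weighted sum of such ring counts, with weights that are again an assumption of this sketch.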


Bibliographic details

Source: 7th European Conference on Principles and Practice of Knowledge Discovery in Databases; Sep 22-26, 2003; Cavtat-Dubrovnik, Croatia | 2003 | pp. 375-386 | 12 pages
Conference location: Cavtat-Dubrovnik (HR)
Authors: Fei Pan; Baoying Wang; Yi Zhang; Dongmei Ren; Xin Hu; William Perrizo
Affiliation: Computer Science Department, North Dakota State University, Fargo, ND 58105
Original format: PDF
Language: English
Chinese Library Classification: Automation technology, computer technology

