测绘学报

• 学术论文 •    

一种基于多约束的空间聚类方法

刘启亮1,邓敏2,石岩1,彭东亮1   

  1. 1. 中南大学信息物理工程学院
    2. 中南大学
  • 收稿日期:2010-06-10 修回日期:2010-09-30 出版日期:2011-08-25 发布日期:2011-08-25
  • 通讯作者: 刘启亮

A Novel Spatial Clustering Method Based on Multi-Constraints

  • Received:2010-06-10 Revised:2010-09-30 Online:2011-08-25 Published:2011-08-25

摘要: 空间聚类是挖掘空间分布模式与探测空间异常的重要手段,已成为空间数据挖掘与知识发现领域的一个主要研究方向。随着空间聚类技术研究与应用的深入,迫切需要发展能够普适性的空间聚类算法。该算法一方面能够适应海量、分布复杂的空间数据(如任意形状的空间簇、噪声点及空间密度变化),另一方面又能够综合考虑空间邻近与专题属性相似,且人为干预较少。为此,本文借助Delaunay三角网构建空间邻近关系的优势,通过施加不同层次、不同类型的约束,提出了一种空间聚类的新算法。通过实验分析与比较发现,该算法可以探测复杂结构的空间簇,对噪声点稳健,并且能够同时顾及实体间空间位置与专题属性的相似性,从而验证了本文算法的有效性与优越性。

Abstract: Spatial clustering has played an important role in spatial data mining and spatial analysis. It aims to classify the spatial entities in a database into some meaningful clusters, where entities are similar to one another in same clusters and are dissimilar to the entities in different clusters. In practical applications, spatial clustering algorithm can be used to discover clusters in the spatial database with complicated structures, such as clusters of uneven density, clusters of arbitrary shape, clusters adjacent to each other, and clusters with significant outliers. In the meantime, it is also required that entities in a same cluster should be both nearest in spatial domain and very similar in thematic attribute. For this purpose, a novel multi-constraints based spatial clustering algorithm is proposed in this paper. Delaunay triangulation is first employed to determine the spatial proximate relationships among entities, and then multi-levels constraints are utilized to discover clusters in the spatial database. The presented algorithm is tested on both simulated and practical spatial databases, and its advantages over some well-known algorithms are also fully highlighted.