测绘学报

• 学术论文 •    

基于对数-主成分变换的EM算法用于遥感分类

杨红磊1,彭军还2,李淑慧1,师芸3   

  1. 1. 中国地质大学(北京)
    2. 中国地质大学(北京)土地科学技术学院测量系
    3. 西安科技大学
  • 收稿日期:2009-06-03 修回日期:2010-01-22 出版日期:2010-08-25 发布日期:2010-08-25
  • 通讯作者: 杨红磊

EM algorithm based on Log-principal component transformation for remote sensing classification

  • Received:2009-06-03 Revised:2010-01-22 Online:2010-08-25 Published:2010-08-25

摘要: 用EM方法直接对多光谱数据分类会遭遇方差协方差矩阵奇异和随机选取初值敏感两个问题,前者会导致计算失败,后者因随机选取不同初值而得到不同的分类结果。本文提出对多光谱数据进行对数变换来凸显类型特征,然后进行主成分变换并根据主成分贡献率确定EM算法分类所需主成分数,消除了方差协方差矩阵的奇异性,同时削弱了噪声;对数变换后的第一主成分直方图充分反映了类型信息,由此确定的初始标签作为多个主成分EM分类算法所需初始值,避开了随机选初值的敏感问题。实验证明,所提出的计算方案分类精度优于普通EM方法和传统的K-means 方法。

Abstract: Two nuisances will be encountered in classifying the multi-spectral data set using the EM algorithm. The first is the singularity of the variance–covariance matrix,which will lead to the failure of the computation; the second is the sensitivity to the initial values selected at random ,which will result in different final classification results. This paper suggests that the difference of classes is emphasized by logarithmizing the original data, and then apply the principal component transformation to the logarithmized data. The number of principal components for the EM algorithm is determined according to the contribution rate of more than 85% of all principal components. This not only removes the singularity but weakens the noise; The histogram of the first principal component of the logarithmized data reflects sufficiently the information of the class difference, from which the initial label for multi-dimensional EM algorithm can be efficiently determined, and the sensitivity of the initial value selected at random can be avoided. The experiment shows that the proposed strategy is better than the general EM algorithm and K-means.