Acta Geodaetica et Cartographica Sinica ›› 2019, Vol. 48 ›› Issue (6): 718-726.doi: 10.11947/j.AGCS.2019.20170740

• Photogrammetry and Remote Sensing • Previous Articles     Next Articles

A remote sensing image semantic segmentation method by combining deformable convolution with conditional random fields

ZUO Zongcheng1,2, ZHANG Wen1, ZHANG Dongying3   

  1. 1. School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China;
    2. Autodesk(China) Software Research and Development Co. Ltd., Shanghai 200122, China;
    3. College of Water Conservancy & Environmental Engineering, Zhengzhou University, Zhengzhou 450002, China
  • Received:2017-12-23 Revised:2018-08-01 Online:2019-06-20 Published:2019-07-09
  • Supported by:
    The National Key Research and Development Program of China (No.2017YFC0405806)

Abstract: Currently, deep convolutional neural networks have made great progress in the field of semantic segmentation. Because of the fixed convolution kernel geometry, standard convolution neural networks have been limited the ability to simulate geometric transformations. Therefore, a deformable convolution is introduced to enhance the adaptability of convolutional networks to spatial transformation. Considering that the deep convolutional neural networks cannot adequately segment the local objects at the output layer due to using the pooling layers in neural networks architecture. To overcome this shortcoming, the rough prediction segmentation results of the neural network output layer will be processed by fully connected conditional random fields to improve the ability of image segmentation. The proposed method can easily be trained by end-to-end using standard backpropagation algorithms. Finally, the proposed method is tested on the ISPRS dataset. The results show that the proposed method can effectively overcome the influence of the complex structure of the segmentation object and obtain state-of-the-art accuracy on the ISPRS Vaihingen 2D semantic labeling dataset.

Key words: high-resolution remote sensing image, semantic segmentation, deformable convolution network, conditions random fields

CLC Number: