Acta Geodaetica et Cartographica Sinica ›› 2018, Vol. 47 ›› Issue (5): 620-630.doi: 10.11947/j.AGCS.2018.20170191

Previous Articles     Next Articles

Joint Multi-scale Convolution Neural Network for Scene Classification of High Resolution Remote Sensing Imagery

ZHENG Zhuo1,2, FANG Fang1, LIU Yuanyuan1, GONG Xi1, GUO Mingqiang1, LUO Zhongwen1   

  1. 1. College of Information Engineering, China University of Geosciences, Wuhan 430074, China;
    2. State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430079, China
  • Received:2017-04-17 Revised:2018-02-09 Online:2018-05-20 Published:2018-06-01
  • Supported by:
    The National Natural Science Foundation of China (Nos.61602429;41701446),Chinese Geologic Survey Project (No.KZ17Z618)

Abstract: High resolution remote sensing imagery scene classification is important for automatic complex scene recognition, which is the key technology for military and disaster relief, etc. In this paper, we propose a novel joint multi-scale convolution neural network (JMCNN) method using a limited amount of image data for high resolution remote sensing imagery scene classification. Different from traditional convolutional neural network, the proposed JMCNN is an end-to-end training model with joint enhanced high-level feature representation, which includes multi-channel feature extractor, joint multi-scale feature fusion and Softmax classifier. Multi-channel and scale convolutional extractors are used to extract scene middle features, firstly. Then, in order to achieve enhanced high-level feature representation in a limit dataset, joint multi-scale feature fusion is proposed to combine multi-channel and scale features using two feature fusions. Finally, enhanced high-level feature representation can be used for classification by Softmax. Experiments were conducted using two limit public UCM and SIRI datasets. Compared to state-of-the-art methods, the JMCNN achieved improved performance and great robustness with average accuracies of 89.3% and 88.3% on the two datasets.

Key words: high resolution remote sensing imagery, scene classification, joint multi-scale convolution neural network, enhanced high-level feature representation, limit datasets

CLC Number: