Journal of Systems Engineering and Electronics ›› 2019, Vol. 30 ›› Issue (2): 238-244.doi: 10.21629/JSEE.2019.02.02

• Electronics Technology • Previous Articles     Next Articles

Effective distributed convolutional neural network architecture for remote sensing images target classification with a pre-training approach

Binquan LI1,*(), Xiaohui HU2()   

  1. 1 School of Automation Science and Electrical Engineering, Beihang University, Beijing 100191, China
    2 Institute of Software, Chinese Academy of Sciences, Beijing 100190, China
  • Received:2017-09-11 Online:2019-04-01 Published:2019-04-28
  • Contact: Binquan LI E-mail:jz05022300@sina.com;hxh@iscas.ac.cn
  • About author:LI Binquan was born in 1986. He is now pursuing his Ph.D. degree with School of Automation Science and Electrical Engineering, Beihang University, Beijing, China. His research interests are deeplearning, computer vision and big data processing. E-mail:jz05022300@sina.com|HU Xiaohui was born in 1960. He is a researcher at Institute of Software, Chinese Academy of Sciences. His research interests include artificial intelligence, information system integration and simulation technology. E-mail:hxh@iscas.ac.cn
  • Supported by:
    the National Natural Science Foundation of China(U1435220);This work was supported by the National Natural Science Foundation of China (U1435220)

Abstract:

How to recognize targets with similar appearances from remote sensing images (RSIs) effectively and efficiently has become a big challenge. Recently, convolutional neural network (CNN) is preferred in the target classification due to the powerful feature representation ability and better performance. However, the training and testing of CNN mainly rely on single machine. Single machine has its natural limitation and bottleneck in processing RSIs due to limited hardware resources and huge time consuming. Besides, overfitting is a challenge for the CNN model due to the unbalance between RSIs data and the model structure. When a model is complex or the training data is relatively small, overfitting occurs and leads to a poor predictive performance. To address these problems, a distributed CNN architecture for RSIs target classification is proposed, which dramatically increases the training speed of CNN and system scalability. It improves the storage ability and processing efficiency of RSIs. Furthermore, Bayesian regularization approach is utilized in order to initialize the weights of the CNN extractor, which increases the robustness and flexibility of the CNN model. It helps prevent the overfitting and avoid the local optima caused by limited RSI training images or the inappropriate CNN structure. In addition, considering the efficiency of the Naïve Bayes classifier, a distributed Naïve Bayes classifier is designed to reduce the training cost. Compared with other algorithms, the proposed system and method perform the best and increase the recognition accuracy. The results show that the distributed system framework and the proposed algorithms are suitable for RSIs target classification tasks.

Key words: convolutional neural network (CNN), distributed architecture, remote sensing images (RSIs), target classification, pre-training