This article proposes a novel hierarchical residual network with attention mechanism (HResNetAM) for hyperspectral image spectral-spatial classification to improve the performance of conventional deep learning networks. The straightforward convolutional neural network based models have limitations in exploiting the multi-scale spatial and spectral features, and this is the key factor in dealing with the high-dimensional nonlinear characteristics present in hyperspectral images. The proposed hierarchical residual network can extract multi-scale spatial and spectral features at a granular level, so the receptive fields range of this network will be increased, which can enhance the feature representation ability of the model. Besides, we utilize the attention mechanism to set adaptive weights for spatial and spectral features of different scales, and this can further improve the discriminative ability of extracted features. Furthermore, the double branch structure is also exploited to extract spectral and spatial features with corresponding convolution kernels in parallel, and the extracted spatial and spectral features of multiple scales are fused for hyperspectral image classification. Four benchmark hyperspectral datasets collected by different sensors and at different acquisition time are employed for classification experiments, and comparative results reveal that the proposed method has competitive advantages in terms of classification performance when compared with other state-ofthe-art deep learning models.
Recently, the deep learning models have achieved great success in hyperspectral images classification. However, most of the deep learning models fail to obtain satisfactory results under the condition of small samples due to the contradiction between the large parameter space of the deep learning models and the insufficient labeled samples in hyperspectral images. To address the problem, a deep model based on the induction network is designed in this paper to improve the classification performance of hyperspectral images under the condition of small samples. Specifically, the typical meta-training strategy is adopted, enabling the model to acquire stronger generalization ability, so as to accurately distinguish the new classes with only a few labeled samples (e.g., 5 samples per class). Moreover, in order to deal with the disturbance caused by the various characteristics of the samples in the same class in HSI, the classwise induction module is introduced utilizing the dynamic routing algorithm, which can induce the sample-wise representations to the class-wise level representations. The obtained class-wise level representations possess better separability, allowing the designed model to generate more accurate and robust classification results. Extensive experiments are carried out on three public hyperspectral images to verify the effectiveness of the proposed method. The results demonstrate that our method outperforms existing deep learning methods under the condition of small samples.
Convolutional neural networks (CNNs) have strong feature extraction capability, which have been used to extract features from the hyperspectral image. Local binary pattern (LBP) is a simple but powerful descriptor for spatial features, which can lessen the workload of CNNs and improve the classification accuracy. In order to make full use of the feature extraction capability of CNNs and the discrimination of LBP features, a novel classification method combining dual-channel CNNs and LBP is proposed. Specifically, a one-dimensional CNN (1D-CNN) is adopted to process original hyperspectral data to extract hierarchical spectral features and another same 1D-CNN is applied to process LBP features to further extract spatial features. Then, the concatenation of two fully connected layers from the two CNNs, which fused features, is fed into a softmax classifier to complete the classification. The experimental results demonstrate that the proposed method can provide 98.52%, 99.54% and 99.54% classification accuracy on the Indian Pines, University of Pavia and Salinas data, respectively. And the proposed method can also obtain good performance even with limited training samples.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.