Object detection has made tremendous progress in natural images over the last decade. However, the results are hardly satisfactory when the natural image object detection algorithm is directly applied to satellite images. This is due to the intrinsic differences in the scale and orientation of objects generated by the bird’s-eye perspective of satellite photographs. Moreover, the background of satellite images is complex and the object area is small; as a result, small objects tend to be missing due to the challenge of feature extraction. Dense objects overlap and occlusion also affects the detection performance. Although the self-attention mechanism was introduced to detect small objects, the computational complexity increased with the image’s resolution. We modified the general one-stage detector YOLOv5 to adapt the satellite images to resolve the above problems. First, new feature fusion layers and a prediction head are added from the shallow layer for small object detection for the first time because it can maximally preserve the feature information. Second, the original convolutional prediction heads are replaced with Swin Transformer Prediction Heads (SPHs) for the first time. SPH represents an advanced self-attention mechanism whose shifted window design can reduce the computational complexity to linearity. Finally, Normalization-based Attention Modules (NAMs) are integrated into YOLOv5 to improve attention performance in a normalized way. The improved YOLOv5 is termed SPH-YOLOv5. It is evaluated on the NWPU-VHR10 dataset and DOTA dataset, which are widely used for satellite image object detection evaluations. Compared with the basal YOLOv5, SPH-YOLOv5 improves the mean Average Precision (mAP) by 0.071 on the DOTA dataset.
Hyperspectral images are widely used for classification due to its rich spectral information along with spatial information. To process the high dimensionality and high nonlinearity of hyperspectral images, deep learning methods based on convolutional neural network (CNN) are widely used in hyperspectral classification applications. However, most CNN structures are stacked vertically in addition to using a onefold size of convolutional kernels or pooling layers, which cannot fully mine the multiscale information on the hyperspectral images. When such networks meet the practical challenge of a limited labeled hyperspectral image dataset—i.e., “small sample problem”—the classification accuracy and generalization ability would be limited. In this paper, to tackle the small sample problem, we apply the semantic segmentation function to the pixel-level hyperspectral classification due to their comparability. A lightweight, multiscale squeeze-and-excitation pyramid pooling network (MSPN) is proposed. It consists of a multiscale 3D CNN module, a squeezing and excitation module, and a pyramid pooling module with 2D CNN. Such a hybrid 2D-3D-CNN MSPN framework can learn and fuse deeper hierarchical spatial–spectral features with fewer training samples. The proposed MSPN was tested on three publicly available hyperspectral classification datasets: Indian Pine, Salinas, and Pavia University. Using 5%, 0.5%, and 0.5% training samples of the three datasets, the classification accuracies of the MSPN were 96.09%, 97%, and 96.56%, respectively. In addition, we also selected the latest dataset with higher spatial resolution, named WHU-Hi-LongKou, as the challenge object. Using only 0.1% of the training samples, we could achieve a 97.31% classification accuracy, which is far superior to the state-of-the-art hyperspectral classification methods.
Our recent studies on nodular damage in dielectric multilayer mirrors were first reviewed, and the main findings are taken as a foundation to further investigate the influence of seed absorptivity and asymmetrical boundary on the laser-induced damage of nodules. Experimental results showed that the seed absorptivity had a big influence on laser-induced damage thresholds (LIDTs) of the prepared nodules. A direct link between the cross-sectional |E|2 distributions and damage morphologies of nodules was found, which can perfectly explain the observed dependence of LIDTs on seed absorptivity. Another series of asymmetrical nodules were also studied in this work. The measured LIDTs of asymmetrical nodules were about 40%-70% lower than the LIDTs of the symmetrical nodules initiating from the same-sized SiO2 seeds. The weaker mechanical stability and the nonuniform |E|2 distributions are two main reasons for the lower laser damage resistance of the asymmetrical nodules.
Change detection (CD) aims to identify differences in scenes observed at different times. Hyperspectral image (HSI) is preferred for the understanding of land surface changes, since it can provide essential and unique features for CD. However, due to the high-dimensionality and limited data, the HSI-CD task is challenged. While model-driven CD methods are hard to achieve high accuracy due to the weak detection performance for fine changes, data-driven CD methods are hard to be generalized due to the limited data sets. The state-of-art method is to combine a single model-driven method with a data-driven convolutional neural network (CNN). Wherein the pseudo-labels can be generated automatically by the model-driven method and then fed to CNN for training. However, the final detection accuracy is limited by the model-driven method which produces pseudo-labels with one-sidedness and low accuracy. Therefore, the generation of credible pseudo-labels is anticipated and crucial for such a combination. Herein, a novel strategy, the combination of two complementary model-driven methods, structural similarity (SSIM) and change vector analysis (CVA), is proposed to generate credible labels for training a subsequent CNN. The results show that the final accuracy is higher than that of SSIM and CVA. The main contributions of this paper are threefold: 1) a new paradigm for generating credible labels is proposed. 2) SSIM is used for the first time for HSI-CD tasks. 3) an unsupervised end-to-end framework is presented for the HSI-CD. Experimental results demonstrate the effectiveness of the proposed framework.Index Terms-Change detection (CD), deep learning, heterogeneous images, hyperspectral image (HSI), structural similarity (SSIM).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.