The ghost phenomenon in synthetic aperture radar (SAR) imaging is primarily caused by azimuth or range ambiguities, which cause difficulties in SAR target detection application. To mitigate this influence, we propose a ship target detection method in spaceborne SAR imagery, using a hierarchical convolutional neural network (H-CNN). Based on the nature of ghost replicas and typical target classes, a two-stage CNN model is built to detect ship targets against sea clutter and the ghost. First, regions of interest (ROIs) were extracted from a large imaged scene during the coarse-detection stage. Unwanted ghost replicas represented major residual interference sources in ROIs, therefore, the other CNN process was executed during the fine-detection stage. Finally, comparative experiments and analyses, using Sentinel-1 SAR data and various assessment criteria, were conducted to validate H-CNN. Our results showed that the proposed method can outperform the conventional constant false-alarm rate technique and CNN-based models.Remote Sens. 2019, 11, 620 2 of 17 recognition [11]. In addition, the conventional constant false alarm rate (CFAR) technique is a typical detection method based on the grayscale feature. However, complicated and cluttered backgrounds severely affect CFAR detection performance [12].In recent years, ship target detection based on deep learning (DL) has been widely studied [13,14], using the typical model of convolutional neural network (CNN) [15]. Liu et al. [16] presented a ship detection method, namely sea-land segmentation-based convolutional neural network (SLS-CNN), which combines a SLS-CNN detector, saliency computation, and corner features. Furthermore, Zhao et al. [17] proposed a spaceborne SAR ship detection algorithm based on low complexity CNN. Some other well-known CNN-based target detection methods include faster region-CNN (Faster R-CNN), you only look once (YOLO) list model, etc. For example, Li et al. [18,19] improved detection performance using Faster R-CNN, to successfully provide a densely connected multi-scale neural network [19]. This method is used to solve multi-scale and multi-scene problems in SAR ship detection. Feature maps are fused by densely connecting different feature map layers, rather than information from single feature maps, which represent top-to-down feature map connections. The R-CNN method is used for target recognition in large scene SAR images [20]. Furthermore, Hamza and Cai used YOLOv2 for ship detection [21], which introduced a multitude of enhancements into the original YOLO model.However, these methods may be no longer effective when ghost replicas exist in an imaged scene. The ghost phenomenon is an intrinsic effect of SAR's ambiguity, both in azimuth and range [22,23]. Range ambiguity occurs when different backscattered echoes-one related to a transmitted pulse and the other due to a previous transmission-temporarily overlap during the receiving operation [24]. On the other hand, azimuth ambiguity is caused by the aliasing of each target's Doppler phase...