“…Compared to satellite-based instruments, ground-based remote sensing instruments, for example, whole-sky imager and total-sky imager (Long et al, 2006), could capture high-resolution ground-based cloud images, which provides new opportunities for monitoring and understanding regional sky conditions. Benefiting from these ground-based cloud images, considerable approaches are proposed to implement ground-based cloud classification task using hand-crafted features, such as texture, color, structure, and so on (Kazantzidis et al, 2012;Liu et al, 2011;Xiao et al, 2016;Zhuo et al, 2014). Recently, witnessing the success of deep learning in a variety of research fields (Gao et al, 2018;He et al, 2019;Labati et al, 2019;Milletari et al, 2016;Shi et al, 2016), numerous methods are proposed to learn robust and discriminative deep features for automatic ground-based cloud classification in the framework of convolutional neural network (CNN).…”