“…Deep Convolutional Neural Networks (DCNNs) are able to adaptively approximate the relationship between image information and land information through multi-layer transformations ( Zhu et al, 2017 ). Thus, compared with conventional land cover classification methods, deep models can accurately characterize complex contextual information contained in high-resolution images ( Tong et al, 2020 , Huang et al, 2018a , Zhang et al, 2019 , Srivastava et al, 2019 , Zhong et al, 2020 ). Although deep models have reported great superiorities in many remote sensing issues ( Zhu et al, 2017 , Ma et al, 2019 , Zhu et al, 2021 ), their performance strongly relies on the quality and quantity of training data ( LeCun et al, 2015 , Xia et al, 2017 , Ding et al, 2021 ), resulting in two main problems in applying them to real-world land cover mapping:…”