“…Recently, semantic segmentation, a commonly used term in computer vision where each pixel within the input imagery is assigned to a particular class, has been a widely used technique in diverse earth-related applications [108]. Various architectures, such as fully convolutional networks (FCNs), SegNet [109], U-Net [110], and DeepLab V3+ [111], have been used successfully to delineate tree and vegetation species [70,98,100,101,103,105,106,[112][113][114][115][116][117][118][119][120][121][122][123][124], crops [51,57,58,102,125,126], wetlands [107,127], and weeds [61,99] from various remotely sensed data. For instance, Freudenberg et al [128] utilized U-Net architecture to detect oil and coconut palms from WorldView 2, 3 satellite images.…”