“…In detail, the pretrained backbones of the baseline methods [13], [16], [17], [18], [19], [20], [21], [22], [23], [24], [25] were selected and fine-tuned on the selected datasets. The training parameters were chosen based on the experimental data during fine-tuning of the baseline backbones, and the same procedure was adopted for the proposed model. Additionally, the same experimental protocol (including five-fold cross-validation) was used to compare the performance of the baseline methods and the proposed model.…”
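
As an illustration of what a shared five-fold protocol can look like in practice, the following is a minimal sketch, not the authors' implementation. The function names `k_fold_indices` and `compare_models`, the fixed seed, and the interface in which each model is a callable returning a score are all assumptions made for this example. The point it shows is that the folds are generated once and reused, so every baseline and the proposed model are scored on identical train/test partitions.

```python
import random


def k_fold_indices(n_samples, n_folds=5, seed=0):
    """Return a list of (train_idx, test_idx) pairs for k-fold cross-validation.

    Hypothetical helper: the fixed seed guarantees that repeated calls
    produce the same partition.
    """
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Deal the shuffled indices into n_folds nearly equal, disjoint folds.
    folds = [indices[i::n_folds] for i in range(n_folds)]
    splits = []
    for k in range(n_folds):
        test_idx = sorted(folds[k])
        train_idx = sorted(
            i for j, fold in enumerate(folds) if j != k for i in fold
        )
        splits.append((train_idx, test_idx))
    return splits


def compare_models(models, n_samples, n_folds=5, seed=0):
    """Score every model on the same folds and return the mean score per model.

    `models` maps a name to a callable(train_idx, test_idx) -> float
    (an assumed interface standing in for fine-tuning plus evaluation).
    """
    splits = k_fold_indices(n_samples, n_folds, seed)  # built once, shared by all
    results = {}
    for name, fit_and_score in models.items():
        scores = [fit_and_score(train, test) for train, test in splits]
        results[name] = sum(scores) / len(scores)
    return results
```

In such a setup, each entry in `models` would wrap the fine-tuning and evaluation of one backbone; because the splits are computed once inside `compare_models`, differences in the averaged scores cannot be attributed to differences in data partitioning.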