“…We employ five different metrics namely, the accuracy, precision, recall, F-measure, and area under the curve (AUC) to assess the efficiency of the proposed classification model and other compared benchmarks. These metrics are widely used in medical area and are formulated as follows [ 76 , 77 ]: where TP, FP, TN and FN given in Eqs (17) – (21) refer to True Positive, False Positive, True Negative and False Negative, respectively. Given a test dataset and a deep learning classification Algorithm, TP represents the proportion of positive (i.e., COVID-19) samples that are correctly labeled as COVID-19 by the classification algorithm; FP represents the proportion of negative (i.e., non-COVID-19) samples that are mislabeled as positive; TN is the proportion of negative samples that are correctly labeled as normal and FN is the proportion of positive samples that are mislabeled as negative by the classification algorithm.…”